Paper title
Classification of descriptions and summary using multiple passes of statistical and natural language toolkits
Paper authors
Paper abstract
This document describes a possible approach for checking how relevant the summary or definition of an entity is to its name; in other words, the classifier performs a name relevance check. The percentage score obtained from this approach can be used on its own or to supplement scores from other metrics when arriving at a final classification. The dataset used to obtain an objective score is a list of package names and their respective summaries, sourced from pypi.org. Potential improvements are outlined at the end of the document.
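To make the idea of a percentage name-relevance score concrete, the following is a minimal sketch only. The abstract does not specify the scoring rule, so the functions `tokenize` and `name_relevance`, and the fuzzy token-matching rule they use, are assumptions made for illustration and are not the method described in the paper.

```python
import re
from difflib import SequenceMatcher


def tokenize(text: str) -> list[str]:
    """Lowercase the text and split on anything that is not a letter or digit."""
    return [t for t in re.split(r"[^a-z0-9]+", text.lower()) if t]


def name_relevance(name: str, summary: str) -> float:
    """Return a 0-100 score for how well `summary` reflects `name`.

    Hypothetical scoring rule (an assumption, not the paper's): each name
    token is credited with its best fuzzy-match ratio against any summary
    token, and the credits are averaged.
    """
    name_tokens = tokenize(name)
    summary_tokens = tokenize(summary)
    if not name_tokens or not summary_tokens:
        return 0.0
    best = [
        max(SequenceMatcher(None, n, s).ratio() for s in summary_tokens)
        for n in name_tokens
    ]
    return 100.0 * sum(best) / len(best)


if __name__ == "__main__":
    # Illustrative inputs only; these are not entries from the paper's dataset.
    print(name_relevance("json-parser", "A fast parser for JSON documents"))
    print(name_relevance("zzz", "A fast parser"))
```

A score like this could be thresholded on its own or combined with other metrics, as the abstract suggests; the character-level matching used here is only a stand-in for the statistical and natural language passes named in the title.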