论文标题
移动其他方式:探索单词移动距离扩展
Moving Other Way: Exploring Word Mover Distance Extensions
论文作者
论文摘要
“移动”一词的距离(WMD)是两个文本的流行语义相似性度量。该立场论文研究了WMD的几种可能扩展。我们将语料库中单词的频率作为加权因素和矢量空间的几何形状进行实验。我们在六个文档分类数据集上验证WMD可能的扩展。与WMD相比,一些提出的扩展在K-Nearest邻居分类错误方面显示出更好的结果。
The word mover's distance (WMD) is a popular semantic similarity metric for two texts. This position paper studies several possible extensions of WMD. We experiment with the frequency of words in the corpus as a weighting factor and the geometry of the word vector space. We validate possible extensions of WMD on six document classification datasets. Some proposed extensions show better results in terms of the k-nearest neighbor classification error than WMD.