通过推荐系统中的公制学习来检测专业的恶意用户

论文标题

通过推荐系统中的公制学习来检测专业的恶意用户

Detect Professional Malicious User with Metric Learning in Recommender Systems

论文作者

Xu, Yuanbo, Yang, Yongjian, Wang, En, Zhuang, Fuzhen, Xiong, Hui

论文摘要

在电子商务中，在线零售商通常患有专业恶意用户（PMU），他们利用负面评论和对其消费产品的评级较低，以威胁零售商的非法利润。具体而言，PMU检测有三个挑战：1）专业恶意使用者不会进行任何异常或非法互动（他们从不同时留下太多的负面评论和低评级），并且他们采取了掩盖自己的策略来伪装自己。因此，传统的离群检测方法被其掩盖策略所混淆。 2）PMU检测模型应同时考虑评分和审查，这使PMU检测成为多模式问题。 3）没有针对公共专业恶意用户的标签的数据集，这使得PMU检测成为无监督的学习问题。为此，我们提出了一种无监督的多模式学习模型：MMD，该模型使用评级和评论的专业恶意用户检测到指标学习。 MMD首先利用修改后的RNN将信息审查投射到情感评分中，该评分共同考虑了评分和审查。然后，提出了专业的恶意用户分析（MUP）来捕捉情感分数和评分之间的情感差距。 MUP过滤用户并构建候选PMU集。我们应用基于公制的学习聚类来学习适当的度量矩阵以进行PMU检测。最后，我们可以利用此指标和标记的用户来检测PMU。具体而言，我们将注意力机制应用于公制学习中，以提高模型的性能。四个数据集中的广泛实验表明，我们提出的方法可以解决此无监督的检测问题。此外，通过将MMD作为预处理阶段来增强最先进的建议模型的性能。

In e-commerce, online retailers are usually suffering from professional malicious users (PMUs), who utilize negative reviews and low ratings to their consumed products on purpose to threaten the retailers for illegal profits. Specifically, there are three challenges for PMU detection: 1) professional malicious users do not conduct any abnormal or illegal interactions (they never concurrently leave too many negative reviews and low ratings at the same time), and they conduct masking strategies to disguise themselves. Therefore, conventional outlier detection methods are confused by their masking strategies. 2) the PMU detection model should take both ratings and reviews into consideration, which makes PMU detection a multi-modal problem. 3) there are no datasets with labels for professional malicious users in public, which makes PMU detection an unsupervised learning problem. To this end, we propose an unsupervised multi-modal learning model: MMD, which employs Metric learning for professional Malicious users Detection with both ratings and reviews. MMD first utilizes a modified RNN to project the informational review into a sentiment score, which jointly considers the ratings and reviews. Then professional malicious user profiling (MUP) is proposed to catch the sentiment gap between sentiment scores and ratings. MUP filters the users and builds a candidate PMU set. We apply a metric learning-based clustering to learn a proper metric matrix for PMU detection. Finally, we can utilize this metric and labeled users to detect PMUs. Specifically, we apply the attention mechanism in metric learning to improve the model's performance. The extensive experiments in four datasets demonstrate that our proposed method can solve this unsupervised detection problem. Moreover, the performance of the state-of-the-art recommender models is enhanced by taking MMD as a preprocessing stage.

下载PDF全文

下载文献需遵守相关版权规定

论文标题