论文标题
学习个人资料:用于几次学习的用户元元素网络
Learning to Profile: User Meta-Profile Network for Few-Shot Learning
论文作者
论文摘要
元学习方法在视觉和语言领域取得了巨大成功。但是,很少有研究讨论用于大规模工业应用的元学习实践。尽管电子商务公司已经花了很多精力来学习表征来提供更好的用户体验,但我们认为在此步骤中不能停止这种努力。除了学习强大的知名度外,关于如何有效地转移学习表示的挑战性问题也会同时提出。本文介绍了我们从三个方面解决这些挑战所做的贡献。 1)元学习模型:在使用电子商务用户行为数据的表示学习的背景下,我们提出了一个称为Meta-Profile网络的元学习框架,该框架扩展了匹配网络和关系网络的想法,以进行知识传输和快速适应; 2)编码策略:为了保留大规模长期顺序行为数据的高忠诚,我们提出了一个时间热图编码策略,该策略允许模型有效地编码数据; 3)深层网络体系结构:多模式模型与多任务学习体系结构相结合来解决跨域知识学习和标签问题不足。此外,我们认为,工业模型不仅应该在准确性方面具有良好的表现,而且在极端条件下具有更好的鲁棒性和不确定性性能。我们通过在各种极端情况下进行广泛的控制实验,即分布外检测,数据不足和类不平衡方案来评估模型的性能。与基线模型相比,元数据网络在模型性能方面显示出显着改善。
Meta-learning approaches have shown great success in vision and language domains. However, few studies discuss the practice of meta-learning for large-scale industrial applications. Although e-commerce companies have spent many efforts on learning representations to provide a better user experience, we argue that such efforts cannot be stopped at this step. In addition to learning a strong profile, the challenging question about how to effectively transfer the learned representation is raised simultaneously. This paper introduces the contributions that we made to address these challenges from three aspects. 1) Meta-learning model: In the context of representation learning with e-commerce user behavior data, we propose a meta-learning framework called the Meta-Profile Network, which extends the ideas of matching network and relation network for knowledge transfer and fast adaptation; 2) Encoding strategy: To keep high fidelity of large-scale long-term sequential behavior data, we propose a time-heatmap encoding strategy that allows the model to encode data effectively; 3) Deep network architecture: A multi-modal model combined with multi-task learning architecture is utilized to address the cross-domain knowledge learning and insufficient label problems. Moreover, we argue that an industrial model should not only have good performance in terms of accuracy, but also have better robustness and uncertainty performance under extreme conditions. We evaluate the performance of our model with extensive control experiments in various extreme scenarios, i.e. out-of-distribution detection, data insufficiency and class imbalance scenarios. The Meta-Profile Network shows significant improvement in the model performance when compared to baseline models.