论文标题
多样化相关短语
Diversifying Relevant Phrases
论文作者
论文摘要
给定的着陆页或与不同文档的匹配查询的多种关键字建议是在线广告中的一个活跃研究领域。现代搜索引擎为广告商提供了诸如动态搜索广告和智能广告系列之类的产品,它们可以从广告客户的产品清单中提取有意义的关键字/短语。这些关键字/短语代表了广告商的利益。在本文中,我们解决了为任何给定文档获得相关但多样化的关键字/短语的问题。我们将其作为优化问题提出,从而最大程度地限制了多样性和相关性之间的参数折衷,这些权衡受到了可能的关键字/短语数量的限制。我们表明这是一个组合NP-HARD优化问题。我们提出了两种基于凸松弛的方法,其复杂性和性能各不相同。在第一种方法中,我们表明优化问题减少了特征值问题。在第二种方法中,我们表明优化问题可以减少在L1球上最小化二次形式的形式。随后,我们表明这等同于半定义优化问题。为了证明我们提出的配方的功效,我们将其评估在各种现实世界数据集上,并将其与最新的启发式方法进行比较。
Diverse keyword suggestions for a given landing page or matching queries to diverse documents is an active research area in online advertising. Modern search engines provide advertisers with products like Dynamic Search Ads and Smart Campaigns where they extract meaningful keywords/phrases from the advertiser's product inventory. These keywords/phrases are representative of a diverse spectrum of advertiser's interests. In this paper, we address the problem of obtaining relevant yet diverse keywords/phrases for any given document. We formulate this as an optimization problem, maximizing the parameterized trade-off between diversity and relevance constrained over number of possible keywords/phrases. We show that this is a combinatorial NP-hard optimization problem. We propose two approaches based on convex relaxations varying in complexity and performance. In the first approach, we show that the optimization problem reduces to an eigen value problem. In the second approach, we show that the optimization problem reduces to minimizing a quadratic form over an l1-ball. Subsequently, we show that this is equivalent to a semi-definite optimization problem. To prove the efficacy of our proposed formulation, we evaluate it on various real-world datasets and compare it to the state-of-the-art heuristic approaches.