论文标题
Cube有趣:新颖性,相关性,特殊性和惊喜
Cube Interestingness: Novelty, Relevance, Peculiarity and Surprise
论文作者
论文摘要
在本文中,我们讨论了在数据立方体环境中评估查询有趣性的方法。我们假设一个分层多维数据库,存储数据立方体和级别层次结构。我们从对人类行为和计算机科学研究领域的相关工作进行全面审查。我们将查询的兴趣定义为沿差异维度的分数的矢量,例如新颖性,相关性,惊喜和特殊性,并通过可用于评估这些趣味性的每个维度的信息的分类法进行补充。我们提供句法(独立于结果的)检查,以及以定量方式评估有趣性的不同维度的扩展(依赖结果)措施和算法。我们还报告了我们进行的用户研究的发现,分析了每个维度的重要性,随时间的演变以及研究参与者的行为。
In this paper, we discuss methods to assess the interestingness of a query in an environment of data cubes. We assume a hierarchical multidimensional database, storing data cubes and level hierarchies. We start with a comprehensive review of related work in the fields of studies of human behavior and computer science. We define the interestingness of a query as a vector of scores along difference dimensions, like novelty, relevance, surprise and peculiarity and complement this definition with a taxonomy of the information that can be used to assess each of these dimensions of interestingness. We provide both syntactic (result-independent) checks and extensional (result-dependent) measures and algorithms for assessing the different dimensions of interestingness in a quantitative fashion. We also report our findings on a user study that we conducted, analyzing the significance of each dimension, its evolution over time and the behavior of the study's participants.