论文标题

在关系数据库中支持关键字查询中的模式参考

Supporting Schema References in Keyword Queries over Relational Databases

论文作者

Martins, Paulo, da Silva, Altigran, Cavalcanti, João, de Moura, Edleno

论文摘要

关系关键字搜索(R-KWS)系统使NAIVE/非正式用户能够从关系数据库中探索和检索信息,而无需了解模式详细信息或查询语言。这些系统从输入查询中获取关键字,找到与这些关键字相对应的目标数据库的元素,并寻找使用有关参考完整性约束的信息,即键/外键对来“连接”这些元素的方法。尽管文献中已经提出了几个这样的系统,但其中大多数仅支持其关键字的查询是指目标数据库的内容,并且很少有支持查询,其中关键字是指数据库架构的元素。本文提出了Lathe,这是一种旨在支持此类查询的新型R-KWS。为此,在我们的工作中,我们首先概括了众所周知的查询匹配概念(QMS)和候选网络(CJNS),以处理引用架构元素的关键字,并提出新算法来生成它们。然后,我们引入了一种方法,以自动选择CJN,这些CJN在发出关键字查询时更有可能表示用户意图。这种方法包括两项主要创新:一种用于选择更好的QM的排名算法,产生较少但更好的CJN的产生,以及急切的评估策略,用于修剪无用的无用CJN。我们提供了一组全面的实验集,该实验与先前在最新的R-KWS系统和方法的实验中使用的查询集和数据集执行。我们的结果表明,即使对于具有复杂模式的大型数据库,车床可以处理更广泛的关键字查询,同时保持高效。

Relational Keyword Search (R-KwS) systems enable naive/informal users to explore and retrieve information from relational databases without knowing schema details or query languages. These systems take the keywords from the input query, locate the elements of the target database that correspond to these keywords, and look for ways to "connect" these elements using information on referential integrity constraints, i.e., key/foreign key pairs. Although several such systems have been proposed in the literature, most of them only support queries whose keywords refer to the contents of the target database and just very few support queries in which keywords refer to elements of the database schema. This paper proposes LATHE, a novel R-KwS designed to support such queries. To this end, in our work, we first generalize the well-known concepts of Query Matches (QMs) and Candidate Joining Networks (CJNs) to handle keywords referring to schema elements and propose new algorithms to generate them. Then, we introduce an approach to automatically select the CJNs that are more likely to represent the user intent when issuing a keyword query. This approach includes two major innovations: a ranking algorithm for selecting better QMs, yielding the generation of fewer but better CJNs, and an eager evaluation strategy for pruning void useless CJNs. We present a comprehensive set of experiments performed with query sets and datasets previously used in experiments with state-of-the-art R-KwS systems and methods. Our results indicate that LATHE can handle a wider variety of keyword queries while remaining highly effective, even for large databases with intricate schemas.

扫码加入交流群

加入微信交流群

微信交流群二维码

扫码加入学术交流群,获取更多资源