论文标题

Semtui:表格数据的交互式语义丰富的框架

SemTUI: a Framework for the Interactive Semantic Enrichment of Tabular Data

论文作者

Ripamonti, Marco, De Paoli, Flavio, Palmonari, Matteo

论文摘要

数据集的大量可用性促进了\ acrshort {ml}和\ acrshort {ai}技术的使用,以收集洞察力,研究趋势并预测数据世界中看不见的行为。如今,收集和集成来自不同来源的数据主要是一项手动活动,需要在时间和金钱上以高成本的专家用户知识。因此,有必要使收集和链接来自许多不同来源的数据的过程,以使数据集准备好执行所需的分析。在这项工作中,我们提出了一个名为Semtui的综合框架的开发,以通过使用语义使富集过程灵活,完整和有效。这种方法是促进外部服务快速整合以执行诸如和解与扩展等丰富任务;并为用户提供图形接口以支持其他任务,例如改进以纠正自动富集算法提供的模棱两可的结果。事实证明,由任务驱动的用户评估是可以理解的,可用的,并且能够在涉及具有不同技能和经验的人的用户测试中少花费和时间来实现桌面丰富。

The large availability of datasets fosters the use of \acrshort{ml} and \acrshort{ai} technologies to gather insights, study trends, and predict unseen behaviours out of the world of data. Today, gathering and integrating data from different sources is mainly a manual activity that requires the knowledge of expert users at an high cost in terms of both time and money. It is, therefore, necessary to make the process of gathering and linking data from many different sources affordable to make datasets ready to perform the desired analysis. In this work, we propose the development of a comprehensive framework, named SemTUI, to make the enrichment process flexible, complete, and effective through the use of semantics. The approach is to promote fast integration of external services to perform enrichment tasks such as reconciliation and extension; and to provide users with a graphical interface to support additional tasks, such as refinement to correct ambiguous results provided by automatic enrichment algorithms. A task-driven user evaluation proved SemTUI to be understandable, usable, and capable of achieving table enrichment with little effort and time with user tests that involved people with different skills and experiences.

扫码加入交流群

加入微信交流群

微信交流群二维码

扫码加入学术交流群,获取更多资源