论文标题

学会denoise RAW移动UI布局以大规模改进数据集

Learning to Denoise Raw Mobile UI Layouts for Improving Datasets at Scale

论文作者

Li, Gang, Baechler, Gilles, Tragut, Manuel, Li, Yang

论文摘要

移动屏幕的布局是UI设计研究和对屏幕语义理解的关键数据源。但是,现有数据集中的UI布局通常是嘈杂的,其视觉表示不匹配,或者由难以分析和模型的通用或特定类型组成。在本文中,我们提出了使用深度学习方法来确定UI布局的粘土管道,从而使我们可以自动改进现有的移动UI布局数据集。我们的管道同时采用屏幕截图和RAW UI布局,并通过删除不正确的节点并将语义上有意义的类型分配给每个节点来注释RAW布局。为了尝试我们的数据清洁管道,我们根据屏幕截图和RICO的RICO(公共移动UI语料库RICO)创建了59,555个人类通知的屏幕布局的粘土数据集。我们的深层模型可实现高精度,F1得分为82.7%,用于检测没有有效的视觉表示的布局对象,而识别对象类型的布局对象则达到了85.9%,这显着胜过启发式基线。我们的工作为创建大规模的高质量UI布局数据集奠定了基础,用于数据驱动的移动UI研究,并减少了非常昂贵的手动标签工作的需求。

The layout of a mobile screen is a critical data source for UI design research and semantic understanding of the screen. However, UI layouts in existing datasets are often noisy, have mismatches with their visual representation, or consists of generic or app-specific types that are difficult to analyze and model. In this paper, we propose the CLAY pipeline that uses a deep learning approach for denoising UI layouts, allowing us to automatically improve existing mobile UI layout datasets at scale. Our pipeline takes both the screenshot and the raw UI layout, and annotates the raw layout by removing incorrect nodes and assigning a semantically meaningful type to each node. To experiment with our data-cleaning pipeline, we create the CLAY dataset of 59,555 human-annotated screen layouts, based on screenshots and raw layouts from Rico, a public mobile UI corpus. Our deep models achieve high accuracy with F1 scores of 82.7% for detecting layout objects that do not have a valid visual representation and 85.9% for recognizing object types, which significantly outperforms a heuristic baseline. Our work lays a foundation for creating large-scale high quality UI layout datasets for data-driven mobile UI research and reduces the need of manual labeling efforts that are prohibitively expensive.

扫码加入交流群

加入微信交流群

微信交流群二维码

扫码加入学术交流群,获取更多资源