论文标题
不明飞行物$^2 $:统一的框架,用于Omni superized对象检测
UFO$^2$: A Unified Framework towards Omni-supervised Object Detection
论文作者
论文摘要
现有的对象检测工作通常依赖于单一的注释:使用准确但昂贵的边界框或更便宜但表现力较低的图像级标签对模型进行训练。但是,现实世界的注释通常在形式上是多种多样的,这挑战了这些现有作品。在本文中,我们提出了UFO $^2 $,这是一个统一的对象检测框架,可以同时处理不同形式的监督。具体而言,UFO $^2 $结合了强大的监督(例如盒子),各种形式的部分监督(例如,类标签,点和涂鸦)以及未标记的数据。通过严格的评估,我们证明了每种形式的标签可用于从头开始训练模型或进一步改善预训练的模型。我们还使用UFO $^2 $来调查预算意识到的Omni审议学习,即在固定的注释预算下研究了各种注释策略:我们表明,竞争性绩效不需要所有数据的较强标签。最后,我们演示了UFO $^2 $的概括,检测了1,000多个不同的对象,而无需限制框注释。
Existing work on object detection often relies on a single form of annotation: the model is trained using either accurate yet costly bounding boxes or cheaper but less expressive image-level tags. However, real-world annotations are often diverse in form, which challenges these existing works. In this paper, we present UFO$^2$, a unified object detection framework that can handle different forms of supervision simultaneously. Specifically, UFO$^2$ incorporates strong supervision (e.g., boxes), various forms of partial supervision (e.g., class tags, points, and scribbles), and unlabeled data. Through rigorous evaluations, we demonstrate that each form of label can be utilized to either train a model from scratch or to further improve a pre-trained model. We also use UFO$^2$ to investigate budget-aware omni-supervised learning, i.e., various annotation policies are studied under a fixed annotation budget: we show that competitive performance needs no strong labels for all data. Finally, we demonstrate the generalization of UFO$^2$, detecting more than 1,000 different objects without bounding box annotations.