论文标题
自动图像内容提取:在大型视觉档案的人文摄影研究中操作机器学习
Automatic Image Content Extraction: Operationalizing Machine Learning in Humanistic Photographic Studies of Large Visual Archives
论文作者
论文摘要
将机器学习工具应用于数字化图像档案中,有可能彻底改变人文和社会科学视觉研究的定量研究。与传统上可能的相比,处理大量照片的能力并通过广泛的变量分析它们的能力将有助于更深入地了解材料。总体而言,这些更改将有助于将工作流程从简单的手动任务转移到更苛刻的阶段。 在本文中,我们介绍了自动图像内容提取(AICE)框架,以基于机器学习的搜索和大型图像档案的分析。我们通过重新设计和扩展传统的视觉内容分析方法,与当前和新兴的最新机器学习工具兼容,并涵盖自动内容分析的新型机器学习机会,从而开发了多学科研究项目的框架作为未来摄影研究的框架。所提出的框架可以应用于人文和社会科学的多个领域,并且可以将其调整并缩放到各种研究环境中。我们还提供有关不同机器学习技术现状的信息,并表明已经有各种可公开可用的方法适合广泛的视觉内容分析任务。
Applying machine learning tools to digitized image archives has a potential to revolutionize quantitative research of visual studies in humanities and social sciences. The ability to process a hundredfold greater number of photos than has been traditionally possible and to analyze them with an extensive set of variables will contribute to deeper insight into the material. Overall, these changes will help to shift the workflow from simple manual tasks to more demanding stages. In this paper, we introduce Automatic Image Content Extraction (AICE) framework for machine learning-based search and analysis of large image archives. We developed the framework in a multidisciplinary research project as framework for future photographic studies by reformulating and expanding the traditional visual content analysis methodologies to be compatible with the current and emerging state-of-the-art machine learning tools and to cover the novel machine learning opportunities for automatic content analysis. The proposed framework can be applied in several domains in humanities and social sciences, and it can be adjusted and scaled into various research settings. We also provide information on the current state of different machine learning techniques and show that there are already various publicly available methods that are suitable to a wide-scale of visual content analysis tasks.