论文标题
尼尔:音频注释的开源工具
NEAL: An open-source tool for audio annotation
论文作者
论文摘要
被动声监测在生态学,生物多样性和保护研究中广泛使用。通过声学监控收集的数据集通常非常大,并且可以使用人工智能和机器学习模型自动处理,旨在复制领域专家的工作。这些模型是受监督的学习算法,需要对专家生产的高质量注释进行培训。由于专家通常是资源限制的,因此需要一个具有成本效益的音频的过程来从数据中获得最大使用。我们提出了一个开源交互式音频数据注释工具NEAL(Nature+Energy Audio Labeller)。该工具使用R和相关的闪亮框架构建,提供了一个反应性环境,用户可以快速注释音频文件并调整自动更改用户界面相应元素的设置。该应用程序的设计目的是让专家观鸟者和公民科学家为声学注释项目做出贡献。 R编程在生物声学中的普及和灵活性意味着可以为其他鸟类标签数据集修改闪亮的应用程序,甚至可以修改为通用音频标签任务。我们通过标记从爱尔兰的风电场收集的数据来演示该应用程序。
Passive acoustic monitoring is used widely in ecology, biodiversity, and conservation studies. Data sets collected via acoustic monitoring are often extremely large and built to be processed automatically using Artificial Intelligence and Machine learning models, which aim to replicate the work of domain experts. These models, being supervised learning algorithms, need to be trained on high quality annotations produced by experts. Since the experts are often resource-limited, a cost-effective process for annotating audio is needed to get maximal use out of the data. We present an open-source interactive audio data annotation tool, NEAL (Nature+Energy Audio Labeller). Built using R and the associated Shiny framework, the tool provides a reactive environment where users can quickly annotate audio files and adjust settings that automatically change the corresponding elements of the user interface. The app has been designed with the goal of having both expert birders and citizen scientists contribute to acoustic annotation projects. The popularity and flexibility of R programming in bioacoustics means that the Shiny app can be modified for other bird labelling data sets, or even to generic audio labelling tasks. We demonstrate the app by labelling data collected from wind farm sites across Ireland.