论文标题
使用定向响度图对空间音频质量进行客观评估
Objective Assessment of Spatial Audio Quality using Directional Loudness Maps
论文作者
论文摘要
这项工作引入了从立体/双耳音频信号中提取的功能,目的是代表处理的空间听觉场景中感知质量降解的量度。特征提取技术基于一个简化的立体信号模型,该模型考虑使用振幅平移(AP)技术朝着给定方向定位的听觉事件。我们将立体声信号分解为一组方向信号,用于在短时傅立叶变换域中给定的AP值,并计算其整体响度以形成方向响度表示或地图。然后,我们比较参考信号的定向响度图和一个不断变化的版本,以得出旨在描述听力测试中报告的相关感知降解分数的失真度量。然后,在最新的听觉测试数据库上测试了该措施,并使用最先进的感知音频编解码器处理的立体信号使用非波形保护技术(例如带宽扩展和关节立体编码),以对现有质量预测器提出挑战,以提出挑战。结果表明,可以将派生的失真度量纳入为扩展到现有的自动化感知质量评估算法,以改善对空间编码的音频信号的预测。
This work introduces a feature extracted from stereophonic/binaural audio signals aiming to represent a measure of perceived quality degradation in processed spatial auditory scenes. The feature extraction technique is based on a simplified stereo signal model considering auditory events positioned towards a given direction in the stereo field using amplitude panning (AP) techniques. We decompose the stereo signal into a set of directional signals for given AP values in the Short-Time Fourier Transform domain and calculate their overall loudness to form a directional loudness representation or maps. Then, we compare directional loudness maps of a reference signal and a deteriorated version to derive a distortion measure aiming to describe the associated perceived degradation scores reported in listening tests. The measure is then tested on an extensive listening test database with stereo signals processed by state-of-the-art perceptual audio codecs using non waveform-preserving techniques such as bandwidth extension and joint stereo coding, known for presenting a challenge to existing quality predictors. Results suggest that the derived distortion measure can be incorporated as an extension to existing automated perceptual quality assessment algorithms for improving prediction on spatially coded audio signals.