论文标题

基于复发阶段与深神经网络解开的相位重建

Phase reconstruction based on recurrent phase unwrapping with deep neural networks

论文作者

Masuyama, Yoshiki, Yatabe, Kohei, Koizumi, Yuma, Oikawa, Yasuhiro, Harada, Noboru

论文摘要

从给定的振幅光谱图估算相位的相位重建是声学信号处理中的一个主动研究字段,其中包括音频合成在内的许多应用。为了利用来自数据的丰富知识,几项研究提出了基于相位重建方法的深神经网络(DNN)。但是,对相位重建的DNN训练并不是一件容易的事,因为相位对波形的变化很敏感。为了克服这个问题,我们提出了一种基于DNN的两阶段重建方法。在提出的方法中,DNNS估计相位衍生物而不是相位本身,这使我们能够避免敏感性问题。然后,基于估计的衍生物递归估计相位,该衍生物被称为经常性相位解析(RPU)。实验结果证实,所提出的方法的表现优于DNN的直接相估计。

Phase reconstruction, which estimates phase from a given amplitude spectrogram, is an active research field in acoustical signal processing with many applications including audio synthesis. To take advantage of rich knowledge from data, several studies presented deep neural network (DNN)--based phase reconstruction methods. However, the training of a DNN for phase reconstruction is not an easy task because phase is sensitive to the shift of a waveform. To overcome this problem, we propose a DNN-based two-stage phase reconstruction method. In the proposed method, DNNs estimate phase derivatives instead of phase itself, which allows us to avoid the sensitivity problem. Then, phase is recursively estimated based on the estimated derivatives, which is named recurrent phase unwrapping (RPU). The experimental results confirm that the proposed method outperformed the direct phase estimation by a DNN.

扫码加入交流群

加入微信交流群

微信交流群二维码

扫码加入学术交流群,获取更多资源