非线性基于微观表达识别的主动成像改善原点

论文标题

非线性基于微观表达识别的主动成像改善原点

Non-Linearities Improve OrigiNet based on Active Imaging for Micro Expression Recognition

论文作者

Verma, Monu, Vipparthi, Santosh Kumar, Singh, Girdhari

论文摘要

微观表达识别（MER）是一项非常具有挑战性的任务，因为该表达在性质上非常短，并且需要与空间和时间动力学的参与进行特征建模。现有的MER系统利用CNN网络发现次要肌肉运动和微妙变化的重要特征。但是，现有网络无法在面部外观的空间特征与面部动力的时间变化之间建立关系。因此，这些网络无法有效捕获表达区域的微小变化和微妙的变化。为了解决这些问题，我们介绍了一个主动的成像概念，以将视频的表达区域的主动变化隔离到单个框架中，同时保留面部外观信息。此外，我们提出了一个浅的CNN网络：基于本地接收场的增强学习网络（Originet），该网络有效地学习了视频中微表达的重要特征。在本文中，我们提出了一个新的精制整流线性单元（RRELU），该单元克服了消失的梯度和垂死的问题。与现有激活函数相比，RRELU扩展了衍生物的范围。 RRELU不仅注入非线性，而且还通过施加加法和乘法属性来捕获真实的边缘。此外，我们提出了一个增强的功能学习块，以通过嵌入两个平行的完全连接层来提高网络的学习能力。通过在四个综合ME数据集上进行一项主题实验来评估拟议的Originet的性能。实验结果表明，原点的表现优于较少计算复杂性的最先进技术。

Micro expression recognition (MER)is a very challenging task as the expression lives very short in nature and demands feature modeling with the involvement of both spatial and temporal dynamics. Existing MER systems exploit CNN networks to spot the significant features of minor muscle movements and subtle changes. However, existing networks fail to establish a relationship between spatial features of facial appearance and temporal variations of facial dynamics. Thus, these networks were not able to effectively capture minute variations and subtle changes in expressive regions. To address these issues, we introduce an active imaging concept to segregate active changes in expressive regions of a video into a single frame while preserving facial appearance information. Moreover, we propose a shallow CNN network: hybrid local receptive field based augmented learning network (OrigiNet) that efficiently learns significant features of the micro-expressions in a video. In this paper, we propose a new refined rectified linear unit (RReLU), which overcome the problem of vanishing gradient and dying ReLU. RReLU extends the range of derivatives as compared to existing activation functions. The RReLU not only injects a nonlinearity but also captures the true edges by imposing additive and multiplicative property. Furthermore, we present an augmented feature learning block to improve the learning capabilities of the network by embedding two parallel fully connected layers. The performance of proposed OrigiNet is evaluated by conducting leave one subject out experiments on four comprehensive ME datasets. The experimental results demonstrate that OrigiNet outperformed state-of-the-art techniques with less computational complexity.

下载PDF全文

下载文献需遵守相关版权规定

论文标题