论文标题

Le-HGR:嵌入式AR设备的轻巧有效的基于RGB的在线手势识别网络

LE-HGR: A Lightweight and Efficient RGB-based Online Gesture Recognition Network for Embedded AR Devices

论文作者

Xie, Hongwei, Wang, Jiafang, Shao, Baitao, Gu, Jian, Li, Mingyang

论文摘要

在线手势识别(HGR)技术对于增强现实(AR)的应用至关重要,以实现自然的人向计算机互动和沟通。近年来,低成本AR设备的消费市场一直在迅速增长,而该领域的技术成熟度仍然有限​​。这些设备是低价,有限的内存和受资源约束的计算单元的典型特征,这使得在线HGR成为一个具有挑战性的问题。为了解决这个问题,我们提出了一个轻巧和计算高效的HGR框架,即le-HGR,以在具有低计算能力的嵌入式设备上实现实时识别。我们还表明,所提出的方法具有很高的精度和鲁棒性,能够在各种复杂的相互作用环境中达到高端性能。为了实现我们的目标,我们首先提出了一个级联的多任务卷积神经网络(CNN),以同时预测手部检测的概率,并在线回归手关键点位置。我们表明,通过拟议的级联体系结构设计,可以在很大程度上消除假阳性估计值。此外,引入了相关的映射方法,以通过预测位置跟踪手迹线,该位置解决了多方面的干扰。随后,我们提出了一个痕量序列神经网络(Traceseqnn),以通过利用跟踪轨迹的运动特征来识别手势。最后,我们提供了各种实验结果,以表明所提出的框架能够以显着降低的计算成本来实现最先进的准确性,这是在低成本商业设备(如移动设备和AR/VR耳机)中实现实时应用的关键属性。

Online hand gesture recognition (HGR) techniques are essential in augmented reality (AR) applications for enabling natural human-to-computer interaction and communication. In recent years, the consumer market for low-cost AR devices has been rapidly growing, while the technology maturity in this domain is still limited. Those devices are typical of low prices, limited memory, and resource-constrained computational units, which makes online HGR a challenging problem. To tackle this problem, we propose a lightweight and computationally efficient HGR framework, namely LE-HGR, to enable real-time gesture recognition on embedded devices with low computing power. We also show that the proposed method is of high accuracy and robustness, which is able to reach high-end performance in a variety of complicated interaction environments. To achieve our goal, we first propose a cascaded multi-task convolutional neural network (CNN) to simultaneously predict probabilities of hand detection and regress hand keypoint locations online. We show that, with the proposed cascaded architecture design, false-positive estimates can be largely eliminated. Additionally, an associated mapping approach is introduced to track the hand trace via the predicted locations, which addresses the interference of multi-handedness. Subsequently, we propose a trace sequence neural network (TraceSeqNN) to recognize the hand gesture by exploiting the motion features of the tracked trace. Finally, we provide a variety of experimental results to show that the proposed framework is able to achieve state-of-the-art accuracy with significantly reduced computational cost, which are the key properties for enabling real-time applications in low-cost commercial devices such as mobile devices and AR/VR headsets.

扫码加入交流群

加入微信交流群

微信交流群二维码

扫码加入学术交流群,获取更多资源