论文标题

使用Metasurfaces光学器件进行有效深度学习的端到端框架

End-to-End Framework for Efficient Deep Learning Using Metasurfaces Optics

论文作者

Burgos, Carlos Mauricio Villegas, Yang, Tianqi, Vamivakas, Nick, Zhu, Yuhao

论文摘要

使用卷积神经网络(CNN)的深度学习已显示出明显超过许多传统视觉算法的表现。尽管努力通过算法和专门的硬件来提高CNN效率,但深度学习仍然很难在资源受限的环境中部署。在本文中,我们提出了一个端到端框架,以在自由空间中探索光学上的CNN,就像计算摄像头一样。与现有的基于自由空间光学的方法相比,这些方法仅限于处理单渠道(即灰度)输入,我们建议基于纳米级元表面光学器件的第一种通用方法,可以直接从自然场景中处理RGB数据。我们的系统达到了节能的数量级,简化了传感器的设计,同时牺牲了很少的网络精度。

Deep learning using Convolutional Neural Networks (CNNs) has been shown to significantly out-performed many conventional vision algorithms. Despite efforts to increase the CNN efficiency both algorithmically and with specialized hardware, deep learning remains difficult to deploy in resource-constrained environments. In this paper, we propose an end-to-end framework to explore optically compute the CNNs in free-space, much like a computational camera. Compared to existing free-space optics-based approaches which are limited to processing single-channel (i.e., grayscale) inputs, we propose the first general approach, based on nanoscale meta-surface optics, that can process RGB data directly from the natural scenes. Our system achieves up to an order of magnitude energy saving, simplifies the sensor design, all the while sacrificing little network accuracy.

扫码加入交流群

加入微信交流群

微信交流群二维码

扫码加入学术交流群,获取更多资源