通过训练有素的跨表面编码器在硬件中的实时高光谱成像

论文标题

通过训练有素的跨表面编码器在硬件中的实时高光谱成像

Real-time Hyperspectral Imaging in Hardware via Trained Metasurface Encoders

论文作者

Makarenko, Maksim, Burguete-Lopez, Arturo, Wang, Qizhou, Getman, Fedor, Giancola, Silvio, Ghanem, Bernard, Fratalocchi, Andrea

论文摘要

高光谱成像引起了极大的关注，以识别计算机视觉中图像分类和自动模式识别的光谱特征。快照高光谱成像的最先进的实现取决于笨重，非集成和昂贵的光学元素，包括镜头，光谱仪和滤镜。这些宏观组件不允许快速数据处理，例如实时和高分辨率视频。这项工作介绍了Hyplex，这是一种新的集成体系结构，涉及上述局限性。 Hyplex是CMOS兼容，快速的高光谱摄像头，用纳米级的跨曲面代替了通过人工智能设计的纳米级元素。 Hyplex不需要光谱仪，而是利用常规的单色摄像机，以廉价成本为实时和高分辨率高光谱成像开辟了可能性。 Hyplex利用了模型驱动的优化，该优化将物理元信息层与基于端到端训练的现代视觉计算方法联系起来。我们设计和实施了Hyplex的原型版本，并将其性能与典型成像任务（例如光谱重建和语义分割）的最新成像进行了比较。在所有基准测试中，Hyplex报告了最小的重建误差。据我们所知，我们还介绍了最大的公开标记的高光谱数据集，用于语义细分。

Hyperspectral imaging has attracted significant attention to identify spectral signatures for image classification and automated pattern recognition in computer vision. State-of-the-art implementations of snapshot hyperspectral imaging rely on bulky, non-integrated, and expensive optical elements, including lenses, spectrometers, and filters. These macroscopic components do not allow fast data processing for, e.g real-time and high-resolution videos. This work introduces Hyplex, a new integrated architecture addressing the limitations discussed above. Hyplex is a CMOS-compatible, fast hyperspectral camera that replaces bulk optics with nanoscale metasurfaces inversely designed through artificial intelligence. Hyplex does not require spectrometers but makes use of conventional monochrome cameras, opening up the possibility for real-time and high-resolution hyperspectral imaging at inexpensive costs. Hyplex exploits a model-driven optimization, which connects the physical metasurfaces layer with modern visual computing approaches based on end-to-end training. We design and implement a prototype version of Hyplex and compare its performance against the state-of-the-art for typical imaging tasks such as spectral reconstruction and semantic segmentation. In all benchmarks, Hyplex reports the smallest reconstruction error. We additionally present what is, to the best of our knowledge, the largest publicly available labeled hyperspectral dataset for semantic segmentation.

下载PDF全文

下载文献需遵守相关版权规定

论文标题