论文标题
您(视觉)思想的一分钱:大脑活动的自然观察自然电影的重建
A Penny for Your (visual) Thoughts: Self-Supervised Reconstruction of Natural Movies from Brain Activity
论文作者
论文摘要
从fMRI大脑记录中重建自然视频非常具有挑战性,这是两个主要原因:(i)由于fMRI数据获取很困难,我们只有有限的监督样本,这还不足以覆盖自然视频的巨大空间; (ii)fMRI记录的时间分辨率远低于自然视频的帧速率。在本文中,我们提出了一种自我监督的自然电影重建方法。通过对编码编码自然视频的编码使用循环一致性,我们可以:(i)利用培训视频的完整框架,而不仅限于与fMRI录音相对应的剪辑; (ii)利用受试者在fMRI机器内从未见过的大量外部自然视频。这些使得可以通过几个数量级来增加适用的培训数据,从而将自然视频先验引入解码网络以及时间连贯性。我们的方法大大优于竞争方法,因为这些方法仅在有限的监督数据上训练。我们进一步介绍了自然视频的新的简单暂时性先验,当将其折叠到我们的fMRI解码器中时,它允许我们以最高X8的X8的较高框架速率(HFR)重建视频。
Reconstructing natural videos from fMRI brain recordings is very challenging, for two main reasons: (i) As fMRI data acquisition is difficult, we only have a limited amount of supervised samples, which is not enough to cover the huge space of natural videos; and (ii) The temporal resolution of fMRI recordings is much lower than the frame rate of natural videos. In this paper, we propose a self-supervised approach for natural-movie reconstruction. By employing cycle-consistency over Encoding-Decoding natural videos, we can: (i) exploit the full framerate of the training videos, and not be limited only to clips that correspond to fMRI recordings; (ii) exploit massive amounts of external natural videos which the subjects never saw inside the fMRI machine. These enable increasing the applicable training data by several orders of magnitude, introducing natural video priors to the decoding network, as well as temporal coherence. Our approach significantly outperforms competing methods, since those train only on the limited supervised data. We further introduce a new and simple temporal prior of natural videos, which - when folded into our fMRI decoder further - allows us to reconstruct videos at a higher frame-rate (HFR) of up to x8 of the original fMRI sample rate.