论文标题

部分可观测时空混沌系统的无模型预测

Scale-Semantic Joint Decoupling Network for Image-text Retrieval in Remote Sensing

论文作者

Zheng, Chengyu, song, Ning, Zhang, Ruoyu, Huang, Lei, Wei, Zhiqiang, Nie, Jie

论文摘要

遥感中的图像文本检索旨在提供灵活的信息以进行数据分析和应用。近年来,最新的方法致力于``缩放脱钩''和``语义脱钩''策略,以进一步增强表示能力的能力。但是,这些先前的方法集中在分离量表或语义上,但忽略了联合模型中的这两个想法,这极大地限制了跨模式检索模型的性能。为了解决这些问题,我们提出了一个新颖的规模语义关节解耦网络(SSJDN),用于遥感图像文本检索。具体而言,我们设计了双向尺度解耦(BSD)模块,该模块可利用显着特征提取(SFE)和显着性引导的抑制(SGS)单位,以适应性地提取潜在特征并抑制双向模式中其他尺度上麻烦的特征,以产生不同的尺度线索。此外,我们通过利用类别的语义标签作为先验知识来监督图像和文本,探讨了重要的语义相关信息,我们设计了标签监视的语义解耦(LSD)模块。最后,我们设计了一个语义引导的三重损失(STL),该损失(STL)会自适应地生成一个常数以调整损耗函数以提高匹配相同的语义图像和文本的概率,并缩短检索模型的收敛时间。在四个基准遥感数据集进行的数值实验中,我们提出的SSJDN优于最先进的方法。

Image-text retrieval in remote sensing aims to provide flexible information for data analysis and application. In recent years, state-of-the-art methods are dedicated to ``scale decoupling'' and ``semantic decoupling'' strategies to further enhance the capability of representation. However, these previous approaches focus on either the disentangling scale or semantics but ignore merging these two ideas in a union model, which extremely limits the performance of cross-modal retrieval models. To address these issues, we propose a novel Scale-Semantic Joint Decoupling Network (SSJDN) for remote sensing image-text retrieval. Specifically, we design the Bidirectional Scale Decoupling (BSD) module, which exploits Salience Feature Extraction (SFE) and Salience-Guided Suppression (SGS) units to adaptively extract potential features and suppress cumbersome features at other scales in a bidirectional pattern to yield different scale clues. Besides, we design the Label-supervised Semantic Decoupling (LSD) module by leveraging the category semantic labels as prior knowledge to supervise images and texts probing significant semantic-related information. Finally, we design a Semantic-guided Triple Loss (STL), which adaptively generates a constant to adjust the loss function to improve the probability of matching the same semantic image and text and shorten the convergence time of the retrieval model. Our proposed SSJDN outperforms state-of-the-art approaches in numerical experiments conducted on four benchmark remote sensing datasets.

扫码加入交流群

加入微信交流群

微信交流群二维码

扫码加入学术交流群,获取更多资源