基于嵌套多尺度融合网络的遥感图像全色融合
CSTR:
作者:
作者单位:

同济大学 测绘与地理信息学院, 上海 200092

作者简介:

E-mail: wqm11111@126.com

通讯作者:

中图分类号:

基金项目:


Remote sensing image pansharpening based on a nested multi-scale fusion network
Author:
Affiliation:

College of Surveying and Geo-Informatics, Tongji University, Shanghai 200092, China

Fund Project:

Supported by the Fundamental Research Funds for the Central Universities (22120260026); the Excellent Young Scientists Fund of the National Natural Science Foundation of China (42222108); the General Program of the National Natural Science Foundation of China (42171345)

  • 摘要
  • |
  • 图/表
  • |
  • 访问统计
  • |
  • 参考文献
  • |
  • 相似文献
  • |
  • 引证文献
  • |
  • 资源附件
  • |
  • 文章评论
    摘要:

    全色融合通过融合多光谱数据与全色图像,生成全色图像空间分辨率下的多光谱图像。现有基于深度学习的全色融合方法难以兼顾局部高频细节提取与全局光谱一致性保真,进而使得融合后的影像在增强空间分辨率时易伴随光谱畸变,而在维持光谱特征时又容易导致空间细节的平滑。同时,全色与多光谱影像的跨分辨率差异会导致网络在融合过程中发生特征错位,进而引发局部边缘重影、色彩溢出等视觉失真现象。为此,本文提出了一种嵌套多尺度融合网络(NMSFusion)。该方法从宏观与微观层面进行协同建模。在微观层级,利用多尺度门控块(MSGB)提取全色图像的局部高频细节;在宏观层级,通过无伪影残差融合模块(ARFM)统筹全局语义,确保融合结果与原始多光谱影像的光谱一致性。此外,该方法引入了自适应坐标编码模块(ACEM),拟缓解不同分辨率特征错位问题。最后,针对传统深层网络在随机权重初始化下易破坏原始色彩分布,导致训练初期面临收敛困难与光谱畸变的问题,本文提出了一种即插即用的恒等初始化机制(IIM)。该机制通过约束网络各层的初始权重参数,强制模型在训练起点的输出为插值后的多光谱影像,从而为模型优化提供了合理的初始状态,有效促进了网络的稳定收敛。在三个遥感数据集上的实验表明,NMSFusion在量化指标与视觉评价上均优于9种主流的全色融合网络。同时,消融实验分析表明,各模块不仅协同提升了模型的融合性能,还保证了网络结构的高效与轻量化。

    Abstract:

    Pansharpening aims to fuse a coarse spatial resolution multispectral (MS) image with a fine spatial resolution panchromatic (PAN) image to generate a fused image with fine spatial and spectral resolution. Existing deep learning-based pansharpening methods still face difficulties in simultaneously extracting local high-frequency details and preserving global spectral consistency. Consequently, fused images are prone to spectral distortion when spatial details are enhanced. Moreover, the spatial resolution gap between PAN and MS images often causes feature misalignment during the fusion process, resulting in visual artifacts such as local edge ghosting and color distortion. To address these issues, this paper proposes a Nested Multi-Scale Fusion Network (NMSFusion), which performs synergistic modeling at both local and global levels. At the local level, a Multi-Scale Gated Block (MSGB) is utilized to extract high-frequency details from the PAN image. At the global level, an Artifact-Free Residual Fusion Module (ARFM) is designed to coordinate global semantics, ensuring spectral consistency between the fused images and the original MS images. Additionally, an Adaptive Coordinate Encoding Module (ACEM) is introduced to alleviate the feature misalignment caused by the resolution gap. Finally, conventional deep networks with random weight initialization tend to disrupt the original color distribution, leading to convergence difficulties and spectral distortion in the early training stages. To tackle this, we propose a plug-and-play Identity Initialization Mechanism (IIM). By constraining the initial weight of network layers, IIM forces the initial network output to equal the interpolated MS image, thereby providing a reasonable initial state for optimization and promoting stable convergence. Experimental results on three remote sensing datasets demonstrate that NMSFusion outperforms nine representative pansharpening methods in both quantitative metrics and visual assessments. Furthermore, ablation studies demonstrate that the proposed modules not only synergistically enhance the fusion performance but also guarantee an efficient and lightweight network architecture.

    参考文献
    相似文献
    引证文献
引用本文
分享
相关视频

文章指标
  • 点击次数:
  • 下载次数:
  • HTML阅读次数:
  • 引用次数:
历史
  • 收稿日期:2026-03-20
  • 最后修改日期:2026-04-22
  • 录用日期:2026-04-27
  • 在线发布日期: 2026-05-09
  • 出版日期:
文章二维码