BDMFuse: Multi-scale network fusion for infrared and visible images based on base and detail features
Affiliation:

1. College of Information and Management Science, Henan Agricultural University, Zhengzhou 450046, China; 2. NOVA Information Management School, Universidade Nova de Lisboa, Lisboa 1070-312, Portugal

CLC number:

TP391.4

Fund Project:

Supported by the Henan Province Key Research and Development Project (231111211300), the Central Government of Henan Province Guides Local Science and Technology Development Funds (Z20231811005), Henan Province Key Research and Development Project (231111110100), Henan Provincial Outstanding Foreign Scientist Studio (GZS2024006), and Henan Provincial Joint Fund for Scientific and Technological Research and Development Plan (Application and Overcoming Technical Barriers) (242103810028).

    Abstract:

    A fused infrared and visible image should highlight the salient targets of the infrared image while preserving the texture details of the visible image. To meet these requirements, an autoencoder-based method for infrared and visible image fusion is proposed. Following the optimization objective, the encoder is built from a base encoder and a detail encoder, which extract the low-frequency and high-frequency information of the image. Because this extraction may miss some information, a compensation encoder is proposed to supplement it. Multi-scale decomposition is also employed to extract image features more comprehensively. The decoder sums the low-frequency, high-frequency, and supplementary information to obtain multi-scale features. An attention strategy and a Fusion module are then introduced to perform multi-scale fusion and reconstruct the image. Experimental results on three datasets show that the fused images generated by the network effectively retain salient targets and are more consistent with human visual perception.
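    The base/detail idea in the abstract can be illustrated with a classical, hand-crafted analogue. The sketch below is not the BDMFuse network: it swaps the learned base and detail encoders for a simple mean filter and its residual, has no compensation encoder, no multi-scale decomposition, and no attention. The function names (`box_blur`, `decompose`, `fuse`) and the fusion rules (average the base layers, keep the larger-magnitude detail) are assumptions chosen only to show how low-frequency and high-frequency information can be separated and then summed back together.

```python
# Illustrative only: a hand-crafted base/detail split, NOT the learned
# BDMFuse encoders. All function names here are hypothetical.

def box_blur(img, radius=1):
    """Mean filter with edge clamping; returns the low-frequency (base) layer."""
    h, w = len(img), len(img[0])
    out = [[0.0] * w for _ in range(h)]
    for y in range(h):
        for x in range(w):
            total, count = 0.0, 0
            for dy in range(-radius, radius + 1):
                for dx in range(-radius, radius + 1):
                    yy = min(max(y + dy, 0), h - 1)  # clamp row index
                    xx = min(max(x + dx, 0), w - 1)  # clamp column index
                    total += img[yy][xx]
                    count += 1
            out[y][x] = total / count
    return out


def decompose(img, radius=1):
    """Split an image into base (blurred) and detail (residual) layers."""
    base = box_blur(img, radius)
    detail = [[p - b for p, b in zip(row, brow)] for row, brow in zip(img, base)]
    return base, detail


def fuse(ir, vis, radius=1):
    """Average the base layers, keep the stronger detail response, then sum."""
    ir_base, ir_detail = decompose(ir, radius)
    vis_base, vis_detail = decompose(vis, radius)
    fused = []
    for y in range(len(ir)):
        row = []
        for x in range(len(ir[0])):
            base = 0.5 * (ir_base[y][x] + vis_base[y][x])
            d_ir, d_vis = ir_detail[y][x], vis_detail[y][x]
            detail = d_ir if abs(d_ir) >= abs(d_vis) else d_vis
            row.append(base + detail)
        fused.append(row)
    return fused


# Toy inputs (made up for illustration): a bright infrared target and a
# textured visible image of the same size.
ir = [[0.0, 0.0, 0.0], [0.0, 9.0, 0.0], [0.0, 0.0, 0.0]]
vis = [[1.0, 2.0, 1.0], [2.0, 1.0, 2.0], [1.0, 2.0, 1.0]]
fused = fuse(ir, vis)
```

    Because the detail layer is defined as the residual of the base layer, summing the two reproduces the input. A learned decomposition does not have this guarantee, which is presumably why the abstract adds a compensation branch.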

Cite this article

SI Hai-Ping, ZHAO Wen-Rui, LI Ting-Ting, LI Fei-Tao, FERNANDO Bacao, SUN Chang-Xia, LI Yan-Ling. BDMFuse: Multi-scale network fusion for infrared and visible images based on base and detail features[J]. J. Infrared Millim. Waves, 2025, 44(2): 275~284.

History
  • Received: 2024-07-25
  • Last revised: 2025-02-09
  • Accepted: 2024-09-13
  • Published online: 2025-02-08
  • Publication date: 2025-04-25