ISSN 1004-4140
CN 11-3017/P
樊雪林, 文昱齐, 乔志伟. 基于Transformer增强型U-net的CT图像稀疏重建与伪影抑制[J]. CT理论与应用研究(中英文), 2024, 33(1): 1-12. DOI: 10.15953/j.ctta.2023.183.
引用本文: 樊雪林, 文昱齐, 乔志伟. 基于Transformer增强型U-net的CT图像稀疏重建与伪影抑制[J]. CT理论与应用研究(中英文), 2024, 33(1): 1-12. DOI: 10.15953/j.ctta.2023.183.
FAN X L, WEN Y Q, QIAO Z W. Sparse Reconstruction of Computed Tomography Images with Transformer Enhanced U-net[J]. CT Theory and Applications, 2024, 33(1): 1-12. DOI: 10.15953/j.ctta.2023.183. (in Chinese).
Citation: FAN X L, WEN Y Q, QIAO Z W. Sparse Reconstruction of Computed Tomography Images with Transformer Enhanced U-net[J]. CT Theory and Applications, 2024, 33(1): 1-12. DOI: 10.15953/j.ctta.2023.183. (in Chinese).

基于Transformer增强型U-net的CT图像稀疏重建与伪影抑制

Sparse Reconstruction of Computed Tomography Images with Transformer Enhanced U-net

  • 摘要: 实现低剂量计算机断层成像(CT)的一个有效办法是减少投影角度,但投影角度较少会产生严重的条状伪影,降低图像的临床使用价值。针对该问题,提出一种耦合卷积神经网络(CNN)和多种注意力机制的U型网络(TE-unet)。首先采用U型架构提取多尺度特征信息;其次提出一个包含CNN和多种注意力的模块提取图像特征;最后在跳跃连接处加入Transformer块过滤信息,抑制不相关特征,突出重要特征。所提网络结合CNN的局部特征提取能力和Transformer的全局信息捕获能力,辅以多种注意力机制,实现了良好的去条状伪影能力。在60个投影角度下,与经典的Uformer网络相比,峰值信噪比(PSNR)高出0.3178 dB,结构相似度(SSIM)高出0.002,均方根误差(RMSE)降低0.0005。实验结果表明,所提TE-unet重建的图像精度更高,图像细节保留的更好,可以更好地压制条状伪影。

     

    Abstract: An effective way to achieve low-dose computed tomography (CT) is to reduce the projection angle while maintaining the same radiation dose at each angle. However, a fewer projection angle can result in severe strip artifacts, reducing the practicality and clinical value of the image. To address this issue, a U-shaped network (Transformer Enhanced U-net, TE-unet) coupled with convolutional neural network (CNN) and multiple attention mechanisms was proposed. Firstly, a U-shaped architecture was adopted to fuse multi-scale feature information; Secondly, a module that includes CNN and multiple types of attention was proposed to extract image features; Finally, transformer blocks were added at skip connections to filter information, suppress irrelevant features, and highlight important features. This network combines the local feature extraction ability of CNN and the global information capture ability of Transformer, supplemented by various attention mechanisms, to achieve good ability to remove stripe artifacts. At 60 projection angles, compared to the classic uformer network, peak signal to noise ratio (PSNR) is 0.3178 dB higher, Structural Similarity (SSIM) is 0.002 higher, and Root Mean Square Error (RMSE) is 0.0005 lower. The experimental results show that the proposed TE-unet network reconstructs images with higher accuracy, preserves better image details, and can better suppress strip artifacts.

     

/

返回文章
返回