Skip to content

IMOP-lab/TriFTM-Net

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

36 Commits
 
 
 
 
 
 
 
 
 
 

Repository files navigation

TriFTM-Net: Tri-Path Fourier-Temporal Modulation Network for macular edema Pathology Segmentation and Reconstruction in High-Precision Intraoperative Navigation

TriFTM-Net:TriFTM-Net: Tri-Path Fourier-Temporal Modulation Network for macular edema Pathology Segmentation and Reconstruction in High-Precision Intraoperative Navigation

Xingru Huang, Shuaibin Chen, Huawei Wang, Yaoqi Sun, Tangsen Huang, Jian Huang, Yihao Guo, Hong He, Minhong Sun, Jin Liu, Zhiwen Zheng, Xiaoshuai Zhang and Shaowei Jiang

Hangzhou Dianzi University IMOP-lab

Figure 1: Detailed network structure of the TriFTM-Net.

We propose TriFTM-Net, a 2D medical image segmentation network for high-precision intraoperative navigation that integrates spatial, frequency, and spatiotemporal domain information. TriFTM-Net achieves the best performance on key metrics compared to 13 previous methods on the OIMHS dataset.

We will first introduce our methods and principles, then describe the experimental environment, and finally present the experimental results.

Methods

TPSHE

Figure 2: Structure of the TPSHE.

TPSHE, as an innovative encoder integrating spatial, spectral, and time-frequency domains, is capable of capturing both global structural information, such as the choroid and retina, and localized features, including macular edema.

FRM

Figure 3: Structure of the FRM.

FRM is integrated between the encoder and decoder to optimize the capture and utilization of fine-grained information within deep feature maps. This approach suppresses irrelevant factors, mitigates noise interference, and enhances the network's capacity to represent detailed features, such as those associated with macular edema.

HFRM

Figure 4: Structure of the HFRM.

HFRM enables the model to accurately reconstruct the high-level features required for detail enhancement from the sampled features, avoiding the loss of detail information. By utilizing multi-scale convolution operations and integrating features of different levels, it significantly improves the model's performance. The DCB ensures the accuracy of the enhanced features.

Installation

Experiments were conducted under identical hardware and software environments: four servers, each equipped with two NVIDIA Geforce RTX 3080 10GB GPUs and 128GB of memory. The project was implemented using Python 3.9.0, PyTorch 1.13.1, and CUDA 11.7.64, with Distributed Data Parallel (DDP) training and evaluation. The optimizer used was AdamW with an initial learning rate set at 0.0001. Model weights were randomly initialized, the batch size was set to 4, and the training spanned 100 epochs.

Experiment

Compare with others on the OIMHS dataset

Figure 5: Comparison experiments between our method and 13 previous segmentation methods on the OIMHS dataset.

Figure 6: The visual results of our method compared to the existing 13 segmentation methods on the OIMHS dataset.

Ablation study

Key components of TriFTM-Net

Figure 7: Ablation experiments on key components of TriFTM-Net on the OIMHS dataset.

FDB introduces frequency domain information to suppress image noise. SDB incorporates spatiotemporal domain information to focus on multiple scales of the image. HRM exhibits strong generalization capability in focusing on deep feature maps. HFRM shows good results in restoring image details and feature re-encoding.

Ablation studies of HFRM

Figure 8: Ablation experiments on HFRM on the OIMHS dataset.

Adding HFRM after each upsampling not only helps to restore image details but also benefits the subsequent segmentation of large-scale information in the image.

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published