ACC-UNet

A Completely Convolutional UNet model for the 2020s

This repository is the official PyTorch implementation of ACC-UNet: A Completely Convolutional UNet model for the 2020s.

Citation

Ibtehaz, N., Kihara, D. (2023). ACC-UNet: A Completely Convolutional UNet Model for the 2020s. In: Greenspan, H., et al. Medical Image Computing and Computer Assisted Intervention – MICCAI 2023. MICCAI 2023. Lecture Notes in Computer Science, vol 14222. Springer, Cham. https://doi.org/10.1007/978-3-031-43898-1_66

@InProceedings{10.1007/978-3-031-43898-1_66,
author="Ibtehaz, Nabil and Kihara, Daisuke",
editor="Greenspan, Hayit and Madabhushi, Anant and Mousavi, Parvin and Salcudean, Septimiu and Duncan, James and Syeda-Mahmood, Tanveer and Taylor, Russell",
title="ACC-UNet: A Completely Convolutional UNet Model for the 2020s",
booktitle="Medical Image Computing and Computer Assisted Intervention -- MICCAI 2023",
year="2023",
publisher="Springer Nature Switzerland",
pages="692--702",
isbn="978-3-031-43898-1"
}

Introduction

This decade is marked by the introduction of the Vision Transformer, a radical paradigm shift in computer vision at large. Medical imaging has followed a similar trend: UNet, one of the most influential architectures, has been redesigned with transformers. Recently, the efficacy of convolutional models in vision has been reinvestigated by seminal works such as ConvNeXt, which elevates a ResNet to Swin Transformer level. Deriving inspiration from this, we aim to improve a purely convolutional UNet model so that it can be on par with transformer-based models, e.g., Swin-Unet or UCTransNet. We examined several advantages of the transformer-based UNet models, primarily long-range dependencies and cross-level skip connections. We attempted to emulate them through convolution operations and thus propose ACC-UNet, a completely convolutional UNet model that brings the best of both worlds: the inherent inductive biases of convnets with the design decisions of transformers. ACC-UNet was evaluated on 5 different medical image segmentation benchmarks and consistently outperformed convnets, transformers and their hybrids. Notably, ACC-UNet outperforms the state-of-the-art models Swin-Unet and UCTransNet by $2.64 \pm 2.54\%$ and $0.45 \pm 1.61\%$ in terms of Dice score, respectively, while using a fraction of their parameters ($59.26\%$ and $24.24\%$).

Network Architecture

We propose a convolutional UNet, ACC-UNet (Fig. A). We started with a vanilla UNet model and halved the number of filters in all the layers. Then, we replaced the convolutional blocks in the encoder and decoder with our proposed HANC blocks (Fig. B). For all the blocks, we used $inv\_fctr = 3$, except for the last decoder block at level 3, where we used $inv\_fctr = 34$ to mimic the $9$ times increase at stage 3 of Swin Transformer. $k=3$, which considers up to $4\times4$ patches (as used in Swin Transformer), was selected for all but the bottleneck level. Next, we modified the skip connections by using residual blocks (Fig. C), similar to ResPaths, to reduce the semantic gap, and stacked 3 MLFC blocks (Fig. D) to fuse the multi-level features. All the convolutional layers were batch-normalized, activated by Leaky ReLU and recalibrated by squeeze and excitation.
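The two configuration choices above (halved filter counts and the patch sizes implied by $k$) can be illustrated with a short sketch. This is not taken from the repository code: the helper names are hypothetical, the 64 to 1024 progression is the filter layout of the original UNet, and the doubling of patch sizes per level is an assumption that is merely consistent with $k=3$ reaching $4\times4$ patches.

```python
# Filter widths of the original (vanilla) UNet, from the first encoder
# level down to the bottleneck.
VANILLA_UNET_FILTERS = [64, 128, 256, 512, 1024]


def halved_filters(filters):
    """Halve the number of filters at every level."""
    return [f // 2 for f in filters]


def patch_sizes(k):
    """Patch sizes considered for a given k.

    Assumption: sizes double at each step (2, 4, ..., 2**(k-1)),
    so k=3 reaches up to 4x4 patches.
    """
    return [2 ** i for i in range(1, k)]


if __name__ == "__main__":
    print(halved_filters(VANILLA_UNET_FILTERS))
    print(patch_sizes(3))
```

With these assumptions, the halved model starts at 32 filters in the first level, and $k=3$ covers $2\times2$ and $4\times4$ patches.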

To summarize, in a UNet model, we replaced the classical convolutional blocks with our proposed HANC blocks, which perform an approximate version of self-attention, and modified the skip connections with MLFC blocks, which consider the feature maps from different encoder levels. The proposed model has $16.77$ M parameters, roughly $2$ M more than the vanilla UNet model.
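A parameter count like the one quoted above can be checked with a small helper. The sketch below is an illustration, not part of this repository: `count_parameters` is a hypothetical name, and it relies only on the standard PyTorch `Module.parameters()`, `Tensor.numel()` and `requires_grad` interfaces. Whether the reported figure counts trainable parameters only is not stated here, so both options are exposed.

```python
def count_parameters(model, trainable_only=True):
    """Return the parameter count of a PyTorch-style model, in millions.

    `model` is anything exposing `.parameters()`, e.g. a torch.nn.Module.
    """
    total = sum(
        p.numel()
        for p in model.parameters()
        # Skip frozen parameters unless all parameters are requested.
        if p.requires_grad or not trainable_only
    )
    return total / 1e6
```

Passing an instantiated model to `count_parameters` and rounding to two decimals gives a number directly comparable to the figures in this section.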

Model Implementation

A PyTorch implementation of the model can be found in the /ACC_UNet directory.

Experiments

Please refer to the /Experiments directory.
