UHDformer [AAAI-24]

This is the official PyTorch codes for the paper
Correlation Matching Transformation Transformers for UHD Image Restoration
Cong Wang, Jinshan Pan, Wei Wang, Gang Fu, Siyuan Liang, Mengzhu Wang, Xiao-Ming Wu, Jun Liu

Abstract

This paper proposes UHDformer, a general Transformer for Ultra-High-Definition (UHD) image restoration. 
UHDformer contains two learning spaces: (a) learning in high-resolution space and (b) learning in low-resolution space. 
The former learns multi-level high-resolution features and fuses low-high features and reconstructs the residual images, while the latter
explores more representative features learning from the highresolution ones to facilitate better restoration. 
To better improve feature representation in low-resolution space, we propose to build feature transformation from the high-resolution space to the low-resolution one. 
To that end, we propose two new modules: Dual-path Correlation Matching Transformation module (DualCMT) and Adaptive Channel Modulator (ACM). 
The DualCMT selects top C/r (r is greater or equal to 1 which controls the squeezing level) correlation channels
from the max-pooling/mean-pooling high-resolution features to replace low-resolution ones in Transformers, which can effectively
squeeze useless content to improve the feature representation in low-resolution space to facilitate better recovery. 
The ACM is exploited to adaptively modulate multi-level high-resolution features, enabling to provide more useful features to low-resolution space for better learning. 
Experimental results show that our UHDformer reduces about ninetyseven percent model sizes compared with most state-of-theart methods while significantly improving performance under different training sets on 3 UHD image restoration tasks, including
low-light image enhancement, image dehazing, and image deblurring.

Overall of UHDformer

Overall framework of the proposed UHDformer. It mainly contains two parts: (a) learning in high-resolution space and (b) learning in low-resolution space. The former explores multi-level high-resolution features and fuses low-high features and reconstructs the residual images, while the latter learns the low-resolution features via the Correlation Matching Transformation Transformer Block (as shown in (c)). Before transforming high-resolution features to low-resolution space, we use an Adaptive Channel Modulator (as shown in (d)) to adaptively modulate multi-level high-resolution features to provide more representative content to low-resolution space.

Dual-path Correlation Matching Transformation

(a) Dual-path Correlation Matching Transformation (DualCMT) and (b) Correlation Matching Transformation (CMT)}. DualCMT contains Mean-Pooling and Max-Pooling-based CMT followed by a Gated Feature Refinement (GFR) module. CMT selects the top C/r (C \text{denotes the number of channels; r>=1 is the squeezing factor which controls the squeezing level) channels from the pooling high-resolution features to replace the low-resolution features. GFR is used to refine the features for better representation.

Main Results

Low-light Image Enhancement on UHD-LL

Image Dehazing on UHD-Haze

Image Deblurring on UHD-Blur

Dependencies and Installation

Ubuntu >= 18.04
CUDA >= 11.0
Other required packages in requirements.txt

# git clone this repository
git clone https://github.com/supersupercong/UHDformer.git
cd UHDformer 

# create new anaconda env
conda create -n uhdformer python=3.8
source activate uhdformer 

# install python dependencies
pip3 install -r requirements.txt
python setup.py develop

Datasets Download

UHD-LL, UHD-Haze, UHD-Blur

Pre-trained Model

[UHD-LL, UHD-Haze, UHD-Blur, LOL, SOTS-ITS] (https://drive.google.com/drive/folders/1XwIWOjOepUA-kXoaeOlzMDijVK4LMj-1?usp=sharing)

Visual Results

UHD-LL&UHD-Haze&UHD-Blur&LOL-SOTS

Train

bash train.sh

Test

bash test.sh

Citation

@inproceedings{wang2024uhdformer,
  title={Correlation Matching Transformation Transformers for UHD Image Restoration},
  author={Wang, Cong and Pan, Jinshan and Wang, Wei and Fu, Gang and Liang, Siyuan and Wang, Mengzhu and Wu, Xiao-Ming and Liu, Jun},
  booktitle={Proceedings of the AAAI Conference on Artificial Intelligence},
  volume={38},
  number={6},
  pages={5336--5344},
  year={2024}
}

License

This work is licensed under a Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International License.

Contact

Any questions can contact: Cong Wang [supercong94@gmail.com]

Acknowledgement

This project is based on FeMaSR.

Name		Name	Last commit message	Last commit date
Latest commit History 28 Commits
basicsr		basicsr
imgs		imgs
options		options
scripts		scripts
LICENSE		LICENSE
README.md		README.md
VERSION		VERSION
_init__.py		_init__.py
comput_psnr_ssim.py		comput_psnr_ssim.py
inference_uhdformer.py		inference_uhdformer.py
requirements.txt		requirements.txt
setup.py		setup.py
test.sh		test.sh
train.sh		train.sh
train_uhdformer.yml		train_uhdformer.yml

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

UHDformer [AAAI-24]

Abstract

Overall of UHDformer

Dual-path Correlation Matching Transformation

Main Results

Low-light Image Enhancement on UHD-LL

Image Dehazing on UHD-Haze

Image Deblurring on UHD-Blur

Dependencies and Installation

Datasets Download

Pre-trained Model

Visual Results

Train

Test

Citation

License

Contact

Acknowledgement

About

Releases

Packages

Languages

License

supersupercong/UHDformer

Folders and files

Latest commit

History

Repository files navigation

UHDformer [AAAI-24]

Abstract

Overall of UHDformer

Dual-path Correlation Matching Transformation

Main Results

Low-light Image Enhancement on UHD-LL

Image Dehazing on UHD-Haze

Image Deblurring on UHD-Blur

Dependencies and Installation

Datasets Download

Pre-trained Model

Visual Results

Train

Test

Citation

License

Contact

Acknowledgement

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages