Cat-AIR: Content- & Task-Aware All-in-One Image Restoration

¹The Ohio State University, ²Microsoft, ³Johns Hopkins University

^*Indicates Equal Contribution, ^†Indicates Corresponding Author

TL;DR

We present Cat-AIR 🚀, an efficient all-in-one image restoration framework that adaptively balances local and global information via alternating spatial-channel attention, using task- and content-aware routing to allocate convolution or self-attention based on region and task complexity.

🔥🔥 More content comming soon: codes, demos, models.

Abstract

All-in-one image restoration seeks to recover high-quality images from various types of degradation using a single model, without prior knowledge of the corruption source. However, existing methods often struggle to effectively and efficiently handle multiple degradation types. We present Cat-AIR, a novel Content And Task-aware framework for All-in-one Image Restoration. Cat-AIR incorporates an alternating spatial-channel attention mechanism that adaptively balances the local and global information for different tasks. Specifically, we introduce cross-layer channel attentions and cross-feature spatial attentions that allocate computations based on content and task complexity. Furthermore, we propose a smooth learning strategy that allows for seamless adaptation to new restoration tasks while maintaining performance on existing ones. Extensive experiments demonstrate that Cat-AIR achieves state-of-the-art results across a wide range of restoration tasks, requiring fewer FLOPs than previous methods, establishing new benchmarks for efficient all-in-one image restoration.

Cat-AIR architecture

Overview of Cat-AIR. Our design features alternating channel and spatial attention mechanisms, where channel attention complexity scales across layers and spatial attention complexity adapts based on features. Prompt modules between decoder blocks are inserted to identify degradations.

Method

The core of the method lies in an alternating spatial-channel attention mechanism, which adaptively balances local and global information through cross-layer channel attention and cross-feature spatial attention. These mechanisms dynamically allocate computational resources based on both task and content complexity—using lightweight convolution for simple regions and powerful self-attention for complex ones.

Spatial Attention Mechanism — Cross-Feature Spatial attention

BibTeX

@misc{jiang2025cataircontenttaskawareallinone, title={Cat-AIR: Content and Task-Aware All-in-One Image Restoration}, author={Jiachen Jiang and Tianyu Ding and Ke Zhang and Jinxin Zhou and Tianyi Chen and Ilya Zharkov and Zhihui Zhu and Luming Liang}, year={2025}, eprint={2503.17915}, archivePrefix={arXiv}, primaryClass={eess.IV}, url={https://arxiv.org/abs/2503.17915}, }

Cat-AIR: Content- & Task-Aware All-in-One Image Restoration

Denoise

Derain

Dehaze

Deblur

Light Enhancement

TL;DR

Demo App 🤗

Video Presentation

Abstract

FLOPs vs PSNR comparison.

Cat-AIR architecture

Method

Results Visualization

Mask Visualization

BibTeX