Efficient Face Super-Resolution via Wavelet-based Feature Enhancement Network

ACM Multimedia (ACM MM), 2024

[Paper] [Code]

Summary

Face super-resolution aims to reconstruct a high-resolution face image from a low-resolution face image. Previous methods typically employ an encoder-decoder structure to extract facial structural features, where the direct downsampling inevitably introduces distortions, especially to high-frequency features such as edges. To address this issue, we propose a wavelet-based feature enhancement network, which mitigates feature distortion by losslessly decomposing the input feature into high and low-frequency components using the wavelet transform and processing them separately. To improve the efficiency of facial feature extraction, a full domain Transformer is further proposed to enhance local, regional, and global facial features. Such designs allow our method to perform better without stacking many modules as previous methods did. Experiments show that our method effectively balances performance, model size, and speed.

Overview of our method, where the cascaded of WFD and WFU constitute the wavelet-based encoder-decoder.

Visual Results

Citation

@inproceedings{li2024efficient,
title={Efficient face super-resolution via wavelet-based feature enhancement network},
author={Wenjie Li, Heng Guo, Xuannan Liu, Kongming Liang, Jiani Hu, Zhanyu Ma, Jun Guo,
booktitle={Proceedings of the 32nd ACM International Conference on Multimedia},
pages={4515--4523},
year={2024}
}