Affordable Access

Access to the full text

Convolutional Transformer-Based Image Compression

Authors
  • Arezki, Bouzid
  • Feng, Fangchen
  • Mokraoui, Anissa
Type
Preprint
Publication Date
Sep 06, 2024
Submission Date
Sep 06, 2024
Identifiers
DOI: 10.23919/SPA59660.2023.10274433
Source
arXiv
License
Green
External links

Abstract

In this paper, we present a novel transformer-based architecture for end-to-end image compression. Our architecture incorporates blocks that effectively capture local dependencies between tokens, eliminating the need for positional encoding by integrating convolutional operations within the multi-head attention mechanism. We demonstrate through experiments that our proposed framework surpasses state-of-the-art CNN-based architectures in terms of the trade-off between bit-rate and distortion and achieves comparable results to transformer-based methods while maintaining lower computational complexity.

Report this publication

Statistics

Seen <100 times