No-MambAAD : Revitalizing Conv-Only Networks for Unsupervised Anomaly Detection
Pysyvä osoite
Kuvaus
©2025 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works.
Most of the current state-of-the-art visual unsupervised anomaly detection (UAD) methods leverage complex neural architecture modules: Transformer-based methods provide high-quality anomaly detection performance due to their global feature extraction capability, similar to the recent Mamba based methods that combine the strengths of CNNs and Transformers. Some of the simpler reconstruction-based UAD methods are purely CNN-based, which offers linear complexity, but is performance-restricted by feature extraction locality. Hence, the architecture variants have inherent design trade-offs: CNNs lacks long-range feature interaction, Transformers struggle with quadratic complexity, and Mamba based solutions suffer in high parameter count and scalability. In this work we propose to revisit CNN-based approaches by introducing novel stripmodulation and gated-mixer mechanisms, and propose No-MambAAD, a novel visual UAD method absent of Mamba and Attention blocks. The proposed method offers similar or better anomaly detection performance than the current state-of-the-art approaches and outperforms the current state-of-the-art across multiple benchmarks with 38% smaller parameter count.
Emojulkaisu
2025 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW)
ISBN
979-8-3315-9994-2
ISSN
2160-7516
2160-7508
2160-7508
Aihealue
OKM-julkaisutyyppi
A4 Artikkeli konferenssijulkaisussa