Skip to main navigation Skip to search Skip to main content

Cascade wavelet transform based convolutional neural networks with application to image classification

  • Jieqi Sun
  • , Yafeng Li
  • , Qijun Zhao
  • , Ziyu Guo
  • , Ning Li
  • , Tao Hai
  • , Wenbo Zhang
  • , Dong Chen
  • Baoji University of Arts and Sciences
  • College of Computer Science
  • Peking University

Research output: Contribution to journalArticlepeer-review

13 Scopus citations

Abstract

Pooling has been the core ingredient of modern convolutional neural networks (CNNs). Although classic pooling methods are simple and effective, it will inevitably lead to the problem that some features that make a great contribution to classification may be ignored. To solve this issue, this paper presents a novel cascade wavelet transform module, which makes full use of different frequency components and can be seamlessly integrated into the existing CNNs by replacing the existing pooling operation. In our method, wavelet transforms are performed in both spatial and channel domain. In spatial domain, using 2D discrete wavelet transform, we design a spatial pooling layer with attention mechanism by integrating low-frequency and high-frequency information. In channel domain, based on 1D discrete wavelet transform, a channel pooling layer with the attention mechanism is proposed for the final feature reconstruction. We call the proposed cascade wavelet transform based CNNs CasDWTNets. Compared to the traditional CNNs, experiments demonstrate that CasDWTNets obtain outstanding consistency and accuracy in image classification. Code will be made available.

Original languageEnglish
Pages (from-to)285-295
Number of pages11
JournalNeurocomputing
Volume514
DOIs
StatePublished - 1 Dec 2022
Externally publishedYes

Keywords

  • Attention mechanism
  • Cascade wavelet transforms
  • Convolutional neural networks
  • Image classification

Fingerprint

Dive into the research topics of 'Cascade wavelet transform based convolutional neural networks with application to image classification'. Together they form a unique fingerprint.

Cite this