TASK	DATASET	MODEL	METRIC NAME	METRIC VALUE	GLOBAL RANK
Image Classification	CIFAR-10	GFNet-H-B	Percentage correct	99.0	# 19
Image Classification	CIFAR-10	GFNet-H-B	PARAMS	54M	# 229
Image Classification	CIFAR-100	GFNet-H-B	Percentage correct	90.3	# 24
Image Classification	CIFAR-100	GFNet-H-B	PARAMS	54M	# 195
Image Classification	Flowers-102	GFNet-H-B	Accuracy	98.8	# 17
Image Classification	Flowers-102	GFNet-H-B	PARAMS	54M	# 49
Image Classification	ImageNet	GFNet-H-B	Top 1 Accuracy	82.9%	# 445
Image Classification	ImageNet	GFNet-H-B	Number of params	54M	# 737
Image Classification	ImageNet	GFNet-H-B	Hardware Burden	None	# 1
Image Classification	ImageNet	GFNet-H-B	Operations per network pass	None	# 1
Image Classification	ImageNet	GFNet-H-B	GFLOPs	8.6	# 281
Domain Generalization	ImageNet-A	GFNet-S	Top-1 accuracy %	14.3	# 31
Domain Generalization	ImageNet-C	GFNet-S	mean Corruption Error (mCE)	53.8	# 29
Image Classification	Stanford Cars	GFNet-H-B	Accuracy	93.2	# 9

Badge	Markdown
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/global-filter-networks-for-image/image-classification-on-stanford-cars)](https://paperswithcode.com/sota/image-classification-on-stanford-cars?p=global-filter-networks-for-image)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/global-filter-networks-for-image/image-classification-on-flowers-102)](https://paperswithcode.com/sota/image-classification-on-flowers-102?p=global-filter-networks-for-image)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/global-filter-networks-for-image/image-classification-on-cifar-10)](https://paperswithcode.com/sota/image-classification-on-cifar-10?p=global-filter-networks-for-image)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/global-filter-networks-for-image/image-classification-on-cifar-100)](https://paperswithcode.com/sota/image-classification-on-cifar-100?p=global-filter-networks-for-image)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/global-filter-networks-for-image/domain-generalization-on-imagenet-c)](https://paperswithcode.com/sota/domain-generalization-on-imagenet-c?p=global-filter-networks-for-image)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/global-filter-networks-for-image/domain-generalization-on-imagenet-a)](https://paperswithcode.com/sota/domain-generalization-on-imagenet-a?p=global-filter-networks-for-image)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/global-filter-networks-for-image/image-classification-on-imagenet)](https://paperswithcode.com/sota/image-classification-on-imagenet?p=global-filter-networks-for-image)`

Global Filter Networks for Image Classification

NeurIPS 2021 · Yongming Rao, Wenliang Zhao, Zheng Zhu, Jiwen Lu, Jie Zhou

Recent advances in self-attention and pure multi-layer perceptron (MLP) models for vision have shown great potential in achieving promising performance with fewer inductive biases. These models are generally based on learning interaction among spatial locations from raw data. The complexity of self-attention and MLP grows quadratically as the image size increases, which makes these models hard to scale up when high-resolution features are required. In this paper, we present the Global Filter Network (GFNet), a conceptually simple yet computationally efficient architecture that learns long-term spatial dependencies in the frequency domain with log-linear complexity. Our architecture replaces the self-attention layer in vision transformers with three key operations: a 2D discrete Fourier transform, an element-wise multiplication between frequency-domain features and learnable global filters, and a 2D inverse Fourier transform. We exhibit favorable accuracy/complexity trade-offs of our models on both ImageNet and downstream tasks. Our results demonstrate that GFNet can be a very competitive alternative to transformer-style models and CNNs in efficiency, generalization ability and robustness. Code is available at https://github.com/raoyongming/GFNet
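The three operations named in the abstract can be sketched in a few lines of NumPy. This is a minimal illustration, not the authors' implementation: the function name `global_filter`, the `(H, W, C)` layout, and the use of the real-input FFT (`rfft2`, which stores only half of the spectrum) are assumptions made for the example, and the filter here is a fixed array rather than a trained parameter. The log-linear cost mentioned in the abstract comes from the fast Fourier transform used to compute the DFT.

```python
import numpy as np


def global_filter(x, filt):
    """Mix spatial locations of a real feature map in the frequency domain.

    x:    real array of shape (H, W, C), one feature vector per location
    filt: complex array of shape (H, W // 2 + 1, C); stands in for the
          learnable global filter (hypothetical layout, chosen for rfft2)
    """
    h, w, _ = x.shape
    # 1) 2D discrete Fourier transform over the two spatial axes.
    freq = np.fft.rfft2(x, axes=(0, 1))
    # 2) Element-wise multiplication with the global filter.
    freq = freq * filt
    # 3) 2D inverse Fourier transform back to the spatial domain.
    return np.fft.irfft2(freq, s=(h, w), axes=(0, 1))


rng = np.random.default_rng(0)
x = rng.standard_normal((14, 14, 8))

# An all-ones filter leaves every frequency untouched, so the layer
# acts as the identity; a trained filter would reweight frequencies.
identity_filter = np.ones((14, 14 // 2 + 1, 8), dtype=complex)
y = global_filter(x, identity_filter)
```

Because multiplication in the frequency domain corresponds to circular convolution in the spatial domain, each output location can depend on every input location, which is how a single filter covers the whole feature map.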

Code

raoyongming/GFNet official

400

liuruiyang98/Jittor-MLP

162

mindspore-courses/External-Attentio…

dslisleedh/MLP_based_models-tensorf…

Tasks

Classification

Domain Generalization

Image Classification

Datasets

CIFAR-10

ImageNet

CIFAR-100

Oxford 102 Flower

ADE20K

Stanford Cars

ImageNet-C

ImageNet-A

Results from the Paper

Ranked #9 on Image Classification on Stanford Cars (using extra training data)
