Learn from Each Other to Classify Better: Cross-layer Mutual Attention Learning for Fine-grained Visual Classification

Fine-grained visual classification (FGVC) is valuable yet challenging. The difficulty of FGVC mainly lies in its intrinsic inter-class similarity, intra-class variation, and limited training data. Moreover, with the rise of deep convolutional neural networks, researchers have relied mainly on deep, abstract, semantic information for FGVC, while shallow, detailed information has been neglected. This work proposes a cross-layer mutual attention learning network (CMAL-Net) to solve the above problems. Specifically, this work views the shallow to deep layers of CNNs as "experts" knowledgeable about different perspectives. Each expert gives a category prediction and an attention region indicating the clues it has found. Attention regions are treated as information carriers among experts, bringing three benefits: (i) helping the model focus on discriminative regions; (ii) providing more training data; (iii) allowing experts to learn from each other to improve the overall performance. CMAL-Net achieves state-of-the-art performance on three competitive datasets: FGVC-Aircraft, Stanford Cars, and Food-11.
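The abstract describes several "experts" attached at different backbone depths, each producing a category prediction and an attention region whose crop is fed back as extra training data. The sketch below is only an illustration of that structure under assumptions: the torchvision ResNet-50 backbone, the channel-averaged attention map, the `crop_from_attention` heuristic, and all module names are hypothetical choices, not the paper's actual implementation.

```python
# Hypothetical sketch of the cross-layer "experts" idea; not the paper's code.
import torch
import torch.nn as nn
import torch.nn.functional as F
from torchvision.models import resnet50


class ExpertHead(nn.Module):
    """One 'expert': classifies from a single feature stage and exposes a
    spatial attention map derived from its activations."""
    def __init__(self, in_channels, num_classes):
        super().__init__()
        self.pool = nn.AdaptiveAvgPool2d(1)
        self.fc = nn.Linear(in_channels, num_classes)

    def forward(self, feat):
        logits = self.fc(self.pool(feat).flatten(1))
        attn = feat.mean(dim=1)  # channel-averaged activation map, (B, H, W)
        return logits, attn


def crop_from_attention(images, attn, out_size=448, thresh=0.5):
    """Crop the bounding box of high-attention pixels from each image.
    A simple stand-in for the paper's attention-region extraction."""
    B, _, H, W = images.shape
    attn = F.interpolate(attn.unsqueeze(1), size=(H, W), mode="bilinear",
                         align_corners=False).squeeze(1)
    crops = []
    for b in range(B):
        a = attn[b]
        mask = a >= a.min() + thresh * (a.max() - a.min())
        ys, xs = torch.nonzero(mask, as_tuple=True)
        y0, y1 = ys.min().item(), ys.max().item() + 1
        x0, x1 = xs.min().item(), xs.max().item() + 1
        crop = images[b:b + 1, :, y0:y1, x0:x1]
        crops.append(F.interpolate(crop, size=(out_size, out_size),
                                   mode="bilinear", align_corners=False))
    return torch.cat(crops, dim=0)


class CrossLayerExperts(nn.Module):
    """Backbone with three experts attached to progressively deeper stages."""
    def __init__(self, num_classes):
        super().__init__()
        net = resnet50(weights=None)  # assumed backbone, randomly initialized
        self.stem = nn.Sequential(net.conv1, net.bn1, net.relu, net.maxpool,
                                  net.layer1)
        self.stage2, self.stage3, self.stage4 = net.layer2, net.layer3, net.layer4
        self.experts = nn.ModuleList([
            ExpertHead(512, num_classes),    # shallow expert
            ExpertHead(1024, num_classes),   # middle expert
            ExpertHead(2048, num_classes),   # deep expert
        ])

    def forward(self, x):
        f2 = self.stage2(self.stem(x))
        f3 = self.stage3(f2)
        f4 = self.stage4(f3)
        # One (logits, attention map) pair per expert.
        return [expert(f) for expert, f in zip(self.experts, (f2, f3, f4))]
```

In a training loop, `crop_from_attention` could be applied to each expert's attention map and the resulting crops passed through the network again as additional supervised samples, which is how, per the abstract, the attention regions act as information carriers that let the experts learn from each other.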


Results from the Paper


Ranked #1 on Fine-Grained Image Classification on Stanford Cars (using extra training data)

Task                               Dataset        Model     Metric    Metric Value  Global Rank
Fine-Grained Image Classification  FGVC Aircraft  CMAL-Net  Accuracy  94.7%         #4
Fine-Grained Image Classification  Stanford Cars  CMAL-Net  Accuracy  97.1%         #1
