TASK	DATASET	MODEL	METRIC NAME	METRIC VALUE	GLOBAL RANK
Test Agnostic Long-Tailed Learning	CIFAR-100-LT	DirMixE	Average Top-1 Accuracy	52.54	# 1
Long-tail Learning	CIFAR-100-LT (ρ=100)	DirMixE	Error Rate	51.62	# 33
Test Agnostic Long-Tailed Learning	CIFAR-10-LT	DirMixE	Average Top-1 Accuracy	86.76	# 1
Long-tail Learning	CIFAR-10-LT (ρ=100)	DirMixE	Error Rate	16.74	# 13
Test Agnostic Long-Tailed Learning	ImageNet-LT	DirMixE	Average Top-1 Accuracy	60.46	# 1
Long-tail Learning	ImageNet-LT	DirMixE(ResNeXt-50)	Top-1 Accuracy	58.61	# 19
Long-tail Learning	iNaturalist 2018	DirMixE	Top-1 Accuracy	73.21	# 22
Test Agnostic Long-Tailed Learning	iNaturalist 2018	DirMixE	Average Top-1 Accuracy	73.24	# 1

Harnessing Hierarchical Label Distribution Variations in Test Agnostic Long-tail Recognition

13 May 2024 · Zhiyong Yang, Qianqian Xu, Zitai Wang, Sicong Li, Boyu Han, Shilong Bao, Xiaochun Cao, Qingming Huang

This paper explores test-agnostic long-tail recognition, a challenging long-tail task in which the test label distributions are unknown and arbitrarily imbalanced. We argue that the variation in these distributions can be broken down hierarchically into global and local levels. Global variations reflect a broad range of diversity, while local variations typically arise from milder changes, often focused on a particular neighbor. Traditional methods predominantly use a Mixture-of-Experts (MoE) approach that targets a few fixed test label distributions exhibiting substantial global variations, which leaves local variations unconsidered. To address this issue, we propose a new MoE strategy, DirMixE, which assigns experts to different Dirichlet meta-distributions of the label distribution, each targeting a specific aspect of local variations. Additionally, the diversity among these Dirichlet meta-distributions inherently captures global variations. This dual-level approach also leads to a more stable objective function, because sampling different test distributions lets us better quantify the mean and variance of performance outcomes. Theoretically, we show that our proposed objective benefits from enhanced generalization by virtue of the variance-based regularization. Comprehensive experiments across multiple benchmarks confirm the effectiveness of DirMixE. The code is available at https://github.com/scongl/DirMixE.
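The sampling idea in the abstract can be illustrated with a small NumPy sketch. This is not the authors' implementation and it does not model the experts themselves: it only shows how test label distributions might be drawn from a few Dirichlet meta-distributions and how a mean-plus-variance score could be computed over the samples. The function names, the three meta-distributions, the concentration scale of 50, the per-class losses, and the weight `lam` are all hypothetical choices made for illustration.

```python
import numpy as np


def sample_label_distributions(alpha, n_samples, rng):
    """Draw label distributions (rows sum to 1) from a Dirichlet with concentration alpha."""
    return rng.dirichlet(alpha, size=n_samples)


def mean_variance_objective(per_class_loss, label_dists, lam=1.0):
    """Score a model by the mean loss over sampled label distributions plus lam * variance.

    The loss under one label distribution is the label-weighted sum of per-class losses.
    """
    losses = label_dists @ per_class_loss
    return float(losses.mean() + lam * losses.var()), losses


rng = np.random.default_rng(0)
num_classes = 10
ranks = np.arange(num_classes)

# Hypothetical base distributions: long-tailed (imbalance ratio 100), uniform, reversed.
head_heavy = 100.0 ** (-ranks / (num_classes - 1))
head_heavy /= head_heavy.sum()
uniform = np.full(num_classes, 1.0 / num_classes)
tail_heavy = head_heavy[::-1].copy()

# Each meta-distribution is a Dirichlet centred on one base distribution.
# Differences between the centres stand in for global variation; the spread of
# samples around each centre stands in for local variation.
scale = 50.0  # assumed concentration; larger values give tighter local variation
meta_alphas = {
    "head_heavy": scale * head_heavy,
    "uniform": scale * uniform,
    "tail_heavy": scale * tail_heavy,
}

# Hypothetical per-class losses of a model that does worse on tail classes.
per_class_loss = np.linspace(0.2, 2.0, num_classes)

results = {}
for name, alpha in meta_alphas.items():
    dists = sample_label_distributions(alpha, n_samples=256, rng=rng)
    objective, losses = mean_variance_objective(per_class_loss, dists, lam=1.0)
    results[name] = (objective, float(losses.mean()))
    print(f"{name}: mean loss {losses.mean():.3f}, variance {losses.var():.4f}, objective {objective:.3f}")
```

In this toy setting the variance term penalizes a model whose loss swings widely across nearby label distributions, which is one way to read the variance-based regularization mentioned above.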

Code

scongl/dirmixe official

Tasks

Image Classification

Long-tail Learning

Test Agnostic Long-Tailed Learning

Datasets

CIFAR-10

CIFAR-100

iNaturalist

ImageNet-LT

CIFAR100-LT

Results from the Paper

Ranked #1 on Test Agnostic Long-Tailed Learning on CIFAR-10-LT

Methods

MoE
