How does topology of neural architectures impact gradient propagation and model performance?

DenseNets introduce concatenation-type skip connections and achieve state-of-the-art accuracy on several computer vision tasks. In this paper, we reveal that the topology of these concatenation-type skip connections is closely related to gradient propagation which, in turn, enables a predictable behavior of DNNs’ test performance. To this end, we introduce a new metric called NN-Mass to quantify how effectively information flows through DNNs. Moreover, we empirically show that NN-Mass also works for other types of skip connections, e.g., for ResNets, Wide-ResNets (WRNs), and MobileNets, which contain addition-type skip connections (i.e., residuals or inverted residuals). As such, for both DenseNet-like CNNs and ResNets/WRNs/MobileNets, our theoretically grounded NN-Mass can identify models with similar accuracy despite significantly different size/compute requirements. Detailed experiments on both synthetic and real datasets (e.g., MNIST, CIFAR-10, CIFAR-100, ImageNet) provide extensive evidence for our insights. Finally, the closed-form equation of our NN-Mass enables us to design significantly compressed DenseNets (for CIFAR-10) and MobileNets (for ImageNet) directly at initialization, without time-consuming training or searching.
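As a minimal sketch of the idea, a topology score in the spirit of NN-Mass can be computed per cell from width, depth, and the number of skip connections actually present, scaled by the cell's size. The candidate-link denominator and the exact scaling below are illustrative assumptions, not the paper's exact closed form; `nn_mass_cell`, `nn_mass`, and the example architectures are hypothetical.

```python
def nn_mass_cell(width: int, depth: int, num_skip_links: int) -> float:
    """Topology score for one cell: width * depth * skip-link density.

    `num_skip_links` counts channel-level skip connections actually present.
    The candidate total (every layer linked to every earlier layer, per
    channel) is an assumed simplification of the paper's definition.
    """
    candidate_links = width * depth * (depth - 1) // 2  # assumed denominator
    if candidate_links == 0:
        return 0.0
    density = num_skip_links / candidate_links
    return width * depth * density


def nn_mass(cells: list[tuple[int, int, int]]) -> float:
    """Sum per-cell scores over (width, depth, num_skip_links) triples."""
    return sum(nn_mass_cell(w, d, s) for (w, d, s) in cells)


# Two hypothetical architectures: a wider one with sparser skip links and
# a slimmer one whose denser connectivity yields a comparable score --
# illustrating how such a metric could flag similar-accuracy models of
# different size/compute.
wide = [(64, 8, 896), (128, 8, 1792)]   # density 0.5 per cell
slim = [(48, 8, 1008), (96, 8, 2016)]   # density 0.75 per cell
print(nn_mass(wide), nn_mass(slim))
```

Because such a score depends only on widths, depths, and link counts, it can be evaluated at initialization, before any training, which is consistent with the zero search-cost entries in the results below.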


Results from the Paper


| Task | Dataset | Model | Metric Name | Metric Value | Global Rank |
|---|---|---|---|---|---|
| Neural Architecture Search | CIFAR-10 | NN-MASS-CIFAR-A | Top-1 Error Rate | 3.0% | #34 |
| | | | Search Time (GPU days) | 0 | #1 |
| | | | Parameters | 5.02M | #37 |
| | | | FLOPS | 1.95G | #2 |
| Neural Architecture Search | CIFAR-10 | NN-MASS-CIFAR-C | Top-1 Error Rate | 3.18% | #36 |
| | | | Search Time (GPU days) | 0 | #1 |
| | | | Parameters | 3.82M | #32 |
| | | | FLOPS | 1.2G | #2 |
| Neural Architecture Search | ImageNet | NN-MASS-B | Top-1 Error Rate | 26.7% | #123 |
| | | | Accuracy | 73.3% | #100 |
| | | | FLOPs | 393M | #118 |
| | | | Params | 3.7M | #55 |
| | | | MACs | 393M | #111 |
| Neural Architecture Search | ImageNet | NN-MASS-A | Top-1 Error Rate | 27.1% | #126 |
| | | | Accuracy | 72.9% | #103 |
| | | | FLOPs | 200M | #109 |
| | | | Params | 2.3M | #59 |
| | | | MACs | 200M | #70 |
