A Comprehensive Study on Torchvision Pre-trained Models for Fine-grained Inter-species Classification

This study aims to explore different pre-trained models offered in the Torchvision package which is available in the PyTorch library. And investigate their effectiveness on fine-grained images classification. Transfer Learning is an effective method of achieving extremely good performance with insufficient training data. In many real-world situations, people cannot collect sufficient data required to train a deep neural network model efficiently. Transfer Learning models are pre-trained on a large data set, and can bring a good performance on smaller datasets with significantly lower training time. Torchvision package offers us many models to apply the Transfer Learning on smaller datasets. Therefore, researchers may need a guideline for the selection of a good model. We investigate Torchvision pre-trained models on four different data sets: 10 Monkey Species, 225 Bird Species, Fruits 360, and Oxford 102 Flowers. These data sets have images of different resolutions, class numbers, and different achievable accuracies. We also apply their usual fully-connected layer and the Spinal fully-connected layer to investigate the effectiveness of SpinalNet. The Spinal fully-connected layer brings better performance in most situations. We apply the same augmentation for different models for the same data set for a fair comparison. This paper may help future Computer Vision researchers in choosing a proper Transfer Learning model.

PDF Abstract

Results from the Paper


 Ranked #1 on Fine-Grained Image Classification on Bird-225 (using extra training data)

     Get a GitHub badge
Task Dataset Model Metric Name Metric Value Global Rank Uses Extra
Training Data
Benchmark
Fine-Grained Image Classification 10 Monkey Species WideResNet-101(Spinal FC) Accuracy 99.26 # 1
Fine-Grained Image Classification 10 Monkey Species Inception-v3 (Spinal FC) Accuracy 99.26 # 1
Fine-Grained Image Classification 10 Monkey Species VGG-19_bn Accuracy 98.90 # 3
Fine-Grained Image Classification Bird-225 WideResNet-101 (Spinal FC) Accuracy 99.56 # 1
Fine-Grained Image Classification Bird-225 WideResNet-101 Accuracy 99.38 # 3
Fine-Grained Image Classification Fruits-360 ResNeXt-101 Accuracy (%) 99.98 # 1
Fine-Grained Image Classification Oxford 102 Flowers DenseNet-201 Accuracy 98.29 # 14
Fine-Grained Image Classification Oxford 102 Flowers DenseNet-201(Spinal FC) Accuracy 98.36 # 12

Methods


No methods listed for this paper. Add relevant methods here