TASK	DATASET	MODEL	METRIC NAME	METRIC VALUE	GLOBAL RANK
Photo geolocation estimation	Im2GPS	StreetCLIP (Zero-Shot)	City level (25 km)	28.3	# 8
Photo geolocation estimation	Im2GPS	StreetCLIP (Zero-Shot)	Region level (200 km)	45.1	# 7
Photo geolocation estimation	Im2GPS	StreetCLIP (Zero-Shot)	Country level (750 km)	74.7	# 2
Photo geolocation estimation	Im2GPS	StreetCLIP (Zero-Shot)	Continent level (2500 km)	88.2	# 2
Photo geolocation estimation	Im2GPS	StreetCLIP (Zero-Shot)	Training images	1.1M	# 1
Photo geolocation estimation	Im2GPS	StreetCLIP (Zero-Shot)	Reference images	0	# 1
Photo geolocation estimation	Im2GPS3k	StreetCLIP (Zero-Shot)	Street level (1 km)	-	# 12
Photo geolocation estimation	Im2GPS3k	StreetCLIP (Zero-Shot)	City level (25 km)	22.4	# 9
Photo geolocation estimation	Im2GPS3k	StreetCLIP (Zero-Shot)	Region level (200 km)	37.4	# 5
Photo geolocation estimation	Im2GPS3k	StreetCLIP (Zero-Shot)	Country level (750 km)	61.3	# 3
Photo geolocation estimation	Im2GPS3k	StreetCLIP (Zero-Shot)	Continent level (2500 km)	80.4	# 3
Photo geolocation estimation	Im2GPS3k	StreetCLIP (Zero-Shot)	Training Images	1.1M	# 12

Badge	Markdown
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/learning-generalized-zero-shot-learners-for/photo-geolocation-estimation-on-im2gps)](https://paperswithcode.com/sota/photo-geolocation-estimation-on-im2gps?p=learning-generalized-zero-shot-learners-for)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/learning-generalized-zero-shot-learners-for/photo-geolocation-estimation-on-im2gps3k)](https://paperswithcode.com/sota/photo-geolocation-estimation-on-im2gps3k?p=learning-generalized-zero-shot-learners-for)`

Learning Generalized Zero-Shot Learners for Open-Domain Image Geolocalization

1 Feb 2023 · Lukas Haas, Silas Alberti, Michal Skreta ·

Image geolocalization is the challenging task of predicting the geographic coordinates of origin for a given photo. It is an unsolved problem relying on the ability to combine visual clues with general knowledge about the world to make accurate predictions across geographies. We present $\href{https://huggingface.co/geolocal/StreetCLIP}{\text{StreetCLIP}}$, a robust, publicly available foundation model not only achieving state-of-the-art performance on multiple open-domain image geolocalization benchmarks but also doing so in a zero-shot setting, outperforming supervised models trained on more than 4 million images. Our method introduces a meta-learning approach for generalized zero-shot learning by pretraining CLIP from synthetic captions, grounding CLIP in a domain of choice. We show that our method effectively transfers CLIP's generalized zero-shot capabilities to the domain of image geolocalization, improving in-domain generalized zero-shot performance without finetuning StreetCLIP on a fixed set of classes.

PDF Abstract

Code

Add Remove Mark official

geolocal/StreetCLIP official

Tasks

Add Remove

Generalized Zero-Shot Learning

Meta-Learning

Photo geolocation estimation

Zero-Shot Learning

Datasets

Add Datasets introduced or used in this paper

Results from the Paper

Edit

Ranked #1 on Photo geolocation estimation on Im2GPS (Training images metric)

Get a GitHub badge

Task	Dataset	Model	Metric Name	Metric Value	Global Rank	Benchmark
Photo geolocation estimation	Im2GPS	StreetCLIP (Zero-Shot)	City level (25 km)	28.3	# 8	Compare
			Region level (200 km)	45.1	# 7	Compare
			Country level (750 km)	74.7	# 2	Compare
			Continent level (2500 km)	88.2	# 2	Compare
			Training images	1.1M	# 1	Compare
			Reference images	0	# 1	Compare
Photo geolocation estimation	Im2GPS3k	StreetCLIP (Zero-Shot)	Street level (1 km)	-	# 12	Compare
			City level (25 km)	22.4	# 9	Compare
			Region level (200 km)	37.4	# 5	Compare
			Country level (750 km)	61.3	# 3	Compare
			Continent level (2500 km)	80.4	# 3	Compare
			Training Images	1.1M	# 12	Compare

Methods

Add Remove

CLIP

Edit Social Preview

Learning Generalized Zero-Shot Learners for Open-Domain Image Geolocalization

Code Edit Add Remove Mark official

Tasks Edit Add Remove

Datasets Edit

Results from the Paper Edit

Methods Edit Add Remove

Code

Add Remove Mark official

Tasks

Add Remove

Datasets

Results from the Paper

Edit

Methods

Add Remove