TASK	DATASET	MODEL	METRIC NAME	METRIC VALUE	GLOBAL RANK
Semi-Supervised Video Object Segmentation	DAVIS 2016	MobileVOS	Jaccard (Mean)	89.7	# 22
Semi-Supervised Video Object Segmentation	DAVIS 2016	MobileVOS	F-measure (Mean)	91.6	# 27
Semi-Supervised Video Object Segmentation	DAVIS 2016	MobileVOS	J&F	90.6	# 26
Semi-Supervised Video Object Segmentation	DAVIS 2016	MobileVOS	Speed (FPS)	100.1	# 1
Semi-Supervised Video Object Segmentation	DAVIS 2016	MobileVOS (BL30K)	Jaccard (Mean)	90.3	# 19
Semi-Supervised Video Object Segmentation	DAVIS 2016	MobileVOS (BL30K)	F-measure (Mean)	92.6	# 20
Semi-Supervised Video Object Segmentation	DAVIS 2016	MobileVOS (BL30K)	J&F	91.4	# 20
Semi-Supervised Video Object Segmentation	DAVIS 2016	MobileVOS (BL30K)	Speed (FPS)	100.1	# 1
Video Object Segmentation	DAVIS 2016	MobileVOS (val)	Jaccard (Mean)	90.3	# 6
Video Object Segmentation	DAVIS 2016	MobileVOS (val)	F-Score	92.6	# 7
Video Object Segmentation	DAVIS 2016	MobileVOS (val)	J&F	91.4	# 6
Semi-Supervised Video Object Segmentation	DAVIS 2017 (val)	MobileVOS	F-measure (Mean)	87.1	# 28
Semi-Supervised Video Object Segmentation	DAVIS 2017 (val)	MobileVOS	J&F	80.2	# 43
Semi-Supervised Video Object Segmentation	DAVIS 2017 (val)	MobileVOS	Speed (FPS)	90.6	# 1
Semi-Supervised Video Object Segmentation	DAVIS 2017 (val)	MobileVOS	Params(M)	8.1	# 4
Semi-Supervised Video Object Segmentation	DAVIS 2017 (val)	MobileVOS (BL30K)	F-measure (Mean)	88.9	# 16
Semi-Supervised Video Object Segmentation	DAVIS 2017 (val)	MobileVOS (BL30K)	J&F	82.3	# 36
Semi-Supervised Video Object Segmentation	DAVIS 2017 (val)	MobileVOS (BL30K)	Speed (FPS)	90.6	# 1
Semi-Supervised Video Object Segmentation	DAVIS 2017 (val)	MobileVOS (BL30K)	Params(M)	8.1	# 4
Video Object Segmentation	YouTube-VOS 2019	MobileVOS	Mean Jaccard & F-Measure	83.3	# 6
Video Object Segmentation	YouTube-VOS 2019	MobileVOS	Jaccard (Seen)	83.2	# 5
Video Object Segmentation	YouTube-VOS 2019	MobileVOS	Jaccard (Unseen)	76.9	# 8
Video Object Segmentation	YouTube-VOS 2019	MobileVOS	F-Measure (Seen)	87.7	# 5
Video Object Segmentation	YouTube-VOS 2019	MobileVOS	F-Measure (Unseen)	85.3	# 7

Badge	Markdown
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/mobilevos-real-time-video-object-segmentation/video-object-segmentation-on-davis-2016)](https://paperswithcode.com/sota/video-object-segmentation-on-davis-2016?p=mobilevos-real-time-video-object-segmentation)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/mobilevos-real-time-video-object-segmentation/video-object-segmentation-on-youtube-vos-2019-2)](https://paperswithcode.com/sota/video-object-segmentation-on-youtube-vos-2019-2?p=mobilevos-real-time-video-object-segmentation)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/mobilevos-real-time-video-object-segmentation/visual-object-tracking-on-davis-2016)](https://paperswithcode.com/sota/visual-object-tracking-on-davis-2016?p=mobilevos-real-time-video-object-segmentation)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/mobilevos-real-time-video-object-segmentation/visual-object-tracking-on-davis-2017)](https://paperswithcode.com/sota/visual-object-tracking-on-davis-2017?p=mobilevos-real-time-video-object-segmentation)`

MobileVOS: Real-Time Video Object Segmentation Contrastive Learning meets Knowledge Distillation

CVPR 2023 · Roy Miles, Mehmet Kerim Yucel, Bruno Manganelli, Albert Saa-Garriga ·

This paper tackles the problem of semi-supervised video object segmentation on resource-constrained devices, such as mobile phones. We formulate this problem as a distillation task, whereby we demonstrate that small space-time-memory networks with finite memory can achieve competitive results with state of the art, but at a fraction of the computational cost (32 milliseconds per frame on a Samsung Galaxy S22). Specifically, we provide a theoretically grounded framework that unifies knowledge distillation with supervised contrastive representation learning. These models are able to jointly benefit from both pixel-wise contrastive learning and distillation from a pre-trained teacher. We validate this loss by achieving competitive J&F to state of the art on both the standard DAVIS and YouTube benchmarks, despite running up to 5x faster, and with 32x fewer parameters.

PDF Abstract CVPR 2023 PDF CVPR 2023 Abstract

Code

Add Remove Mark official

No code implementations yet. Submit your code now

Tasks

Add Remove

Contrastive Learning

Knowledge Distillation

Representation Learning

Semantic Segmentation

Semi-Supervised Video Object Segmentation

Video Object Segmentation

Video Semantic Segmentation

Datasets

DAVIS

DAVIS 2017

DAVIS 2016

YouTube-VOS 2018

Referring Expressions for DAVIS 2016 & 2017

BL30K

Results from the Paper

Edit

Ranked #6 on Video Object Segmentation on YouTube-VOS 2019

Get a GitHub badge

Task	Dataset	Model	Metric Name	Metric Value	Global Rank	Benchmark
Semi-Supervised Video Object Segmentation	DAVIS 2016	MobileVOS	Jaccard (Mean)	89.7	# 22	Compare
			F-measure (Mean)	91.6	# 27	Compare
			J&F	90.6	# 26	Compare
			Speed (FPS)	100.1	# 1	Compare
Semi-Supervised Video Object Segmentation	DAVIS 2016	MobileVOS (BL30K)	Jaccard (Mean)	90.3	# 19	Compare
			F-measure (Mean)	92.6	# 20	Compare
			J&F	91.4	# 20	Compare
			Speed (FPS)	100.1	# 1	Compare
Video Object Segmentation	DAVIS 2016	MobileVOS (val)	Jaccard (Mean)	90.3	# 6	Compare
			F-Score	92.6	# 7	Compare
			J&F	91.4	# 6	Compare
Semi-Supervised Video Object Segmentation	DAVIS 2017 (val)	MobileVOS	F-measure (Mean)	87.1	# 28	Compare
			J&F	80.2	# 43	Compare
			Speed (FPS)	90.6	# 1	Compare
			Params(M)	8.1	# 4	Compare
Semi-Supervised Video Object Segmentation	DAVIS 2017 (val)	MobileVOS (BL30K)	F-measure (Mean)	88.9	# 16	Compare
			J&F	82.3	# 36	Compare
			Speed (FPS)	90.6	# 1	Compare
			Params(M)	8.1	# 4	Compare
Video Object Segmentation	YouTube-VOS 2019	MobileVOS	Mean Jaccard & F-Measure	83.3	# 6	Compare
			Jaccard (Seen)	83.2	# 5	Compare
			Jaccard (Unseen)	76.9	# 8	Compare
			F-Measure (Seen)	87.7	# 5	Compare
			F-Measure (Unseen)	85.3	# 7	Compare

Methods

Add Remove

Contrastive Learning • Knowledge Distillation

Edit Social Preview

MobileVOS: Real-Time Video Object Segmentation Contrastive Learning meets Knowledge Distillation

Code Edit Add Remove Mark official

Tasks Edit Add Remove

Datasets Edit

Results from the Paper Edit

Methods Edit Add Remove

Code

Add Remove Mark official

Tasks

Add Remove

Datasets

Results from the Paper

Edit

Methods

Add Remove