Sparse4D: Multi-view 3D Object Detection with Sparse Spatial-Temporal Fusion

19 Nov 2022  ·  Xuewu Lin, Tianwei Lin, Zixiang Pei, Lichao Huang, Zhizhong Su ·

Bird's-eye-view (BEV) based methods have recently made great progress in the multi-view 3D detection task. Compared with BEV-based methods, sparse-based methods lag behind in performance but still have many non-negligible merits. To push sparse 3D detection further, in this work we introduce a novel method, named Sparse4D, which iteratively refines anchor boxes by sparsely sampling and fusing spatial-temporal features. (1) Sparse 4D Sampling: for each 3D anchor, we assign multiple 4D keypoints, which are then projected onto multi-view/scale/timestamp image features to sample the corresponding features. (2) Hierarchy Feature Fusion: we hierarchically fuse the sampled features across different views/scales, different timestamps, and different keypoints to generate a high-quality instance feature. In this way, Sparse4D can efficiently and effectively achieve 3D detection without relying on dense view transformation or global attention, and is more friendly to deployment on edge devices. Furthermore, we introduce an instance-level depth reweight module to alleviate the ill-posed nature of 3D-to-2D projection. In our experiments, Sparse4D outperforms all sparse-based methods and most BEV-based methods on the detection task of the nuScenes dataset.
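The sparse sampling idea above — projecting a set of 3D keypoints into each camera's image plane and reading off features there, instead of building a dense BEV grid — can be sketched as follows. This is a minimal NumPy illustration, not the paper's implementation: the pinhole projection, the function names (`project_points`, `bilinear_sample`), and the matrix conventions (world-to-camera extrinsics, `(H, W, C)` feature maps) are all assumptions for the sake of the example.

```python
import numpy as np

def project_points(points_3d, intrinsics, extrinsics):
    """Project 3D keypoints (N, 3) into one camera's pixel plane.

    intrinsics: (3, 3) camera matrix; extrinsics: (4, 4) world-to-camera.
    Returns (N, 2) pixel coordinates and an (N,) mask of points in front
    of the camera. (Conventions are assumed, not taken from the paper.)
    """
    homo = np.concatenate([points_3d, np.ones((len(points_3d), 1))], axis=1)
    cam = (extrinsics @ homo.T).T[:, :3]          # world -> camera frame
    valid = cam[:, 2] > 1e-5                      # positive depth only
    pix = (intrinsics @ cam.T).T
    pix = pix[:, :2] / np.clip(pix[:, 2:3], 1e-5, None)  # perspective divide
    return pix, valid

def bilinear_sample(feat_map, pix):
    """Bilinearly sample an (H, W, C) feature map at (N, 2) pixel locations."""
    h, w, _ = feat_map.shape
    x = np.clip(pix[:, 0], 0, w - 1.001)
    y = np.clip(pix[:, 1], 0, h - 1.001)
    x0, y0 = x.astype(int), y.astype(int)
    dx, dy = (x - x0)[:, None], (y - y0)[:, None]
    f00 = feat_map[y0, x0]                        # four neighbouring texels
    f01 = feat_map[y0, x0 + 1]
    f10 = feat_map[y0 + 1, x0]
    f11 = feat_map[y0 + 1, x0 + 1]
    return (f00 * (1 - dx) * (1 - dy) + f01 * dx * (1 - dy)
            + f10 * (1 - dx) * dy + f11 * dx * dy)
```

In the full method, this sampling runs per view, per scale, and per timestamp, and the resulting per-keypoint features are then fused hierarchically into one instance feature; the sketch only covers the single-camera, single-scale sampling step.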

| Task | Dataset | Model | Metric Name | Metric Value | Global Rank |
| --- | --- | --- | --- | --- | --- |
| Robust Camera Only 3D Object Detection | nuScenes-C | Sparse4D (r101) | mean Corruption Error (mCE) | 100.01 | #10 |
| Robust Camera Only 3D Object Detection | nuScenes-C | Sparse4D (r101) | mean Resilience Rate (mRR) | 55.04 | #16 |
