TASK	DATASET	MODEL	METRIC NAME	METRIC VALUE	GLOBAL RANK
Object Detection	GEN1 Detection	S5-ViT-B	mAP	47.4	# 3
Object Detection	GEN1 Detection	S5-ViT-B	Params	18.2	# 5
Object Detection	GEN1 Detection	S4D-ViT-B	mAP	46.2	# 7
Object Detection	GEN1 Detection	S4D-ViT-B	Params	16.5	# 4

Badge	Markdown
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/state-space-models-for-event-cameras/object-detection-on-gen1-detection)](https://paperswithcode.com/sota/object-detection-on-gen1-detection?p=state-space-models-for-event-cameras)`

State Space Models for Event Cameras

23 Feb 2024 · Nikola Zubić, Mathias Gehrig, Davide Scaramuzza ·

Today, state-of-the-art deep neural networks that process event-camera data first convert a temporal window of events into dense, grid-like input representations. As such, they exhibit poor generalizability when deployed at higher inference frequencies (i.e., smaller temporal windows) than the ones they were trained on. We address this challenge by introducing state-space models (SSMs) with learnable timescale parameters to event-based vision. This design adapts to varying frequencies without the need to retrain the network at different frequencies. Additionally, we investigate two strategies to counteract aliasing effects when deploying the model at higher frequencies. We comprehensively evaluate our approach against existing methods based on RNN and Transformer architectures across various benchmarks, including Gen1 and 1 Mpx event camera datasets. Our results demonstrate that SSM-based models train 33% faster and also exhibit minimal performance degradation when tested at higher frequencies than the training input. Traditional RNN and Transformer models exhibit performance drops of more than 20 mAP, with SSMs having a drop of 3.76 mAP, highlighting the effectiveness of SSMs in event-based vision tasks.

PDF Abstract

Code

Add Remove Mark official

uzh-rpg/ssms_event_cameras official

lindermanlab/S5

217

Tasks

Add Remove

Event-based vision

Object Detection

Datasets

GEN1 Detection Prophesee GEN4 Dataset

Results from the Paper

Add Remove

Ranked #3 on Object Detection on GEN1 Detection

Get a GitHub badge

Task	Dataset	Model	Metric Name	Metric Value	Global Rank	Benchmark
Object Detection	GEN1 Detection	S5-ViT-B	mAP	47.4	# 3	Compare
Object Detection	GEN1 Detection	S5-ViT-B	Params	18.2	# 5	Compare
Object Detection	GEN1 Detection	S4D-ViT-B	mAP	46.2	# 7	Compare
Object Detection	GEN1 Detection	S4D-ViT-B	Params	16.5	# 4	Compare

Methods

Add Remove

Absolute Position Encodings • Adam • BPE • Dense Connections • Dropout • Label Smoothing • Layer Normalization • Linear Layer • Multi-Head Attention • Position-Wise Feed-Forward Layer • Residual Connection • Scaled Dot-Product Attention • Softmax • Transformer

Edit Social Preview

State Space Models for Event Cameras

Code Edit Add Remove Mark official

Tasks Edit Add Remove

Datasets Edit

Results from the Paper Edit Add Remove

Methods Edit Add Remove

Code

Add Remove Mark official

Tasks

Add Remove

Datasets

Results from the Paper

Add Remove

Methods

Add Remove