Event-Based Video Reconstruction Using Transformer
Event cameras, which output events by detecting spatio-temporal brightness changes, bring a novel paradigm to image sensors, offering high dynamic range and low latency. Previous works have achieved impressive performance on event-based video reconstruction by introducing convolutional neural networks (CNNs). However, the intrinsic locality of convolutional operations limits their ability to model long-range dependencies, which are crucial to many vision tasks. In this paper, we present a hybrid CNN-Transformer network for event-based video reconstruction (ET-Net), which combines the fine local information from CNNs with the global context from Transformers. In addition, we propose a Token Pyramid Aggregation strategy that implements multi-scale token integration, relating internal and intersecting semantic concepts in the token space. Experimental results demonstrate that our proposed method achieves superior performance over state-of-the-art methods on multiple real-world event datasets. The code is available at https://github.com/WarranWeng/ET-Net
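The multi-scale token idea can be illustrated with a minimal sketch: pool a feature map at several scales, flatten each pooled map into tokens, and concatenate the sequences so that attention can relate concepts across scales. This is a hypothetical toy illustration in plain Python, not the authors' implementation (which operates on learned CNN features in PyTorch); the function names `avg_pool` and `token_pyramid` are ours.

```python
# Toy sketch of multi-scale token aggregation (illustrative, not the ET-Net code).
# A 2D feature map is average-pooled at several scales; each pooled map is
# flattened into tokens and the sequences are concatenated, giving a single
# token set that mixes fine and coarse spatial context.

def avg_pool(feat, k):
    """Average-pool an HxW map (list of lists) with kernel and stride k."""
    h, w = len(feat), len(feat[0])
    pooled = []
    for i in range(0, h, k):
        row = []
        for j in range(0, w, k):
            block = [feat[i + di][j + dj] for di in range(k) for dj in range(k)]
            row.append(sum(block) / len(block))
        pooled.append(row)
    return pooled

def token_pyramid(feat, scales=(1, 2, 4)):
    """Flatten the pooled map at each scale into one concatenated token list."""
    tokens = []
    for k in scales:
        pooled = avg_pool(feat, k)
        tokens.extend(v for row in pooled for v in row)
    return tokens

# A 4x4 map yields 16 + 4 + 1 = 21 tokens across scales 1, 2, 4.
feat = [[float(i * 4 + j) for j in range(4)] for i in range(4)]
print(len(token_pyramid(feat)))  # 21
```

In the actual network, each scale's tokens carry positional embeddings and the concatenated sequence is fed to the Transformer, so attention weights can link a coarse token to the fine tokens it covers.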
Task | Dataset | Model | Metric Name | Metric Value | Global Rank
---|---|---|---|---|---
Event-based Object Segmentation | DDD17-SEG | ETNet | mIoU | 0.34 | # 2
Event-based Object Segmentation | DSEC-SEG | ETNet | mIoU | 0.36 | # 2
Video Reconstruction | Event-Camera Dataset | ET-Net | Mean Squared Error | 0.047 | # 2
Video Reconstruction | Event-Camera Dataset | ET-Net | LPIPS | 0.224 | # 2
Video Reconstruction | MVSEC | ET-Net | Mean Squared Error | 0.107 | # 2
Video Reconstruction | MVSEC | ET-Net | LPIPS | 0.489 | # 2
Event-based Object Segmentation | MVSEC-SEG | ETNet | mIoU | 0.37 | # 2
Event-based Object Segmentation | RGBE-SEG | ETNet | mIoU | 0.35 | # 2