TASK	DATASET	MODEL	METRIC NAME	METRIC VALUE	GLOBAL RANK
Image-to-Image Translation	BRATS	ResViT	PSNR	26.90	# 1
Image-to-Image Translation	IXI	TransUNet	PSNR	32.49 ± 1.74	# 5
Image-to-Image Translation	IXI	ResViT	PSNR	35.71 ± 1.77	# 1
Image-to-Image Translation	IXI	A-UNet	PSNR	32.43 ± 1.74	# 6
Image-to-Image Translation	IXI	pix2pix	PSNR	33.62 ± 2.07	# 4
Image-to-Image Translation	IXI	pGAN	PSNR	33.95 ± 1.67	# 2
Image-to-Image Translation	IXI	SAGAN	PSNR	33.71 ± 1.61	# 3

Badge	Markdown
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/resvit-residual-vision-transformers-for-multi/image-to-image-translation-on-brats)](https://paperswithcode.com/sota/image-to-image-translation-on-brats?p=resvit-residual-vision-transformers-for-multi)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/resvit-residual-vision-transformers-for-multi/image-to-image-translation-on-ixi-dataset)](https://paperswithcode.com/sota/image-to-image-translation-on-ixi-dataset?p=resvit-residual-vision-transformers-for-multi)`

ResViT: Residual vision transformers for multi-modal medical image synthesis

30 Jun 2021 · Onat Dalmaz, Mahmut Yurt, Tolga Çukur ·

Generative adversarial models with convolutional neural network (CNN) backbones have recently been established as state-of-the-art in numerous medical image synthesis tasks. However, CNNs are designed to perform local processing with compact filters, and this inductive bias compromises learning of contextual features. Here, we propose a novel generative adversarial approach for medical image synthesis, ResViT, that leverages the contextual sensitivity of vision transformers along with the precision of convolution operators and realism of adversarial learning.} ResViT's generator employs a central bottleneck comprising novel aggregated residual transformer (ART) blocks that synergistically combine residual convolutional and transformer modules. Residual connections in ART blocks promote diversity in captured representations, while a channel compression module distills task-relevant information. A weight sharing strategy is introduced among ART blocks to mitigate computational burden. A unified implementation is introduced to avoid the need to rebuild separate synthesis models for varying source-target modality configurations. Comprehensive demonstrations are performed for synthesizing missing sequences in multi-contrast MRI, and CT images from MRI. Our results indicate superiority of ResViT against competing CNN- and transformer-based methods in terms of qualitative observations and quantitative metrics.

PDF Abstract

Code

Add Remove Mark official

icon-lab/ResViT official

118

CV-Reimplementation/ResViT-Reimplem…

Tasks

Add Remove

Image Generation

Image-to-Image Translation

Inductive Bias

Datasets

IXI

Results from the Paper

Edit

Ranked #1 on Image-to-Image Translation on IXI

Get a GitHub badge

Task	Dataset	Model	Metric Name	Metric Value	Global Rank	Benchmark
Image-to-Image Translation	BRATS	ResViT	PSNR	26.90	# 1	Compare
Image-to-Image Translation	IXI	TransUNet	PSNR	32.49 ± 1.74	# 5	Compare
Image-to-Image Translation	IXI	ResViT	PSNR	35.71 ± 1.77	# 1	Compare
Image-to-Image Translation	IXI	A-UNet	PSNR	32.43 ± 1.74	# 6	Compare
Image-to-Image Translation	IXI	pix2pix	PSNR	33.62 ± 2.07	# 4	Compare
Image-to-Image Translation	IXI	pGAN	PSNR	33.95 ± 1.67	# 2	Compare
Image-to-Image Translation	IXI	SAGAN	PSNR	33.71 ± 1.61	# 3	Compare

Methods

Add Remove

Convolution

Edit Social Preview

ResViT: Residual vision transformers for multi-modal medical image synthesis

Code Edit Add Remove Mark official

Tasks Edit Add Remove

Datasets Edit

Results from the Paper Edit

Methods Edit Add Remove

Code

Add Remove Mark official

Tasks

Add Remove

Datasets

Results from the Paper

Edit

Methods

Add Remove