Search Results for author: Vidit Goel

Found 10 papers, 7 papers with code

OpenBias: Open-set Bias Detection in Text-to-Image Generative Models

1 code implementation • 11 Apr 2024 • Moreno D'Incà, Elia Peruzzo, Massimiliano Mancini, Dejia Xu, Vidit Goel, Xingqian Xu, Zhangyang Wang, Humphrey Shi, Nicu Sebe

In this paper, we tackle the challenge of open-set bias detection in text-to-image generative models presenting OpenBias, a new pipeline that identifies and quantifies the severity of biases agnostically, without access to any precompiled set.

Bias Detection Fairness +3

Paper
Code

VASE: Object-Centric Appearance and Shape Manipulation of Real Videos

no code implementations • 4 Jan 2024 • Elia Peruzzo, Vidit Goel, Dejia Xu, Xingqian Xu, Yifan Jiang, Zhangyang Wang, Humphrey Shi, Nicu Sebe

Recently, several works tackled the video editing task fostered by the success of large-scale text-to-image generative models.

Video Editing

Paper
Add Code

Video Instance Matting

1 code implementation • 7 Nov 2023 • Jiachen Li, Roberto Henschel, Vidit Goel, Marianna Ohanyan, Shant Navasardyan, Humphrey Shi

To remedy this deficiency, we propose Video Instance Matting~(VIM), that is, estimating alpha mattes of each instance at each frame of a video sequence.

Binarization Image Matting +4

Paper
Code

Interactive Neural Painting

no code implementations • 31 Jul 2023 • Elia Peruzzo, Willi Menapace, Vidit Goel, Federica Arrigoni, Hao Tang, Xingqian Xu, Arman Chopikyan, Nikita Orlov, Yuxiao Hu, Humphrey Shi, Nicu Sebe, Elisa Ricci

This paper advances the state of the art in this emerging research domain by proposing the first approach for Interactive NP.

Decoder

Paper
Add Code

PAIR-Diffusion: A Comprehensive Multimodal Object-Level Image Editor

1 code implementation • 30 Mar 2023 • Vidit Goel, Elia Peruzzo, Yifan Jiang, Dejia Xu, Xingqian Xu, Nicu Sebe, Trevor Darrell, Zhangyang Wang, Humphrey Shi

We propose PAIR Diffusion, a generic framework that can enable a diffusion model to control the structure and appearance properties of each object in the image.

Object

478

Paper
Code

VMFormer: End-to-End Video Matting with Transformer

1 code implementation • 26 Aug 2022 • Jiachen Li, Vidit Goel, Marianna Ohanyan, Shant Navasardyan, Yunchao Wei, Humphrey Shi

In this paper, we propose VMFormer: a transformer-based end-to-end method for video matting.

Decoder Video Matting

103

Paper
Code

VideoINR: Learning Video Implicit Neural Representation for Continuous Space-Time Super-Resolution

1 code implementation • CVPR 2022 • Zeyuan Chen, Yinbo Chen, Jingwen Liu, Xingqian Xu, Vidit Goel, Zhangyang Wang, Humphrey Shi, Xiaolong Wang

The learned implicit neural representation can be decoded to videos of arbitrary spatial resolution and frame rate.

Space-time Video Super-resolution Video Frame Interpolation +1

258

Paper
Code

MSN: Efficient Online Mask Selection Network for Video Instance Segmentation

1 code implementation • 19 Jun 2021 • Vidit Goel, Jiachen Li, Shubhika Garg, Harsh Maheshwari, Humphrey Shi

Our method improves the masks from segmentation and propagation branches in an online manner using the Mask Selection Network (MSN) hence limiting the noise accumulation during mask tracking.

Ranked #26 on Video Instance Segmentation on YouTube-VIS validation

Instance Segmentation Segmentation +4

Paper
Code

Mask Selection and Propagation for Unsupervised Video Object Segmentation

1 code implementation • 5 Jan 2021 • Shubhika Garg, Vidit Goel

We efficiently handle problems present in existing methods such as drift while temporal propagation, tracking and addition of new objects.

Ranked #1 on Unsupervised Video Object Segmentation on SegTrack v2

Segmentation Semantic Segmentation +2

Paper
Code

IROS 2019 Lifelong Robotic Vision Challenge -- Lifelong Object Recognition Report

no code implementations • 26 Apr 2020 • Qi She, Fan Feng, Qi Liu, Rosa H. M. Chan, Xinyue Hao, Chuanlin Lan, Qihan Yang, Vincenzo Lomonaco, German I. Parisi, Heechul Bae, Eoin Brophy, Baoquan Chen, Gabriele Graffieti, Vidit Goel, Hyonyoung Han, Sathursan Kanagarajah, Somesh Kumar, Siew-Kei Lam, Tin Lun Lam, Liang Ma, Davide Maltoni, Lorenzo Pellegrini, Duvindu Piyasena, ShiLiang Pu, Debdoot Sheet, Soonyong Song, Youngsung Son, Zhengwei Wang, Tomas E. Ward, Jianwen Wu, Meiqing Wu, Di Xie, Yangsheng Xu, Lin Yang, Qiaoyong Zhong, Liguang Zhou

This report summarizes IROS 2019-Lifelong Robotic Vision Competition (Lifelong Object Recognition Challenge) with methods and results from the top $8$ finalists (out of over~$150$ teams).

Continual Learning Object +1

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.