no code implementations • ECCV 2020 • Henghui Ding, Scott Cohen, Brian Price, Xudong Jiang
We propose to employ phrase expressions as another interaction input to infer the attributes of the target object.
no code implementations • 15 Mar 2024 • Yizhi Song, Zhifei Zhang, Zhe Lin, Scott Cohen, Brian Price, Jianming Zhang, Soo Ye Kim, He Zhang, Wei Xiong, Daniel Aliaga
Generative object compositing emerges as a promising new avenue for compositional image editing.
no code implementations • 6 Nov 2023 • Hanrong Ye, Jason Kuen, Qing Liu, Zhe Lin, Brian Price, Dan Xu
On the highly competitive ADE20K and COCO benchmarks, our data generation method markedly improves the performance of state-of-the-art segmentation models in semantic segmentation, panoptic segmentation, and instance segmentation.
1 code implementation • 19 Oct 2023 • Ho Kei Cheng, Seoung Wug Oh, Brian Price, Joon-Young Lee, Alexander Schwing
We present Cutie, a video object segmentation (VOS) network with object-level memory reading, which puts the object representation from memory back into the video object segmentation result.
Ranked #1 on Semi-Supervised Video Object Segmentation on MOSE
1 code implementation • ICCV 2023 • Ho Kei Cheng, Seoung Wug Oh, Brian Price, Alexander Schwing, Joon-Young Lee
To 'track anything' without training on video data for every individual task, we develop a decoupled video segmentation approach (DEVA), composed of task-specific image-level segmentation and class/task-agnostic bi-directional temporal propagation.
Ranked #1 on Unsupervised Video Object Segmentation on DAVIS 2016 val (using extra training data)
no code implementations • 20 Jul 2023 • Josh Myers-Dean, Yifei Fan, Brian Price, Wilson Chan, Danna Gurari
Interactive segmentation entails a human marking an image to guide how a model either creates or edits a segmentation.
1 code implementation • CVPR 2023 • Tai-Yu Pan, Qing Liu, Wei-Lun Chao, Brian Price
Second, we introduce a novel approach to improve part segmentation on unseen objects, inspired by an interesting finding: for unseen objects, the pixel-wise features extracted by the model often reveal high-quality part segments.
no code implementations • CVPR 2023 • Hoang M. Le, Brian Price, Scott Cohen, Michael S. Brown
Inspired by neural implicit representations for 2D images, we propose a method that optimizes a lightweight multi-layer-perceptron (MLP) model during the gamut reduction step to predict the clipped values.
1 code implementation • 12 Apr 2023 • Jiabao Ji, Guanhua Zhang, Zhaowen Wang, Bairu Hou, Zhifei Zhang, Brian Price, Shiyu Chang
Scene text editing is a challenging task that involves modifying or inserting specified texts in an image while maintaining its natural and realistic appearance.
no code implementations • CVPR 2023 • Yizhi Song, Zhifei Zhang, Zhe Lin, Scott Cohen, Brian Price, Jianming Zhang, Soo Ye Kim, Daniel Aliaga
Object compositing based on 2D images is a challenging problem since it typically involves multiple processing stages such as color harmonization, geometry correction and shadow generation to generate realistic results.
1 code implementation • 2 Dec 2022 • Yizhi Song, Zhifei Zhang, Zhe Lin, Scott Cohen, Brian Price, Jianming Zhang, Soo Ye Kim, Daniel Aliaga
Object compositing based on 2D images is a challenging problem since it typically involves multiple processing stages such as color harmonization, geometry correction and shadow generation to generate realistic results.
1 code implementation • 27 Jul 2022 • Hongje Seong, Seoung Wug Oh, Brian Price, Euntai Kim, Joon-Young Lee
A key to OTVM is the joint modeling of trimap propagation and alpha prediction.
no code implementations • CVPR 2022 • Yutong Dai, Brian Price, He Zhang, Chunhua Shen
Deep image matting methods have achieved increasingly better results on benchmarks (e.g., Composition-1k/alphamatting.com).
no code implementations • CVPR 2022 • Fanqing Lin, Brian Price, Tony Martinez
Recently, the feature backpropagating refinement scheme (f-BRS) has been proposed for the task of interactive segmentation, which enables efficient optimization of a small set of auxiliary variables inserted into the pretrained network to produce object segmentation that better aligns with user inputs.
no code implementations • 21 Dec 2021 • Josh Myers-Dean, Yinan Zhao, Brian Price, Scott Cohen, Danna Gurari
Generalized few-shot semantic segmentation was introduced to move beyond only evaluating few-shot segmentation models on novel classes to include testing their ability to remember base classes.
1 code implementation • CVPR 2021 • Xingqian Xu, Zhifei Zhang, Zhaowen Wang, Brian Price, Zhonghao Wang, Humphrey Shi
We also introduce Text Refinement Network (TexRNet), a novel text segmentation approach that adapts to the unique properties of text, e.g., non-convex boundary, diverse texture, etc., which often impose burdens on traditional segmentation models.
1 code implementation • 1 Sep 2020 • Brian Davis, Chris Tensmeyer, Brian Price, Curtis Wigington, Bryan Morse, Rajiv Jain
This paper presents a GAN for generating images of handwritten lines conditioned on arbitrary text and latent style vectors.
1 code implementation • 6 Apr 2020 • Yinan Zhao, Brian Price, Scott Cohen, Danna Gurari
We demonstrate how to increase overall model capacity to achieve improved performance, by introducing objectness, which is class-agnostic and so not prone to overfitting, for complementary use with class-specific features.
no code implementations • 25 Mar 2020 • Peng Zhou, Brian Price, Scott Cohen, Gregg Wilensky, Larry S. Davis
In this paper, we target refining the boundaries in high resolution images given low resolution masks.
3 code implementations • 17 Mar 2020 • Marco Forte, Brian Price, Scott Cohen, Ning Xu, François Pitié
We propose a novel interactive architecture and a novel training scheme that are both tailored to better exploit the user workflow.
3 code implementations • 5 Sep 2019 • Brian Davis, Bryan Morse, Scott Cohen, Brian Price, Chris Tensmeyer
Automatic, template-free extraction of information from form images is challenging due to the variety of form layouts.
no code implementations • ICCV 2019 • Yinan Zhao, Brian Price, Scott Cohen, Danna Gurari
Many people search for foreground objects to use when editing images.
1 code implementation • 5 Aug 2019 • Kushal Kafle, Robik Shrestha, Brian Price, Scott Cohen, Christopher Kanan
Chart question answering (CQA) is a newly proposed visual question answering (VQA) task where an algorithm must answer questions about data visualizations, e.g., bar charts, pie charts, and line graphs.
1 code implementation • Eurographics 2019 - Short Papers • Mahmoud Afifi, Brian Price, Scott Cohen, Michael S. Brown
We present a method to perform automatic image recoloring based on the distribution of colors associated with objects present in an image.
no code implementations • 7 Apr 2019 • Samuel Grieggs, Bingyu Shen, Greta Rauch, Pei Li, Jiaqi Ma, David Chiang, Brian Price, Walter J. Scheirer
The subtleties of human perception, as measured by vision scientists through the use of psychophysics, are important clues to the internal workings of visual recognition.
4 code implementations • ECCV 2018 • Ning Xu, Linjie Yang, Yuchen Fan, Jianchao Yang, Dingcheng Yue, Yuchen Liang, Brian Price, Scott Cohen, Thomas Huang
End-to-end sequential learning to explore spatial-temporal features for video segmentation is largely limited by the scale of available video segmentation datasets, i.e., even the largest video segmentation dataset only contains 90 short video clips.
Ranked #12 on Video Object Segmentation on YouTube-VOS 2018 (F-Measure (Unseen) metric)
no code implementations • ECCV 2018 • Hengshuang Zhao, Xiaohui Shen, Zhe Lin, Kalyan Sunkavalli, Brian Price, Jiaya Jia
We present a new image search technique that, given a background image, returns compatible foreground objects for image compositing tasks.
no code implementations • ECCV 2018 • Hoang Le, Long Mai, Brian Price, Scott Cohen, Hailin Jin, Feng Liu
Instead of relying on pre-defined low-level image features, our method adaptively predicts object boundaries according to image content and user interactions.
1 code implementation • ECCV 2018 • Curtis Wigington, Chris Tensmeyer, Brian Davis, William Barrett, Brian Price, Scott Cohen
Despite decades of research, offline handwriting recognition (HWR) of degraded historical documents remains a challenging problem, which if solved could greatly improve the searchability of online cultural heritage archives.
Ranked #12 on Handwritten Text Recognition on IAM
no code implementations • CVPR 2018 • Andrew Gilbert, John Collomosse, Hailin Jin, Brian Price
Content-aware image completion or in-painting is a fundamental tool for the correction of defects or removal of objects in images.
no code implementations • 22 Mar 2018 • Yinan Zhao, Brian Price, Scott Cohen, Danna Gurari
Deep generative models have shown success in automatically synthesizing missing image regions using surrounding context.
1 code implementation • CVPR 2018 • Ruotian Luo, Brian Price, Scott Cohen, Gregory Shakhnarovich
One property that remains lacking in image captions generated by contemporary methods is discriminability: being able to tell two images apart given the caption for one of them.
1 code implementation • CVPR 2018 • Kushal Kafle, Brian Price, Scott Cohen, Christopher Kanan
Bar charts are an effective way to convey numeric information, but today's algorithms cannot parse them.
1 code implementation • Pacific Graphics 2017 • Rang Nguyen, Brian Price, Scott Cohen, Michael S. Brown
Methods such as color transfer are effective in making an image share similar colors with a target image; however, color transfer is not suitable for modifying multiple images.
no code implementations • 2 Jul 2017 • Ning Xu, Brian Price, Scott Cohen, Jimei Yang, Thomas Huang
In this paper, we propose a novel segmentation approach that uses a rectangle as a soft constraint by transforming it into a Euclidean distance map.
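The rectangle-to-distance-map idea can be illustrated with a minimal sketch. It assumes an unsigned distance from each pixel to the rectangle outline and an inclusive `(x0, y0, x1, y1)` box convention; the function name and conventions are hypothetical, and the paper's actual encoding (for example any truncation or inside/outside distinction) may differ.

```python
import math


def rectangle_distance_map(height, width, box):
    """Return a height x width grid of unsigned Euclidean distances
    from each pixel to the outline of an axis-aligned rectangle.

    box is (x0, y0, x1, y1) with inclusive pixel coordinates; this
    convention is an assumption made for the illustration.
    """
    x0, y0, x1, y1 = box
    dist = [[0.0] * width for _ in range(height)]
    for y in range(height):
        for x in range(width):
            # How far the pixel lies outside the box along each axis
            # (zero when it is within the box's span on that axis).
            dx_out = max(x0 - x, x - x1, 0)
            dy_out = max(y0 - y, y - y1, 0)
            if dx_out == 0 and dy_out == 0:
                # Inside or on the box: distance to the nearest edge.
                dist[y][x] = float(min(x - x0, x1 - x, y - y0, y1 - y))
            else:
                # Outside the box: distance to the nearest outline point.
                dist[y][x] = math.hypot(dx_out, dy_out)
    return dist


dmap = rectangle_distance_map(10, 10, (2, 3, 6, 7))
```

A map like this could be stacked with the RGB channels as an extra network input, so that the rectangle acts as a soft hint for the segmentation model and not as a hard crop.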
no code implementations • CVPR 2017 • Huixuan Tang, Scott Cohen, Brian Price, Stephen Schiller, Kiriakos N. Kutulakos
We consider the problem of two-frame depth from defocus in conditions unsuitable for existing methods yet typical of everyday photography: a handheld cellphone camera, a small aperture, a non-stationary scene and sparse surface texture.
no code implementations • CVPR 2017 • Yu-Wei Chao, Jimei Yang, Brian Price, Scott Cohen, Jia Deng
This paper presents the first study on forecasting human dynamics from static images.
8 code implementations • CVPR 2017 • Ning Xu, Brian Price, Scott Cohen, Thomas Huang
We evaluate our algorithm on the image matting benchmark, our testing set, and a wide variety of real images.
no code implementations • NeurIPS 2016 • Peng Wang, Xiaohui Shen, Bryan Russell, Scott Cohen, Brian Price, Alan L. Yuille
This paper introduces an approach to regularize 2.5D surface normal and depth predictions at each pixel given a single input image.
no code implementations • CVPR 2015 • Jianming Zhang, Shugao Ma, Mehrnoosh Sameki, Stan Sclaroff, Margrit Betke, Zhe Lin, Xiaohui Shen, Brian Price, Radomir Mech
We study the problem of Salient Object Subitizing, i.e., predicting the existence and the number of salient objects in an image using holistic cues.
1 code implementation • CVPR 2016 • Jianming Zhang, Stan Sclaroff, Zhe Lin, Xiaohui Shen, Brian Price, Radomir Mech
Our system leverages a Convolutional-Neural-Network model to generate location proposals of salient objects.
no code implementations • CVPR 2016 • Dongliang Cheng, Abdelrahman Abdelhamed, Brian Price, Scott Cohen, Michael S. Brown
Existing methods attempt to estimate a spatially varying illumination map; however, results are error-prone and the resulting illumination maps are too low-resolution to be used for proper spatially varying white-balance correction.
no code implementations • CVPR 2016 • Jie Feng, Brian Price, Scott Cohen, Shih-Fu Chang
While these methods achieve better results than color-based methods, they are still limited in either using depth as an additional color channel or simply combining depth with color in a linear way.
no code implementations • WS 2016 • Mohamed Elhoseiny, Scott Cohen, Walter Chang, Brian Price, Ahmed Elgammal
Motivated by the application of fact-level image understanding, we present an automatic method for data collection of structured visual facts from images with captions.
3 code implementations • CVPR 2016 • Jimei Yang, Brian Price, Scott Cohen, Honglak Lee, Ming-Hsuan Yang
We develop a deep learning algorithm for contour detection with a fully convolutional encoder-decoder network.
3 code implementations • CVPR 2016 • Ning Xu, Brian Price, Scott Cohen, Jimei Yang, Thomas Huang
Interactive object selection is a very important research problem and has many applications.
Ranked #11 on Interactive Segmentation on SBD
no code implementations • ICCV 2015 • Dongliang Cheng, Brian Price, Scott Cohen, Michael S. Brown
A limitation in color constancy research is the inability to establish ground truth colors for evaluating corrected images.
no code implementations • ICCV 2015 • Jianming Zhang, Stan Sclaroff, Zhe Lin, Xiaohui Shen, Brian Price, Radomir Mech
Powered by this fast MBD transform algorithm, the proposed salient object detection method runs at 80 FPS, and significantly outperforms previous methods with similar speed on four large benchmark datasets, and achieves comparable or better performance than state-of-the-art methods.
Ranked #6 on Video Salient Object Detection on VOS-T (using extra training data)
no code implementations • 16 Nov 2015 • Mohamed Elhoseiny, Scott Cohen, Walter Chang, Brian Price, Ahmed Elgammal
We show that learning visual facts in a structured way enables not only a uniform but also generalizable visual understanding.
no code implementations • 17 Aug 2015 • Hongyang Li, Huchuan Lu, Zhe Lin, Xiaohui Shen, Brian Price
In this paper, we propose a novel deep neural network framework embedded with low-level features (LCNN) for salient object detection in complex images.
no code implementations • CVPR 2015 • Peng Wang, Xiaohui Shen, Zhe Lin, Scott Cohen, Brian Price, Alan L. Yuille
By allowing for interactions between the depth and semantic information, the joint network provides more accurate depth prediction than a state-of-the-art CNN trained solely for depth prediction [5].
no code implementations • CVPR 2015 • Dongliang Cheng, Brian Price, Scott Cohen, Michael S. Brown
More recent state-of-the-art methods employ learning-based techniques that produce better results, but often rely on complex features and have long evaluation and training times.
no code implementations • CVPR 2015 • Jimei Yang, Brian Price, Scott Cohen, Zhe Lin, Ming-Hsuan Yang
The transferred local shape masks constitute a patch-level segmentation solution space and we thus develop a novel cascade algorithm, PatchCut, for coarse-to-fine object segmentation.
2 code implementations • 27 May 2015 • Hongyang Li, Huchuan Lu, Zhe Lin, Xiaohui Shen, Brian Price
For most natural images, some boundary superpixels serve as the background labels and the saliency of other superpixels is determined by ranking their similarities to the boundary labels based on an inner propagation scheme.
no code implementations • ICCV 2015 • Peng Wang, Xiaohui Shen, Zhe Lin, Scott Cohen, Brian Price, Alan Yuille
Segmenting semantic objects from images and parsing them into their respective semantic parts are fundamental steps towards detailed object understanding in computer vision.
no code implementations • CVPR 2014 • Ejaz Ahmed, Scott Cohen, Brian Price
With the tag provided by the user we do a text query of an image database to gather exemplars of the object.
no code implementations • CVPR 2014 • Jimei Yang, Brian Price, Scott Cohen, Ming-Hsuan Yang
This paper presents a scalable scene parsing algorithm based on image retrieval and superpixel matching.
no code implementations • CVPR 2013 • Ehsan Shahrian, Deepu Rajan, Brian Price, Scott Cohen
The first is that the range in which the foreground and background are sampled is often limited to such an extent that the true foreground and background colors are not present.