Search Results for author: Dhruv Mahajan

Found 36 papers, 16 papers with code

Context Diffusion: In-Context Aware Image Generation

no code implementations • 6 Dec 2023 • Ivona Najdenkoska, Animesh Sinha, Abhimanyu Dubey, Dhruv Mahajan, Vignesh Ramanathan, Filip Radenovic

We propose Context Diffusion, a diffusion-based framework that enables image generation models to learn from visual examples presented in context.

Image Generation In-Context Learning

Paper
Add Code

Text-to-Sticker: Style Tailoring Latent Diffusion Models for Human Expression

no code implementations • 17 Nov 2023 • Animesh Sinha, Bo Sun, Anmol Kalia, Arantxa Casanova, Elliot Blanchard, David Yan, Winnie Zhang, Tony Nelli, Jiahui Chen, Hardik Shah, Licheng Yu, Mitesh Kumar Singh, Ankit Ramchandani, Maziar Sanjabi, Sonal Gupta, Amy Bearman, Dhruv Mahajan

Evaluation results show our method improves visual quality by 14%, prompt alignment by 16. 2% and scene diversity by 15. 3%, compared to prompt engineering the base Emu model for stickers generation.

Image Generation Prompt Engineering

Paper
Add Code

Emu: Enhancing Image Generation Models Using Photogenic Needles in a Haystack

no code implementations • 27 Sep 2023 • Xiaoliang Dai, Ji Hou, Chih-Yao Ma, Sam Tsai, Jialiang Wang, Rui Wang, Peizhao Zhang, Simon Vandenhende, Xiaofang Wang, Abhimanyu Dubey, Matthew Yu, Abhishek Kadian, Filip Radenovic, Dhruv Mahajan, Kunpeng Li, Yue Zhao, Vladan Petrovic, Mitesh Kumar Singh, Simran Motwani, Yi Wen, Yiwen Song, Roshan Sumbaly, Vignesh Ramanathan, Zijian He, Peter Vajda, Devi Parikh

Training text-to-image models with web scale image-text pairs enables the generation of a wide range of visual concepts from text.

Image Generation

Paper
Add Code

Filtering, Distillation, and Hard Negatives for Vision-Language Pre-Training

1 code implementation • CVPR 2023 • Filip Radenovic, Abhimanyu Dubey, Abhishek Kadian, Todor Mihaylov, Simon Vandenhende, Yash Patel, Yi Wen, Vignesh Ramanathan, Dhruv Mahajan

Vision-language models trained with contrastive learning on large-scale noisy data are becoming increasingly popular for zero-shot recognition problems.

Contrastive Learning Text Spotting +1

117

Paper
Code

PACO: Parts and Attributes of Common Objects

1 code implementation • CVPR 2023 • Vignesh Ramanathan, Anmol Kalia, Vladan Petrovic, Yi Wen, Baixue Zheng, Baishan Guo, Rui Wang, Aaron Marquez, Rama Kovvuri, Abhishek Kadian, Amir Mousavi, Yiwen Song, Abhimanyu Dubey, Dhruv Mahajan

This motivates the need for large datasets which go beyond traditional object masks and provide richer annotations such as part masks and attributes.

2D Object Detection Attribute +1

256

Paper
Code

Neural Basis Models for Interpretability

1 code implementation • 27 May 2022 • Filip Radenovic, Abhimanyu Dubey, Dhruv Mahajan

However, these models are typically black-box deep neural networks, explained post-hoc via methods with known faithfulness limitations.

Additive models Interpretable Machine Learning

Paper
Code

Scalable Interpretability via Polynomials

1 code implementation • 27 May 2022 • Abhimanyu Dubey, Filip Radenovic, Dhruv Mahajan

We demonstrate by human subject evaluations that SPAMs are demonstrably more interpretable in practice, and are hence an effortless replacement for DNNs for creating interpretable and high-performance systems suitable for large-scale machine learning.

Additive models BIG-bench Machine Learning +1

Paper
Code

Making Heads or Tails: Towards Semantically Consistent Visual Counterfactuals

1 code implementation • 24 Mar 2022 • Simon Vandenhende, Dhruv Mahajan, Filip Radenovic, Deepti Ghadiyaram

A visual counterfactual explanation replaces image regions in a query image with regions from a distractor image such that the system's decision on the transformed image changes to the distractor class.

counterfactual Counterfactual Explanation +1

Paper
Code

Revisiting Weakly Supervised Pre-Training of Visual Perception Models

2 code implementations • CVPR 2022 • Mannat Singh, Laura Gustafson, Aaron Adcock, Vinicius de Freitas Reis, Bugra Gedik, Raj Prateek Kosaraju, Dhruv Mahajan, Ross Girshick, Piotr Dollár, Laurens van der Maaten

Model pre-training is a cornerstone of modern visual recognition systems.

Ranked #1 on Out-of-Distribution Generalization on ImageNet-W (using extra training data)

Fine-Grained Image Classification Out-of-Distribution Generalization +3

168

Paper
Code

Adaptive Methods for Aggregated Domain Generalization

1 code implementation • 9 Dec 2021 • Xavier Thomas, Dhruv Mahajan, Alex Pentland, Abhimanyu Dubey

In this paper, we propose a domain-adaptive approach to this problem, which operates in two steps: (a) we cluster training data within a carefully chosen feature space to create pseudo-domains, and (b) using these pseudo-domains we learn a domain-adaptive classifier that makes predictions using information about both the input and the pseudo-domain it belongs to.

Ranked #16 on Domain Generalization on PACS

Domain Generalization

Paper
Code

Large-Scale Attribute-Object Compositions

no code implementations • 24 May 2021 • Filip Radenovic, Animesh Sinha, Albert Gordo, Tamara Berg, Dhruv Mahajan

We study the problem of learning how to predict attribute-object compositions from images, and its generalization to unseen compositions missing from the training data.

Attribute Object

Paper
Add Code

Adaptive Methods for Real-World Domain Generalization

no code implementations • CVPR 2021 • Abhimanyu Dubey, Vignesh Ramanathan, Alex Pentland, Dhruv Mahajan

We show that the existing approaches either do not scale to this dataset or underperform compared to the simple baseline of training a model on the union of data from all training domains.

Domain Generalization

Paper
Add Code

Weakly Supervised Instance Segmentation for Videos with Temporal Mask Consistency

no code implementations • CVPR 2021 • Qing Liu, Vignesh Ramanathan, Dhruv Mahajan, Alan Yuille, Zhenheng Yang

However, existing approaches which rely only on image-level class labels predominantly suffer from errors due to (a) partial segmentation of objects and (b) missing object predictions.

Instance Segmentation Relation Network +3

Paper
Add Code

PreDet: Large-Scale Weakly Supervised Pre-Training for Detection

no code implementations • ICCV 2021 • Vignesh Ramanathan, Rui Wang, Dhruv Mahajan

State-of-the-art object detection approaches typically rely on pre-trained classification models to achieve better performance and faster convergence.

Classification Contrastive Learning +3

Paper
Add Code

What leads to generalization of object proposals?

no code implementations • 13 Aug 2020 • Rui Wang, Dhruv Mahajan, Vignesh Ramanathan

It is lucrative to train a good proposal model, that generalizes to unseen classes.

Object Object Proposal Generation

Paper
Add Code

Don't Judge an Object by Its Context: Learning to Overcome Contextual Bias

1 code implementation • CVPR 2020 • Krishna Kumar Singh, Dhruv Mahajan, Kristen Grauman, Yong Jae Lee, Matt Feiszli, Deepti Ghadiyaram

Our key idea is to decorrelate feature representations of a category from its co-occurring context.

Attribute

Paper
Code

Measuring Dataset Granularity

1 code implementation • 21 Dec 2019 • Yin Cui, Zeqi Gu, Dhruv Mahajan, Laurens van der Maaten, Serge Belongie, Ser-Nam Lim

We also investigate the interplay between dataset granularity with a variety of factors and find that fine-grained datasets are more difficult to learn from, more difficult to transfer to, more difficult to perform few-shot learning with, and more vulnerable to adversarial attacks.

Clustering Few-Shot Learning

Paper
Code

From Patches to Pictures (PaQ-2-PiQ): Mapping the Perceptual Space of Picture Quality

2 code implementations • CVPR 2020 • Zhenqiang Ying, Haoran Niu, Praful Gupta, Dhruv Mahajan, Deepti Ghadiyaram, Alan Bovik

Blind or no-reference (NR) perceptual picture quality prediction is a difficult, unsolved problem of great consequence to the social and streaming media industries that impacts billions of viewers daily.

Ranked #4 on Video Quality Assessment on MSU SR-QA Dataset

Blind Image Quality Assessment Video Quality Assessment

Paper
Code

ClusterFit: Improving Generalization of Visual Representations

1 code implementation • CVPR 2020 • Xueting Yan, Ishan Misra, Abhinav Gupta, Deepti Ghadiyaram, Dhruv Mahajan

Pre-training convolutional neural networks with weakly-supervised and self-supervised strategies is becoming increasingly popular for several computer vision tasks.

Ranked #53 on Image Classification on iNaturalist 2018

Action Classification Clustering +2

3,233

Paper
Code

Self-Supervised Learning by Cross-Modal Audio-Video Clustering

1 code implementation • NeurIPS 2020 • Humam Alwassel, Dhruv Mahajan, Bruno Korbar, Lorenzo Torresani, Bernard Ghanem, Du Tran

To the best of our knowledge, XDC is the first self-supervised learning method that outperforms large-scale fully-supervised pretraining for action recognition on the same architecture.

Ranked #2 on Self-Supervised Action Recognition on UCF101 (finetuned)

Audio Classification Clustering +5

Paper
Code

Scaling and Benchmarking Self-Supervised Visual Representation Learning

2 code implementations • ICCV 2019 • Priya Goyal, Dhruv Mahajan, Abhinav Gupta, Ishan Misra

Self-supervised learning aims to learn representations from the data itself without explicit manual supervision.

Benchmarking object-detection +5

590

Paper
Code

Large-scale weakly-supervised pre-training for video action recognition

3 code implementations • CVPR 2019 • Deepti Ghadiyaram, Matt Feiszli, Du Tran, Xueting Yan, Heng Wang, Dhruv Mahajan

Second, frame-based models perform quite well on action recognition; is pre-training for good image features sufficient or is pre-training for spatio-temporal features valuable for optimal transfer learning?

Ranked #2 on Egocentric Activity Recognition on EPIC-KITCHENS-55 (Actions Top-1 (S2) metric)

Action Classification Action Recognition +3

9,311

Paper
Code

Billion-scale semi-supervised learning for image classification

4 code implementations • 2 May 2019 • I. Zeki Yalniz, Hervé Jégou, Kan Chen, Manohar Paluri, Dhruv Mahajan

This paper presents a study of semi-supervised learning with large convolutional networks.

Ranked #6 on Image Classification on OmniBenchmark (using extra training data)

Classification General Classification +2

240

Paper
Code

Activity Driven Weakly Supervised Object Detection

no code implementations • CVPR 2019 • Zhenheng Yang, Dhruv Mahajan, Deepti Ghadiyaram, Ram Nevatia, Vignesh Ramanathan

Weakly supervised object detection aims at reducing the amount of supervision required to train detection models.

Ranked #1 on Weakly Supervised Object Detection on Charades

Action Classification Object +2

Paper
Add Code

Defense Against Adversarial Images using Web-Scale Nearest-Neighbor Search

no code implementations • CVPR 2019 • Abhimanyu Dubey, Laurens van der Maaten, Zeki Yalniz, Yixuan Li, Dhruv Mahajan

Empirical evaluations of this defense strategy on ImageNet suggest that it is very effective in attack settings in which the adversary does not have access to the image database.

Paper
Add Code

What Makes a Video a Video: Analyzing Temporal Information in Video Understanding Models and Datasets

no code implementations • CVPR 2018 • De-An Huang, Vignesh Ramanathan, Dhruv Mahajan, Lorenzo Torresani, Manohar Paluri, Li Fei-Fei, Juan Carlos Niebles

The ability to capture temporal information has been critical to the development of video understanding models.

Video Understanding

Paper
Add Code

Exploring the Limits of Weakly Supervised Pretraining

4 code implementations • ECCV 2018 • Dhruv Mahajan, Ross Girshick, Vignesh Ramanathan, Kaiming He, Manohar Paluri, Yixuan Li, Ashwin Bharambe, Laurens van der Maaten

ImageNet classification is the de facto pretraining task for these models.

Ranked #222 on Image Classification on ImageNet (using extra training data)

General Classification Image Classification +3

5,284

Paper
Code

Distributed Newton Methods for Deep Neural Networks

no code implementations • 1 Feb 2018 • Chien-Chih Wang, Kent Loong Tan, Chun-Ting Chen, Yu-Hsiang Lin, S. Sathiya Keerthi, Dhruv Mahajan, S. Sundararajan, Chih-Jen Lin

First, to reduce the communication cost, we propose a diagonalization method such that an approximate Newton direction can be obtained without communication between machines.

Paper
Add Code

Efficient Estimation of Generalization Error and Bias-Variance Components of Ensembles

no code implementations • 15 Nov 2017 • Dhruv Mahajan, Vivek Gupta, S. Sathiya Keerthi, Sellamanickam Sundararajan, Shravan Narayanamurthy, Rahul Kidambi

We also demonstrate their usefulness in making design choices such as the number of classifiers in the ensemble and the size of a subset of data used for training that is needed to achieve a certain value of generalization error.

Paper
Add Code

Gradient Boosted Decision Trees for High Dimensional Sparse Output

no code implementations • ICML 2017 • Si Si, huan zhang, S. Sathiya Keerthi, Dhruv Mahajan, Inderjit S. Dhillon, Cho-Jui Hsieh

In this paper, we study the gradient boosted decision trees (GBDT) when the output space is high dimensional and sparse.

General Classification Vocal Bursts Intensity Prediction

Paper
Add Code

Batch-Expansion Training: An Efficient Optimization Framework

no code implementations • 22 Apr 2017 • Michał Dereziński, Dhruv Mahajan, S. Sathiya Keerthi, S. V. N. Vishwanathan, Markus Weimer

We propose Batch-Expansion Training (BET), a framework for running a batch optimizer on a gradually expanding dataset.

Paper
Add Code

Towards Geo-Distributed Machine Learning

no code implementations • 30 Mar 2016 • Ignacio Cano, Markus Weimer, Dhruv Mahajan, Carlo Curino, Giovanni Matteo Fumarola

Current solutions to learning from geo-distributed data sources revolve around the idea of first centralizing the data in one data center, and then training locally.

BIG-bench Machine Learning

Paper
Add Code

A Distributed Algorithm for Training Nonlinear Kernel Machines

no code implementations • 18 May 2014 • Dhruv Mahajan, S. Sathiya Keerthi, S. Sundararajan

This paper concerns the distributed training of nonlinear kernel machines on Map-Reduce.

Paper
Add Code

A distributed block coordinate descent method for training $l_1$ regularized linear classifiers

no code implementations • 18 May 2014 • Dhruv Mahajan, S. Sathiya Keerthi, S. Sundararajan

In this paper we design a distributed algorithm for $l_1$ regularization that is much better suited for such systems than existing algorithms.

Paper
Add Code

A Parallel SGD method with Strong Convergence

no code implementations • 4 Nov 2013 • Dhruv Mahajan, S. Sathiya Keerthi, S. Sundararajan, Leon Bottou

The method has strong convergence properties.

Paper
Add Code

An efficient distributed learning algorithm based on effective local functional approximations

no code implementations • 31 Oct 2013 • Dhruv Mahajan, Nikunj Agrawal, S. Sathiya Keerthi, S. Sundararajan, Leon Bottou

In this paper we give a novel approach to the distributed training of linear classifiers (involving smooth losses and L2 regularization) that is designed to reduce the total communication costs.

L2 Regularization

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.