Search Results for author: Siddarth Jain

Found 5 papers, 0 papers with code

Interactive Planning Using Large Language Models for Partially Observable Robotics Tasks

no code implementations • 11 Dec 2023 • Lingfeng Sun, Devesh K. Jha, Chiori Hori, Siddarth Jain, Radu Corcodel, Xinghao Zhu, Masayoshi Tomizuka, Diego Romeres

Designing robotic agents to perform open vocabulary tasks has been the long-standing goal in robotics and AI.

Paper
Add Code

Style-transfer based Speech and Audio-visual Scene Understanding for Robot Action Sequence Acquisition from Videos

no code implementations • 27 Jun 2023 • Chiori Hori, Puyuan Peng, David Harwath, Xinyu Liu, Kei Ota, Siddarth Jain, Radu Corcodel, Devesh Jha, Diego Romeres, Jonathan Le Roux

This paper introduces a method for robot action sequence generation from instruction videos using (1) an audio-visual Transformer that converts audio-visual features and instruction speech to a sequence of robot actions called dynamic movement primitives (DMPs) and (2) style-transfer-based training that employs multi-task learning with video captioning and weakly-supervised learning with a semantic classifier to exploit unpaired video-action data.

Multi-Task Learning Scene Understanding +3

Paper
Add Code

Generalizable Human-Robot Collaborative Assembly Using Imitation Learning and Force Control

no code implementations • 2 Dec 2022 • Devesh K. Jha, Siddarth Jain, Diego Romeres, William Yerazunis, Daniel Nikovski

In this paper, we present a system for human-robot collaborative assembly using learning from demonstration and pose estimation, so that the robot can adapt to the uncertainty caused by the operation of humans.

Imitation Learning Pose Estimation

Paper
Add Code

Learning to Synthesize Volumetric Meshes from Vision-based Tactile Imprints

no code implementations • 29 Mar 2022 • Xinghao Zhu, Siddarth Jain, Masayoshi Tomizuka, Jeroen van Baar

Vision-based tactile sensors typically utilize a deformable elastomer and a camera mounted above to provide high-resolution image observations of contacts.

Image Augmentation Robotic Grasping

Paper
Add Code

InSeGAN: A Generative Approach to Segmenting Identical Instances in Depth Images

no code implementations • ICCV 2021 • Anoop Cherian, Goncalo Dias Pais, Siddarth Jain, Tim K. Marks, Alan Sullivan

To use our model for instance segmentation, we propose an instance pose encoder that learns to take in a generated depth image and reproduce the pose code vectors for all of the object instances.

Generative Adversarial Network Instance Segmentation +2

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.