no code implementations • 13 May 2024 • Matthew Keller, Chi-en Amy Tai, Yuhao Chen, Pengcheng Xi, Alexander Wong
Many aging individuals encounter challenges in effectively tracking their dietary intake, exacerbating their susceptibility to nutrition-related health complications.
no code implementations • 13 May 2024 • Jerrin Bright, Bavesh Balaji, Yuhao Chen, David A Clausi, John S Zelek
In the high-stakes world of baseball, every nuance of a pitcher's mechanics holds the key to maximizing performance and minimizing runs.
no code implementations • 12 May 2024 • Aaryam Sharma, Chris Czarnecki, Yuhao Chen, Pengcheng Xi, Linlin Xu, Alexander Wong
Monitoring dietary intake is a crucial aspect of promoting healthy living.
no code implementations • 12 May 2024 • Akil Pathiranage, Chris Czarnecki, Yuhao Chen, Pengcheng Xi, Linlin Xu, Alexander Wong
Ellipse estimation is an important topic in food image processing because it can be leveraged to parameterize plates and bowls, which in turn can be used to estimate camera view angles and food portion sizes.
no code implementations • 2 May 2024 • Yuxiang Huang, Yuhao Chen, John Zelek
Detecting and segmenting moving objects from a moving monocular camera is challenging in the presence of unknown camera motion, diverse object motions and complex scene structures.
no code implementations • 7 Apr 2024 • Yuanfeng Xu, Yuhao Chen, Zhongzhan Huang, Zijian He, Guangrun Wang, Philip Torr, Liang Lin
In this paper, we present AnimateZoo, a zero-shot diffusion-based video generator to address this challenging cross-species animation issue, aiming to accurately produce animal animations while preserving the background.
no code implementations • 17 Mar 2024 • Bavesh Balaji, Jerrin Bright, Sirisha Rambhatla, Yuhao Chen, Alexander Wong, John Zelek, David A Clausi
We further introduce a new spatio-temporal network leveraging our novel d-MAE for unique player identification.
no code implementations • 14 Mar 2024 • Jerrin Bright, Bavesh Balaji, Harish Prakash, Yuhao Chen, David A Clausi, John Zelek
Precise Human Mesh Recovery (HMR) with in-the-wild data is a formidable challenge and is often hindered by depth ambiguities and reduced precision.
no code implementations • 5 Feb 2024 • Dayou Mao, Yuhao Chen, Yifan Wu, Maximilian Gilles, Alexander Wong
One of the main motivations of MTL is to develop neural networks capable of inferring multiple tasks simultaneously.
no code implementations • 22 Dec 2023 • Yuhao Chen, Chloe Wong, Hanwen Yang, Juan Aguenza, Sai Bhujangari, Benthan Vu, Xun Lei, Amisha Prasad, Manny Fluss, Eric Phuong, Minghao Liu, Raja Kumar, Vanshika Vats, James Davis
This study critically evaluates the efficacy of prompting methods in enhancing the mathematical reasoning capability of large language models (LLMs).
no code implementations • 11 Dec 2023 • Saeejith Nair, Chi-en Amy Tai, Yuhao Chen, Alexander Wong
As the largest open-source synthetic food dataset, NV-Synth highlights the value of physics-based simulations for enabling scalable and controllable generation of diverse photorealistic meal images to overcome data limitations and drive advancements in automated dietary assessment using computer vision.
no code implementations • 6 Dec 2023 • Olivia Markham, Yuhao Chen, Chi-en Amy Tai, Alexander Wong
To address these limitations, we introduce FoodFusion, a Latent Diffusion model engineered specifically for the faithful synthesis of realistic food images from textual descriptions.
no code implementations • 30 Nov 2023 • Aditya Sridhar, Chi-en Amy Tai, Hayden Gunraj, Yuhao Chen, Alexander Wong
In Canada, prostate cancer is the most common form of cancer in men and accounted for 20% of new cancer cases for this demographic in 2022.
no code implementations • 29 Nov 2023 • Shen Zhang, Zhaowei Chen, Zhenyu Zhao, Yuhao Chen, Yao Tang, Jiajun Liang
Extensive experiments demonstrate that our approach can address object duplication and heavy computation issues, achieving state-of-the-art performance on higher-resolution image synthesis tasks.
no code implementations • 22 Nov 2023 • Yuhao Chen, Yuxuan Yan, Qianqian Yang, Yuanchao Shu, Shibo He, Jiming Chen
Transformer-based large language models (LLMs) have demonstrated impressive capabilities in a variety of natural language processing (NLP) tasks.
no code implementations • 20 Nov 2023 • Chi-en Amy Tai, Saeejith Nair, Olivia Markham, Matthew Keller, Yifan Wu, Yuhao Chen, Alexander Wong
Dietary intake estimation plays a crucial role in understanding the nutritional habits of individuals and populations, aiding in the prevention and management of diet-related health issues.
no code implementations • 10 Nov 2023 • Yuhao Chen, Yuxuan Yan, Qianqian Yang, Yuanchao Shu, Shibo He, Zhiguo Shi, Jiming Chen
Moreover, we propose a bit-level computation-efficient data compression scheme to compress the data to be transmitted between devices during training.
no code implementations • 25 Sep 2023 • Saeejith Nair, Yuhao Chen, Mohammad Javad Shafiee, Alexander Wong
Thus, there is a need to dynamically optimize the neural network component of NeRFs to achieve a balance between computational complexity and specific targets for synthesis quality.
no code implementations • 14 Sep 2023 • Chi-en Amy Tai, Matthew Keller, Saeejith Nair, Yuhao Chen, Yifan Wu, Olivia Markham, Krish Parmar, Pengcheng Xi, Heather Keller, Sharon Kirkpatrick, Alexander Wong
Recent work has focused on using computer vision and machine learning to automatically estimate dietary intake from food images, but the lack of comprehensive datasets with diverse viewpoints, modalities and food annotations hinders the accuracy and realism of such methods.
no code implementations • 12 Sep 2023 • Bavesh Balaji, Jerrin Bright, Harish Prakash, Yuhao Chen, David A Clausi, John Zelek
To address these issues, we propose a robust keyframe identification module that extracts frames containing essential high-level information about the jersey number.
no code implementations • 2 Sep 2023 • Jerrin Bright, Yuhao Chen, John Zelek
The findings highlight the effectiveness of our method in mitigating the challenges posed by motion blur, thereby enhancing the overall quality of pose estimation.
no code implementations • 8 Aug 2023 • Yuhao Chen, Qianqian Yang, Zhiguo Shi, Jiming Chen
In recent years, semantic communication has been a popular research topic for its superiority in communication efficiency.
no code implementations • 15 Jun 2023 • Grant Sinha, Krish Parmar, Hilda Azimi, Amy Tai, Yuhao Chen, Alexander Wong, Pengcheng Xi
To address these issues, two models are trained and compared, one based on convolutional neural networks and the other on Bidirectional Encoder representation from Image Transformers (BEiT).
no code implementations • 5 Jun 2023 • Weixuan Chen, Yuhao Chen, Qianqian Yang, Chongwen Huang, Qian Wang, Zhaoyang Zhang
Adaptive rate control for deep joint source and channel coding (JSCC) is considered an effective approach to transmit sufficient information in scenarios with limited communication resources.
no code implementations • 21 Apr 2023 • Alexander Wong, Yifan Wu, Saad Abbasi, Saeejith Nair, Yuhao Chen, Mohammad Javad Shafiee
As such, the design of highly efficient multi-task deep neural network architectures tailored for computer vision tasks for robotic grasping on the edge is highly desired for widespread adoption in manufacturing environments.
no code implementations • 12 Apr 2023 • Chi-en Amy Tai, Jason Li, Sriram Kumar, Saeejith Nair, Yuhao Chen, Pengcheng Xi, Alexander Wong
With the growth in capabilities of generative models, there has been growing interest in using photo-realistic renders of common 3D food items to improve downstream tasks such as food printing, nutrition prediction, or management of food wastage.
no code implementations • 12 Apr 2023 • Chi-en Amy Tai, Matthew Keller, Mattie Kerrigan, Yuhao Chen, Saeejith Nair, Pengcheng Xi, Alexander Wong
Unlike existing datasets, a collection of 3D models with nutritional information allows for view synthesis to create an infinite number of 2D images for any given viewpoint/camera angle along with the associated nutritional information.
no code implementations • 10 Apr 2023 • E. Zhixuan Zeng, Yuhao Chen, Alexander Wong
To address these challenges, this paper proposes ShapeShift, a superquadric-based framework for object pose estimation that predicts the object's pose relative to a primitive shape which is fitted to the object.
1 code implementation • CVPR 2023 • Yuhao Chen, Xin Tan, Borui Zhao, Zhaowei Chen, RenJie Song, Jiajun Liang, Xuequan Lu
ANL introduces the additional negative pseudo-label for all unlabeled data to leverage low-confidence examples.
no code implementations • 19 Oct 2022 • Yuhao Chen, Hayden Gunraj, E. Zhixuan Zeng, Robbie Meyer, Maximilian Gilles, Alexander Wong
We also demonstrate that our MC score is a more reliable indicator for outputs during inference time compared to the model-generated confidence scores that are often over-confident.
no code implementations • 8 Aug 2022 • Maximilian Gilles, Yuhao Chen, Tim Robin Winter, E. Zhixuan Zeng, Alexander Wong
Autonomous bin picking poses significant challenges to vision-driven robotic systems given the complexity of the problem, ranging from various sensor modalities, to highly entangled object layouts, to diverse item properties and gripper types.
no code implementations • 21 May 2022 • Mingyao Cui, Zidong Wu, Yuhao Chen, Shenheng Xu, Fan Yang, Linglong Dai
By jointly designing the hardware and software, this prototype can realize real-time 4K video transmission with much reduced power consumption.
1 code implementation • 29 Dec 2021 • Yuhao Chen, E. Zhixuan Zeng, Maximilian Gilles, Alexander Wong
We also propose a new layout-weighted performance metric alongside the dataset for evaluating object detection and segmentation performance in a manner that is more appropriate for robotic grasp applications compared to existing general-purpose performance metrics.
no code implementations • 6 Oct 2021 • Yuhao Chen, Qianqian Yang, Shibo He, Zhiguo Shi, Jiming Chen
Our numerical results demonstrate that FTPipeHD is 6.8x faster in training than the state-of-the-art method when the computing capacity of the best device is 10x greater than the worst one.
no code implementations • 26 May 2021 • Guoqing Zhang, Yuhao Chen, Weisi Lin, Arun Chandran, Xuan Jing
As a prevailing task in the fields of video surveillance and forensics, person re-identification (re-ID) aims to match person images captured from non-overlapping cameras.
1 code implementation • 25 May 2021 • Yuhao Chen, Guoqing Zhang, Yujiang Lu, Zhenxing Wang, Yuhui Zheng, Ruili Wang
Text-based person search is a sub-task in the field of image retrieval, which aims to retrieve target person images according to a given textual description.
Ranked #11 on Text based Person Retrieval on CUHK-PEDES
no code implementations • 21 Mar 2021 • Guoqing Zhang, Yuhao Chen, Yang Dai, Yuhui Zheng, Yi Wu
Due to inaccurate person detections and pose changes, pedestrian misalignment significantly increases the difficulty of feature extraction and matching.
no code implementations • 10 Jul 2020 • Yuhao Chen, Yifan Wu, Linlin Xu, Alexander Wong
In this paper, we leverage the performance of CNNs and propose a module that uses prior knowledge of building corners to create angular and concise building polygons from CNN segmentation outputs.
no code implementations • 24 Jan 2020 • Changye Yang, Sriram Baireddy, Yuhao Chen, Enyu Cai, Denise Caldwell, Valérian Méline, Anjali S. Iyer-Pascuzzi, Edward J. Delp
Analysis of the shape of plants can potentially be used to accurately quantify the degree of wilting.
no code implementations • 20 Dec 2019 • Kennedy Ralston, Yuhao Chen, Haruna Isah, Farhana Zulkernine
The chatbot could also be adapted for use in other application areas such as student info-centers, government kiosks, and mental health support systems.
no code implementations • 2 Jul 2018 • Javier Ribera, Fangning He, Yuhao Chen, Ayman F. Habib, Edward J. Delp
Use of imagery is becoming popular for phenotyping.
6 code implementations • CVPR 2019 • Javier Ribera, David Güera, Yuhao Chen, Edward J. Delp
In these networks, the training procedure usually requires providing bounding boxes or the maximum number of expected objects.
Ranked #1 on Object Localization on Mall