Search Results for author: Haoyu Dong

Found 27 papers, 14 papers with code

Vision Language Models for Spreadsheet Understanding: Challenges and Opportunities

no code implementations • 25 May 2024 • Shiyu Xia, Junyu Xiong, Haoyu Dong, Jianbo Zhao, Yuzhang Tian, Mengyu Zhou, Yeye He, Shi Han, Dongmei Zhang

Notably, to leverage the strengths of VLMs in understanding text rather than two-dimensional positioning, we propose to decode cell values on the four boundaries of the table in spreadsheet boundary detection.

Boundary Detection Optical Character Recognition +2

Paper
Add Code

KET-QA: A Dataset for Knowledge Enhanced Table Question Answering

no code implementations • 13 May 2024 • Mengkang Hu, Haoyu Dong, Ping Luo, Shi Han, Dongmei Zhang

In this paper, we propose to use a knowledge base (KB) as the external knowledge source for TableQA and construct a dataset KET-QA with fine-grained gold evidence annotation.

Question Answering

Paper
Add Code

How to build the best medical image segmentation algorithm using foundation models: a comprehensive empirical study with Segment Anything Model

1 code implementation • 15 Apr 2024 • Hanxue Gu, Haoyu Dong, Jichen Yang, Maciej A. Mazurowski

Automated segmentation is a fundamental medical image analysis task, which enjoys significant advances due to the advent of deep learning.

Decoder Image Segmentation +4

Paper
Code

Rethinking Perceptual Metrics for Medical Image Translation

no code implementations • 10 Apr 2024 • Nicholas Konz, YuWen Chen, Hanxue Gu, Haoyu Dong, Maciej A. Mazurowski

Modern medical image translation methods use generative models for tasks such as the conversion of CT images to MRI.

Translation

Paper
Add Code

ContourDiff: Unpaired Image Translation with Contour-Guided Diffusion Models

no code implementations • 16 Mar 2024 • YuWen Chen, Nicholas Konz, Hanxue Gu, Haoyu Dong, Yaqian Chen, Lin Li, Jisoo Lee, Maciej A. Mazurowski

We evaluate our method by training a segmentation model on images translated from CT to MRI with their original CT masks and testing its performance on real MRIs.

Anatomy Translation

Paper
Add Code

NL2Formula: Generating Spreadsheet Formulas from Natural Language Queries

no code implementations • 20 Feb 2024 • Wei Zhao, Zhitao Hou, Siyuan Wu, Yan Gao, Haoyu Dong, Yao Wan, Hongyu Zhang, Yulei Sui, Haidong Zhang

Writing formulas on spreadsheets, such as Microsoft Excel and Google Sheets, is a widespread practice among users performing data analysis.

Natural Language Queries

Paper
Add Code

Medical Image Segmentation with InTEnt: Integrated Entropy Weighting for Single Image Test-Time Adaptation

1 code implementation • 14 Feb 2024 • Haoyu Dong, Nicholas Konz, Hanxue Gu, Maciej A. Mazurowski

Here, we approach such a task, of adapting a medical image segmentation model with only a single unlabeled test image.

Image Segmentation Medical Image Segmentation +2

Paper
Code

Anatomically-Controllable Medical Image Generation with Segmentation-Guided Diffusion Models

1 code implementation • 7 Feb 2024 • Nicholas Konz, YuWen Chen, Haoyu Dong, Maciej A. Mazurowski

Diffusion models have enabled remarkably high-quality medical image generation, yet it is challenging to enforce anatomical constraints in generated images.

counterfactual Image Generation +1

Paper
Code

SegmentAnyBone: A Universal Model that Segments Any Bone at Any Location on MRI

1 code implementation • 23 Jan 2024 • Hanxue Gu, Roy Colglazier, Haoyu Dong, Jikai Zhang, Yaqian Chen, Zafer Yildiz, YuWen Chen, Lin Li, Jichen Yang, Jay Willhite, Alex M. Meyer, Brian Guo, Yashvi Atul Shah, Emily Luo, Shipra Rajput, Sally Kuehn, Clark Bulleit, Kevin A. Wu, Jisoo Lee, Brandon Ramirez, Darui Lu, Jay M. Levin, Maciej A. Mazurowski

In our study, we propose a versatile, publicly available deep-learning model for bone segmentation in MRI across multiple standard MRI locations.

MRI segmentation Segmentation

Paper
Code

Rethinking Medical Report Generation: Disease Revealing Enhancement with Knowledge Graph

1 code implementation • 24 Jul 2023 • Yixin Wang, Zihao Lin, Haoyu Dong

Knowledge Graph (KG) plays a crucial role in Medical Report Generation (MRG) because it reveals the relations among diseases and thus can be utilized to guide the generation process.

Medical Report Generation

Paper
Code

A systematic study of the foreground-background imbalance problem in deep learning for object detection

no code implementations • 28 Jun 2023 • Hanxue Gu, Haoyu Dong, Nicholas Konz, Maciej A. Mazurowski

We experimentally study the effects of different aspects of F-B imbalance (object size, number of objects, dataset size, object type) on detection performance.

Object object-detection +1

Paper
Add Code

Unsupervised anomaly localization in high-resolution breast scans using deep pluralistic image completion

1 code implementation • 4 May 2023 • Nicholas Konz, Haoyu Dong, Maciej A. Mazurowski

Given the scarcity of abnormal images and the abundance of normal images for this problem, an anomaly detection/localization approach could be well-suited.

Anomaly Detection

Paper
Code

Segment Anything Model for Medical Image Analysis: an Experimental Study

2 code implementations • 20 Apr 2023 • Maciej A. Mazurowski, Haoyu Dong, Hanxue Gu, Jichen Yang, Nicholas Konz, Yixin Zhang

We conclude that SAM shows impressive zero-shot segmentation performance for certain medical imaging datasets, but moderate to poor performance for others.

Image Segmentation Interactive Segmentation +5

128

Paper
Code

Reflection of Thought: Inversely Eliciting Numerical Reasoning in Language Models via Solving Linear Systems

no code implementations • 11 Oct 2022 • Fan Zhou, Haoyu Dong, Qian Liu, Zhoujun Cheng, Shi Han, Dongmei Zhang

Numerical reasoning over natural language has been a long-standing goal for the research community.

Paper
Add Code

The Intrinsic Manifolds of Radiological Images and their Role in Deep Learning

1 code implementation • 6 Jul 2022 • Nicholas Konz, Hanxue Gu, Haoyu Dong, Maciej A. Mazurowski

These results give a more principled underpinning for the intuition that radiological images can be more challenging to apply deep learning to than natural image datasets common to machine learning research.

Paper
Code

TaCube: Pre-computing Data Cubes for Answering Numerical-Reasoning Questions over Tabular Data

1 code implementation • 25 May 2022 • Fan Zhou, Mengkang Hu, Haoyu Dong, Zhoujun Cheng, Shi Han, Dongmei Zhang

Existing auto-regressive pre-trained language models (PLMs) like T5 and BART, have been well applied to table question answering by UNIFIEDSKG and TAPEX, respectively, and demonstrated state-of-the-art results on multiple benchmarks.

Question Answering

Paper
Code

PLOG: Table-to-Logic Pretraining for Logical Table-to-Text Generation

1 code implementation • 25 May 2022 • Ao Liu, Haoyu Dong, Naoaki Okazaki, Shi Han, Dongmei Zhang

However, directly learning the logical inference knowledge from table-text pairs is very difficult for neural models because of the ambiguity of natural language and the scarcity of parallel data.

Table-to-Text Generation

Paper
Code

Table Pre-training: A Survey on Model Architectures, Pre-training Objectives, and Downstream Tasks

no code implementations • 24 Jan 2022 • Haoyu Dong, Zhoujun Cheng, Xinyi He, Mengyu Zhou, Anda Zhou, Fan Zhou, Ao Liu, Shi Han, Dongmei Zhang

Since a vast number of tables can be easily collected from web pages, spreadsheets, PDFs, and various other document types, a flurry of table pre-training frameworks have been proposed following the success of text and images, and they have achieved new state-of-the-arts on various tasks such as table question answering, table type recognition, column relation classification, table search, formula prediction, etc.

Denoising Question Answering +2

Paper
Add Code

Lightweight Transformer Backbone for Medical Object Detection

no code implementations • 22 Nov 2021 • Yifan Zhang, Haoyu Dong, Nicholas Konz, Hanxue Gu, Maciej A. Mazurowski

Specifically, we propose a novel modification of visual transformer (ViT) on image feature patches to connect the feature patches of a tumor with healthy backgrounds of breast images and form a more robust backbone for tumor detection.

Lesion Detection Medical Object Detection +2

Paper
Add Code

FORTAP: Using Formulas for Numerical-Reasoning-Aware Table Pretraining

1 code implementation • ACL 2022 • Zhoujun Cheng, Haoyu Dong, Ran Jia, Pengfei Wu, Shi Han, Fan Cheng, Dongmei Zhang

In this paper, we find that the spreadsheet formula, which performs calculations on numerical values in tables, is naturally a strong supervision of numerical reasoning.

Paper
Code

HiTab: A Hierarchical Table Dataset for Question Answering and Natural Language Generation

1 code implementation • ACL 2022 • Zhoujun Cheng, Haoyu Dong, Zhiruo Wang, Ran Jia, Jiaqi Guo, Yan Gao, Shi Han, Jian-Guang Lou, Dongmei Zhang

HiTab provides 10, 686 QA pairs and descriptive sentences with well-annotated quantity and entity alignment on 3, 597 tables with broad coverage of table hierarchies and numerical reasoning types.

Descriptive Entity Alignment +2

Paper
Code

TableSense: Spreadsheet Table Detection with Convolutional Neural Networks

1 code implementation • 25 Jun 2021 • Haoyu Dong, Shijie Liu, Shi Han, Zhouyu Fu, Dongmei Zhang

Spreadsheet table detection is the task of detecting all tables on a given sheet and locating their respective ranges.

Active Learning Boundary Detection +1

Paper
Code

Trust It or Not: Confidence-Guided Automatic Radiology Report Generation

no code implementations • 21 Jun 2021 • Yixin Wang, Zihao Lin, Zhe Xu, Haoyu Dong, Jiang Tian, Jie Luo, Zhongchao shi, Yang Zhang, Jianping Fan, Zhiqiang He

Experimental results have demonstrated that the proposed method for model uncertainty characterization and estimation can produce more reliable confidence scores for radiology report generation, and the modified loss function, which takes into account the uncertainties, leads to better model performance on two public radiology report datasets.

Decision Making Image Captioning +2

Paper
Add Code

Using Text to Teach Image Retrieval

no code implementations • 19 Nov 2020 • Haoyu Dong, Ze Wang, Qiang Qiu, Guillermo Sapiro

Image retrieval relies heavily on the quality of the data modeling and the distance measurement in the feature space.

Image Retrieval Retrieval +2

Paper
Add Code

TUTA: Tree-based Transformers for Generally Structured Table Pre-training

1 code implementation • 21 Oct 2020 • Zhiruo Wang, Haoyu Dong, Ran Jia, Jia Li, Zhiyi Fu, Shi Han, Dongmei Zhang

First, we devise a unified tree-based structure, called a bi-dimensional coordinate tree, to describe both the spatial and hierarchical information of generally structured tables.

Paper
Code

Rethinking Exposure Bias In Language Modeling

no code implementations • 13 Oct 2019 • Yifan Xu, Kening Zhang, Haoyu Dong, Yuezhou Sun, Wenlong Zhao, Zhuowen Tu

Exposure bias describes the phenomenon that a language model trained under the teacher forcing schema may perform poorly at the inference stage when its predictions are conditioned on its previous predictions unseen from the training corpus.

Language Modelling Reinforcement Learning (RL)

Paper
Add Code

Semantic Structure Extraction for Spreadsheet Tables with a Multi-task Learning Architecture

no code implementations • NeurIPS Workshop Document_Intelligen 2019 • Haoyu Dong, Shijie Liu, Zhouyu Fu, Shi Han, Dongmei Zhang

To learn spatial correlations and capture semantics on spreadsheets, we have developed a novel learning-based framework for spreadsheet semantic structure extraction.

Language Modelling Multi-Task Learning

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.