Search Results for author: Wei Liu

Found 474 papers, 188 papers with code

End-to-end Speech Translation System Description of LIT for IWSLT 2019

no code implementations • EMNLP (IWSLT) 2019 • Mei Tu, Wei Liu, Lijie Wang, Xiao Chen, Xue Wen

We propose layer-tied self-attention for end-to-end speech translation.

Paper
Add Code

QuickGraph: A Rapid Annotation Tool for Knowledge Graph Extraction from Technical Text

1 code implementation • ACL 2022 • Tyler Bikaun, Michael Stewart, Wei Liu

Acquiring high-quality annotated corpora for complex multi-task information extraction (MT-IE) is an arduous and costly process for human-annotators.

Clustering

Paper
Code

CIST@CL-SciSumm 2020, LongSumm 2020: Automatic Scientific Document Summarization

no code implementations • EMNLP (sdp) 2020 • Lei LI, Yang Xie, Wei Liu, Yinan Liu, Yafei Jiang, Siya Qi, Xingyuan Li

In the LongSumm shared task, we integrate both the extractive and abstractive summarization ways.

Abstractive Text Summarization Document Summarization +3

Paper
Add Code

Lexicon-Based Graph Convolutional Network for Chinese Word Segmentation

no code implementations • Findings (EMNLP) 2021 • Kaiyu Huang, Hao Yu, Junpeng Liu, Wei Liu, Jingxiang Cao, Degen Huang

Experimental results on five benchmarks and four cross-domain datasets show the lexicon-based graph convolutional network successfully captures the information of candidate words and helps to improve performance on the benchmarks (Bakeoff-2005 and CTB6) and the cross-domain datasets (SIGHAN-2010).

Chinese Word Segmentation

Paper
Add Code

PointPWC-Net: Cost Volume on Point Clouds for (Self-)Supervised Scene Flow Estimation

1 code implementation • ECCV 2020 • Wenxuan Wu, Zhi Yuan Wang, Zhuwen Li, Wei Liu, Li Fuxin

We propose a novel end-to-end deep scene flow model, called PointPWC-Net, that directly processes 3D point cloud scenes with large motions in a coarse-to-fine fashion.

Self-supervised Scene Flow Estimation

137

Paper
Code

LexiClean: An annotation tool for rapid multi-task lexical normalisation

1 code implementation • EMNLP (ACL) 2021 • Tyler Bikaun, Tim French, Melinda Hodkiewicz, Michael Stewart, Wei Liu

LexiClean’s main contribution is support for simultaneous in situ token-level modification and annotation that can be rapidly applied corpus wide.

Paper
Code

Energy-Latency Manipulation of Multi-modal Large Language Models via Verbose Samples

no code implementations • 25 Apr 2024 • Kuofeng Gao, Jindong Gu, Yang Bai, Shu-Tao Xia, Philip Torr, Wei Liu, Zhifeng Li

For verbose videos, a frame feature diversity loss is proposed to increase the feature diversity among frames.

Paper
Add Code

Atomas: Hierarchical Alignment on Molecule-Text for Unified Molecule Understanding and Generation

no code implementations • 23 Apr 2024 • Yikun Zhang, Geyan Ye, Chaohao Yuan, Bo Han, Long-Kai Huang, Jianhua Yao, Wei Liu, Yu Rong

We design a Hierarchical Adaptive Alignment model to concurrently learn the fine-grained fragment correspondence between two modalities and align these representations of fragments in three levels.

Drug Discovery molecular representation +2

Paper
Add Code

Functional Protein Design with Local Domain Alignment

no code implementations • 18 Apr 2024 • Chaohao Yuan, Songyou Li, Geyan Ye, Yikun Zhang, Long-Kai Huang, Wenbing Huang, Wei Liu, Jianhua Yao, Yu Rong

The core challenge of de novo protein design lies in creating proteins with specific functions or properties, guided by certain conditions.

Protein Annotation Protein Design

Paper
Add Code

Simulation-Free Determination of Microstructure Representative Volume Element Size via Fisher Scores

1 code implementation • 7 Apr 2024 • Wei Liu, Satyajit Mojumder, Wing Kam Liu, Wei Chen, Daniel W. Apley

We propose a simulation-free alternative that determines RVE size based only on a micrograph.

Paper
Code

What Causes the Failure of Explicit to Implicit Discourse Relation Recognition?

1 code implementation • 1 Apr 2024 • Wei Liu, Stephen Wan, Michael Strube

We consider an unanswered question in the discourse processing community: why do relation classifiers trained on explicit examples (with connectives removed) perform poorly in real implicit scenarios?

Relation

Paper
Code

Variational Graph Auto-Encoder Based Inductive Learning Method for Semi-Supervised Classification

no code implementations • 26 Mar 2024 • Hanxuan Yang, Zhaoxin Yu, Qingchao Kong, Wei Liu, Wenji Mao

Graph representation learning is a fundamental research issue in various domains of applications, of which the inductive learning problem is particularly challenging as it requires models to generalize to unseen graph structures during inference.

Graph Representation Learning Node Classification

Paper
Add Code

CodeS: Natural Language to Code Repository via Multi-Layer Sketch

2 code implementations • 25 Mar 2024 • Daoguang Zan, Ailun Yu, Wei Liu, Dong Chen, Bo Shen, Wei Li, Yafen Yao, Yongshun Gong, Xiaolin Chen, Bei guan, Zhiguang Yang, Yongji Wang, Qianxiang Wang, Lizhen Cui

For feedback-based evaluation, we develop a VSCode plugin for CodeS and engage 30 participants in conducting empirical studies.

Benchmarking

20,765

Paper
Code

Event-Triggered State Estimation Through Confidence Level

no code implementations • 22 Mar 2024 • Wei Liu

This paper considers the state estimation problem for discrete-time linear systems under event-triggered scheme.

Paper
Add Code

Reversible Jump Attack to Textual Classifiers with Modification Reduction

1 code implementation • 21 Mar 2024 • Mingze Ni, Zhensu Sun, Wei Liu

Recent studies on adversarial examples expose vulnerabilities of natural language processing (NLP) models.

Paper
Code

LocalStyleFool: Regional Video Style Transfer Attack Using Segment Anything Model

no code implementations • 18 Mar 2024 • Yuxin Cao, Jinghao Li, Xi Xiao, Derui Wang, Minhui Xue, Hao Ge, Wei Liu, Guangwu Hu

Benefiting from the popularity and scalably usability of Segment Anything Model (SAM), we first extract different regions according to semantic information and then track them through the video stream to maintain the temporal consistency.

Adversarial Attack Style Transfer +2

Paper
Add Code

LoRA-Composer: Leveraging Low-Rank Adaptation for Multi-Concept Customization in Training-Free Diffusion Models

1 code implementation • 18 Mar 2024 • Yang Yang, Wen Wang, Liang Peng, Chaotian Song, Yao Chen, Hengjia Li, Xiaolong Yang, Qinglin Lu, Deng Cai, Boxi Wu, Wei Liu

Customization generation techniques have significantly advanced the synthesis of specific concepts across varied contexts.

Paper
Code

OMG: Occlusion-friendly Personalized Multi-concept Generation in Diffusion Models

1 code implementation • 16 Mar 2024 • Zhe Kong, Yong Zhang, Tianyu Yang, Tao Wang, Kaihao Zhang, Bizhu Wu, GuanYing Chen, Wei Liu, Wenhan Luo

We also observe that the initiation denoising timestep for noise blending is the key to identity preservation and layout.

Denoising Text-to-Image Generation

520

Paper
Code

Securely Fine-tuning Pre-trained Encoders Against Adversarial Examples

1 code implementation • 16 Mar 2024 • Ziqi Zhou, Minghui Li, Wei Liu, Shengshan Hu, Yechao Zhang, Wei Wan, Lulu Xue, Leo Yu Zhang, Dezhong Yao, Hai Jin

In response to these challenges, we propose Genetic Evolution-Nurtured Adversarial Fine-tuning (Gen-AF), a two-stage adversarial fine-tuning approach aimed at enhancing the robustness of downstream models.

Self-Supervised Learning

Paper
Code

Follow-Your-Click: Open-domain Regional Image Animation via Short Prompts

2 code implementations • 13 Mar 2024 • Yue Ma, Yingqing He, Hongfa Wang, Andong Wang, Chenyang Qi, Chengfei Cai, Xiu Li, Zhifeng Li, Heung-Yeung Shum, Wei Liu, Qifeng Chen

Despite recent advances in image-to-video generation, better controllability and local animation are less explored.

Image Animation Image to Video Generation

727

Paper
Code

DialogGen: Multi-modal Interactive Dialogue System for Multi-turn Text-to-Image Generation

no code implementations • 13 Mar 2024 • Minbin Huang, Yanxin Long, Xinchi Deng, Ruihang Chu, Jiangfeng Xiong, Xiaodan Liang, Hong Cheng, Qinglin Lu, Wei Liu

However, many of these works face challenges in identifying correct output modalities and generating coherent images accordingly as the number of output modalities increases and the conversations go deeper.

Prompt Engineering Text-to-Image Generation

Paper
Add Code

Category-Agnostic Pose Estimation for Point Clouds

no code implementations • 12 Mar 2024 • Bowen Liu, Wei Liu, Siang Chen, Pengwei Xie, Guijin Wang

The goal of object pose estimation is to visually determine the pose of a specific object in the RGB-D input.

Category-Agnostic Pose Estimation Object +1

Paper
Add Code

ToolRerank: Adaptive and Hierarchy-Aware Reranking for Tool Retrieval

no code implementations • 11 Mar 2024 • Yuanhang Zheng, Peng Li, Wei Liu, Yang Liu, Jian Luan, Bin Wang

Specifically, our proposed ToolRerank includes Adaptive Truncation, which truncates the retrieval results related to seen and unseen tools at different positions, and Hierarchy-Aware Reranking, which makes retrieval results more concentrated for single-tool queries and more diverse for multi-tool queries.

Retrieval

Paper
Add Code

RIS-Enabled Joint Near-Field 3D Localization and Synchronization in SISO Multipath Environments

no code implementations • 11 Mar 2024 • Han Yan, Hua Chen, Wei Liu, Songjie Yang, Gang Wang, Chau Yuen

Reconfigurable Intelligent Surfaces (RIS) show great promise in the realm of 6th generation (6G) wireless systems, particularly in the areas of localization and communication.

Paper
Add Code

Large Language Models are In-Context Molecule Learners

1 code implementation • 7 Mar 2024 • Jiatong Li, Wei Liu, Zhihao Ding, Wenqi Fan, Yuqiang Li, Qing Li

Specifically, ICMA incorporates the following three stages: Hybrid Context Retrieval, Post-retrieval Re-ranking, and In-context Molecule Tuning.

Cross-Modal Retrieval Re-Ranking +2

Paper
Code

Modality-Agnostic Structural Image Representation Learning for Deformable Multi-Modality Medical Image Registration

no code implementations • 29 Feb 2024 • Tony C. W. Mok, Zi Li, Yunhao Bai, Jianpeng Zhang, Wei Liu, Yan-Jie Zhou, Ke Yan, Dakai Jin, Yu Shi, Xiaoli Yin, Le Lu, Ling Zhang

Existing multi-modality image registration algorithms rely on statistical-based similarity measures or local structural image representations.

Anatomy Contrastive Learning +3

Paper
Add Code

A Comprehensive Evaluation of Quantization Strategies for Large Language Models

no code implementations • 26 Feb 2024 • Renren Jin, Jiangcun Du, Wuwei Huang, Wei Liu, Jian Luan, Bin Wang, Deyi Xiong

Our experimental results indicate that LLMs with 4-bit quantization can retain performance comparable to their non-quantized counterparts, and perplexity can serve as a proxy metric for quantized LLMs on most benchmarks.

Language Modelling Quantization

Paper
Add Code

Multi-Constraint Safe RL with Objective Suppression for Safety-Critical Applications

no code implementations • 23 Feb 2024 • Zihan Zhou, Jonathan Booher, Khashayar Rohanimanesh, Wei Liu, Aleksandr Petiushko, Animesh Garg

Safe reinforcement learning tasks with multiple constraints are a challenging domain despite being very common in the real world.

Autonomous Driving reinforcement-learning +1

Paper
Add Code

Analysing The Impact of Sequence Composition on Language Model Pre-Training

1 code implementation • 21 Feb 2024 • Yu Zhao, Yuanbin Qu, Konrad Staniszewski, Szymon Tworkowski, Wei Liu, Piotr Miłoś, Yuxiang Wu, Pasquale Minervini

In this work, we find that applying causal masking can lead to the inclusion of distracting information from previous documents during pre-training, which negatively impacts the performance of the models on language modelling and downstream tasks.

In-Context Learning Language Modelling +1

Paper
Code

AICAttack: Adversarial Image Captioning Attack with Attention-Based Optimization

no code implementations • 19 Feb 2024 • Jiyao Li, Mingze Ni, Yifei Dong, Tianqing Zhu, Wei Liu

At the intersection of CV and NLP is the problem of image captioning, where the related models' robustness against adversarial attacks has not been well studied.

Adversarial Attack Image Captioning

Paper
Add Code

ChemLLM: A Chemical Large Language Model

no code implementations • 10 Feb 2024 • Di Zhang, Wei Liu, Qian Tan, Jingdan Chen, Hang Yan, Yuliang Yan, Jiatong Li, Weiran Huang, Xiangyu Yue, Wanli Ouyang, Dongzhan Zhou, Shufei Zhang, Mao Su, Han-sen Zhong, Yuqiang Li

However, the community lacks an LLM specifically designed for chemistry.

Language Modelling Large Language Model +2

Paper
Add Code

Poisson Process for Bayesian Optimization

no code implementations • 5 Feb 2024 • Xiaoxing Wang, Jiaxing Li, Chao Xue, Wei Liu, Weifeng Liu, Xiaokang Yang, Junchi Yan, DaCheng Tao

BayesianOptimization(BO) is a sample-efficient black-box optimizer, and extensive methods have been proposed to build the absolute function response of the black-box function through a probabilistic surrogate model, including Tree-structured Parzen Estimator (TPE), random forest (SMAC), and Gaussian process (GP).

Bayesian Optimization Hyperparameter Optimization +2

Paper
Add Code

NFT1000: A Visual Text Dataset For Non-Fungible Token Retrieval

no code implementations • 29 Jan 2024 • Shuxun Wang, Yunfei Lei, Ziqi Zhang, Wei Liu, Haowei Liu, Li Yang, Wenjuan Li, Bing Li, Weiming Hu

With the rise of 'Metaverse' and 'Web3. 0', NFT ( Non-Fungible Token ) has emerged as a kind of pivotal digital asset, garnering significant attention.

Retrieval

Paper
Add Code

Enhancing Human Experience in Human-Agent Collaboration: A Human-Centered Modeling Approach Based on Positive Human Gain

no code implementations • 28 Jan 2024 • Yiming Gao, Feiyu Liu, Liang Wang, Zhenjie Lian, Dehua Zheng, Weixuan Wang, Wenjin Yang, Siqin Li, Xianliang Wang, Wenhui Chen, Jing Dai, Qiang Fu, Wei Yang, Lanxiao Huang, Wei Liu

We expect that agents should learn to enhance the extent to which humans achieve these goals while maintaining agents' original abilities (e. g., winning games).

Paper
Add Code

A Systematic Literature Review on Explainability for Machine/Deep Learning-based Software Engineering Research

no code implementations • 26 Jan 2024 • Sicong Cao, Xiaobing Sun, Ratnadira Widyasari, David Lo, Xiaoxue Wu, Lili Bo, Jiale Zhang, Bin Li, Wei Liu, Di wu, Yixin Chen

The remarkable achievements of Artificial Intelligence (AI) algorithms, particularly in Machine Learning (ML) and Deep Learning (DL), have fueled their extensive deployment across multiple sectors, including Software Engineering (SE).

Decision Making Vulnerability Detection

Paper
Add Code

Inducing High Energy-Latency of Large Vision-Language Models with Verbose Images

1 code implementation • 20 Jan 2024 • Kuofeng Gao, Yang Bai, Jindong Gu, Shu-Tao Xia, Philip Torr, Zhifeng Li, Wei Liu

Once attackers maliciously induce high energy consumption and latency time (energy-latency cost) during inference of VLMs, it will exhaust computational resources.

Paper
Code

The Radiation Oncology NLP Database

1 code implementation • 19 Jan 2024 • Zhengliang Liu, Jason Holmes, Wenxiong Liao, Chenbin Liu, Lian Zhang, Hongying Feng, Peilong Wang, Muhammad Ali Elahi, Hongmin Cai, Lichao Sun, Quanzheng Li, Xiang Li, Tianming Liu, Jiajian Shen, Wei Liu

ROND is specifically designed to address this gap in the domain of radiation oncology, a field that offers many opportunities for NLP exploration.

Language Modelling Large Language Model +7

Paper
Code

LUPET: Incorporating Hierarchical Information Path into Multilingual ASR

no code implementations • 8 Jan 2024 • Wei Liu, Jingyong Hou, Dong Yang, Muyong Cao, Tan Lee

Many factors have separately shown their effectiveness on improving multilingual ASR.

Acoustic Unit Discovery

Paper
Add Code

Benchmarking the CoW with the TopCoW Challenge: Topology-Aware Anatomical Segmentation of the Circle of Willis for CTA and MRA

1 code implementation • 29 Dec 2023 • Kaiyuan Yang, Fabio Musio, Yihui Ma, Norman Juchler, Johannes C. Paetzold, Rami Al-Maskari, Luciano Höher, Hongwei Bran Li, Ibrahim Ethem Hamamci, Anjany Sekuboyina, Suprosanna Shit, Houjing Huang, Chinmay Prabhakar, Ezequiel de la Rosa, Diana Waldmannstetter, Florian Kofler, Fernando Navarro, Martin Menten, Ivan Ezhov, Daniel Rueckert, Iris Vos, Ynte Ruigrok, Birgitta Velthuis, Hugo Kuijf, Julien Hämmerli, Catherine Wurster, Philippe Bijlenga, Laura Westphal, Jeroen Bisschop, Elisa Colombo, Hakim Baazaoui, Andrew Makmur, James Hallinan, Bene Wiestler, Jan S. Kirschke, Roland Wiest, Emmanuel Montagnon, Laurent Letourneau-Guillon, Adrian Galdran, Francesco Galati, Daniele Falcetta, Maria A. Zuluaga, Chaolong Lin, Haoran Zhao, Zehan Zhang, Sinyoung Ra, Jongyun Hwang, HyunJin Park, Junqiang Chen, Marek Wodzinski, Henning Müller, Pengcheng Shi, Wei Liu, Ting Ma, Cansu Yalçin, Rachika E. Hamadache, Joaquim Salvi, Xavier Llado, Uma Maria Lal-Trehan Estrada, Valeriia Abramova, Luca Giancardo, Arnau Oliver, Jialu Liu, Haibin Huang, Yue Cui, Zehang Lin, Yusheng Liu, Shunzhi Zhu, Tatsat R. Patel, Vincent M. Tutino, Maysam Orouskhani, Huayu Wang, Mahmud Mossa-Basha, Chengcheng Zhu, Maximilian R. Rokuss, Yannick Kirchhoff, Nico Disch, Julius Holzschuh, Fabian Isensee, Klaus Maier-Hein, Yuki Sato, Sven Hirsch, Susanne Wegener, Bjoern Menze

The TopCoW dataset was the first public dataset with voxel-level annotations for thirteen possible CoW vessel components, enabled by virtual-reality (VR) technology.

Anatomy Benchmarking +1

Paper
Code

DrugAssist: A Large Language Model for Molecule Optimization

1 code implementation • 28 Dec 2023 • Geyan Ye, Xibao Cai, Houtim Lai, Xing Wang, Junhong Huang, Longyue Wang, Wei Liu, Xiangxiang Zeng

Recently, the impressive performance of large language models (LLMs) on a wide range of tasks has attracted an increasing number of attempts to apply LLMs in drug discovery.

Drug Discovery Language Modelling +1

122

Paper
Code

Experiential Co-Learning of Software-Developing Agents

1 code implementation • 28 Dec 2023 • Chen Qian, Yufan Dang, Jiahao Li, Wei Liu, Weize Chen, Cheng Yang, Zhiyuan Liu, Maosong Sun

Recent advancements in large language models (LLMs) have brought significant changes to various domains, especially through LLM-driven autonomous agents.

23,147

Paper
Code

What Makes Good Data for Alignment? A Comprehensive Study of Automatic Data Selection in Instruction Tuning

1 code implementation • 25 Dec 2023 • Wei Liu, Weihao Zeng, Keqing He, Yong Jiang, Junxian He

We present deita (short for Data-Efficient Instruction Tuning for Alignment), a series of models fine-tuned from LLaMA and Mistral models using data samples automatically selected with our proposed approach.

345

Paper
Code

DreamTuner: Single Image is Enough for Subject-Driven Generation

no code implementations • 21 Dec 2023 • Miao Hua, Jiawei Liu, Fei Ding, Wei Liu, Jie Wu, Qian He

Diffusion-based models have demonstrated impressive capabilities for text-to-image generation and are expected for personalized applications of subject-driven generation, which require the generation of customized concepts with one or a few reference images.

Text-to-Image Generation

Paper
Add Code

Decoupling Representation and Knowledge for Few-Shot Intent Classification and Slot Filling

no code implementations • 21 Dec 2023 • Jie Han, Yixiong Zou, Haozhao Wang, Jun Wang, Wei Liu, Yao Wu, Tao Zhang, Ruixuan Li

Therefore, current works first train a model on source domains with sufficiently labeled data, and then transfer the model to target domains where only rarely labeled data is available.

intent-classification Intent Classification +4

Paper
Add Code

T-Code: Simple Temporal Latent Code for Efficient Dynamic View Synthesis

no code implementations • 18 Dec 2023 • Zhenhuan Liu, Shuai Liu, Jie Yang, Wei Liu

Novel view synthesis for dynamic scenes is one of the spotlights in computer vision.

Monocular Reconstruction Novel View Synthesis +1

Paper
Add Code

Local Conditional Controlling for Text-to-Image Diffusion Models

no code implementations • 14 Dec 2023 • Yibo Zhao, Liang Peng, Yang Yang, Zekai Luo, Hengjia Li, Yao Chen, Wei Zhao, Qinglin Lu, Boxi Wu, Wei Liu

In this paper, we introduce a new simple yet practical task setting: local control.

Image Generation

Paper
Add Code

Enhancing the Rationale-Input Alignment for Self-explaining Rationalization

no code implementations • 7 Dec 2023 • Wei Liu, Haozhao Wang, Jun Wang, Zhiying Deng, Yuankai Zhang, Cheng Wang, Ruixuan Li

Rationalization empowers deep learning models with self-explaining capabilities through a cooperative game, where a generator selects a semantically consistent subset of the input as a rationale, and a subsequent predictor makes predictions based on the selected rationale.

Paper
Add Code

MobileUtr: Revisiting the relationship between light-weight CNN and Transformer for efficient medical image segmentation

1 code implementation • 4 Dec 2023 • Fenghe Tang, Bingkun Nian, Jianrui Ding, Quan Quan, Jie Yang, Wei Liu, S. Kevin Zhou

This work revisits the relationship between CNNs and Transformers in lightweight universal networks for medical image segmentation, aiming to integrate the advantages of both worlds at the infrastructure design level.

Image Segmentation Inductive Bias +3

Paper
Code

SRSNetwork: Siamese Reconstruction-Segmentation Networks based on Dynamic-Parameter Convolution

1 code implementation • 4 Dec 2023 • Bingkun Nian, Fenghe Tang, Jianrui Ding, Pingping Zhang, Jie Yang, S. Kevin Zhou, Wei Liu

In this paper, we present a high-performance deep neural network for weak target image segmentation, including medical image segmentation and infrared image segmentation.

Image Segmentation Medical Image Segmentation +2

Paper
Code

Noisy probing dose facilitated dose prediction for pencil beam scanning proton therapy: physics enhances generalizability

no code implementations • 2 Dec 2023 • Lian Zhang, Jason M. Holmes, Zhengliang Liu, Hongying Feng, Terence T. Sio, Carlos E. Vargas, Sameer R. Keole, Kristin Stützer, Sheng Li, Tianming Liu, Jiajian Shen, William W. Wong, Sujay A. Vora, Wei Liu

The noisy probing dose method showed better generalizability in the 6 outlier cases than the ROI-based and beam mask-based methods with 3D Gamma passing rates (for prostate cancer, targets: 89. 32%$\pm$1. 45% vs. 93. 48%$\pm$1. 51% vs. 96. 79%$\pm$0. 83%, OARs: 85. 87%$\pm$1. 73% vs. 91. 15%$\pm$1. 13% vs. 94. 29%$\pm$1. 01%).

Paper
Add Code

SmoothVideo: Smooth Video Synthesis with Noise Constraints on Diffusion Models for One-shot Video Tuning

1 code implementation • 29 Nov 2023 • Liang Peng, Haoran Cheng, Zheng Yang, Ruisi Zhao, Linxuan Xia, Chaotian Song, Qinglin Lu, Boxi Wu, Wei Liu

By applying the loss to existing one-shot video tuning methods, we significantly improve the overall consistency and smoothness of the generated videos.

Paper
Code

BadCLIP: Trigger-Aware Prompt Learning for Backdoor Attacks on CLIP

no code implementations • 26 Nov 2023 • Jiawang Bai, Kuofeng Gao, Shaobo Min, Shu-Tao Xia, Zhifeng Li, Wei Liu

Contrastive Vision-Language Pre-training, known as CLIP, has shown promising effectiveness in addressing downstream image recognition tasks.

Paper
Add Code

FBChain: A Blockchain-based Federated Learning Model with Efficiency and Secure Communication

no code implementations • 21 Nov 2023 • Yang Li, Chunhe Xia, Wei Liu, Weidong Zhou, Chen Chen, Tianbo Wang

This article proposes Blockchain-based Federated Learning (FBChain) model for federated learning parameter communication to overcome the above two problems.

Federated Learning

Paper
Add Code

Damped Proximal Augmented Lagrangian Method for weakly-Convex Problems with Convex Constraints

no code implementations • 15 Nov 2023 • Hari Dahal, Wei Liu, Yangyang Xu

For the former case, DPALM achieves the complexity of $\widetilde{\mathcal{O}}\left(\varepsilon^{-2. 5} \right)$ to produce an $\varepsilon$-KKT point by applying an accelerated proximal gradient (APG) method to each DPALM subproblem.

Paper
Add Code

Holistic Evaluation of GPT-4V for Biomedical Imaging

no code implementations • 10 Nov 2023 • Zhengliang Liu, Hanqi Jiang, Tianyang Zhong, Zihao Wu, Chong Ma, Yiwei Li, Xiaowei Yu, Yutong Zhang, Yi Pan, Peng Shu, Yanjun Lyu, Lu Zhang, Junjie Yao, Peixin Dong, Chao Cao, Zhenxiang Xiao, Jiaqi Wang, Huan Zhao, Shaochen Xu, Yaonai Wei, Jingyuan Chen, Haixing Dai, Peilong Wang, Hao He, Zewei Wang, Xinyu Wang, Xu Zhang, Lin Zhao, Yiheng Liu, Kai Zhang, Liheng Yan, Lichao Sun, Jun Liu, Ning Qiang, Bao Ge, Xiaoyan Cai, Shijie Zhao, Xintao Hu, Yixuan Yuan, Gang Li, Shu Zhang, Xin Zhang, Xi Jiang, Tuo Zhang, Dinggang Shen, Quanzheng Li, Wei Liu, Xiang Li, Dajiang Zhu, Tianming Liu

GPT-4V represents a breakthrough in artificial general intelligence (AGI) for computer vision, with applications in the biomedical domain.

Anatomy Image Captioning +1

Paper
Add Code

Evaluating Large Language Models in Ophthalmology

no code implementations • 7 Nov 2023 • Jason Holmes, Shuyuan Ye, Yiwei Li, Shi-Nan Wu, Zhengliang Liu, Zihao Wu, Jinyu Hu, Huan Zhao, Xi Jiang, Wei Liu, Hong Wei, Jie Zou, Tianming Liu, Yi Shao

Methods: A 100-item ophthalmology single-choice test was administered to three different LLMs (GPT-3. 5, GPT-4, and PaLM2) and three different professional levels (medical undergraduates, medical masters, and attending physicians), respectively.

Decision Making

Paper
Add Code

Evaluating multiple large language models in pediatric ophthalmology

no code implementations • 7 Nov 2023 • Jason Holmes, Rui Peng, Yiwei Li, Jinyu Hu, Zhengliang Liu, Zihao Wu, Huan Zhao, Xi Jiang, Wei Liu, Hong Wei, Jie Zou, Tianming Liu, Yi Shao

IMPORTANCE The response effectiveness of different large language models (LLMs) and various individuals, including medical students, graduate students, and practicing physicians, in pediatric ophthalmology consultations, has not been clearly established yet.

Multiple-choice

Paper
Add Code

Evaluating the Potential of Leading Large Language Models in Reasoning Biology Questions

no code implementations • 5 Nov 2023 • Xinyu Gong, Jason Holmes, Yiwei Li, Zhengliang Liu, Qi Gan, Zihao Wu, Jianli Zhang, Yusong Zou, Yuxi Teng, Tian Jiang, Hongtu Zhu, Wei Liu, Tianming Liu, Yajun Yan

Recent advances in Large Language Models (LLMs) have presented new opportunities for integrating Artificial General Intelligence (AGI) into biological research and education.

Logical Reasoning Multiple-choice

Paper
Add Code

Discussing the Spectrum of Physics-Enhanced Machine Learning; a Survey on Structural Mechanics Applications

no code implementations • 31 Oct 2023 • Marcus Haywood-Alexander, Wei Liu, Kiran Bacsa, Zhilu Lai, Eleni Chatzi

The intersection of physics and machine learning has given rise to the physics-enhanced machine learning (PEML) paradigm, aiming to improve the capabilities and reduce the individual shortcomings of data- or physics-only methods.

Paper
Add Code

From Indeterminacy to Determinacy: Augmenting Logical Reasoning Capabilities with Large Language Models

1 code implementation • 28 Oct 2023 • Hongda Sun, Weikai Xu, Wei Liu, Jian Luan, Bin Wang, Shuo Shang, Ji-Rong Wen, Rui Yan

To address these challenges, we propose DetermLR, a novel reasoning framework that formulates the reasoning process as a transformational journey from indeterminate premises to determinate ones.

Logical Reasoning

Paper
Code

Joint Entity and Relation Extraction with Span Pruning and Hypergraph Neural Networks

1 code implementation • 26 Oct 2023 • Zhaohui Yan, Songlin Yang, Wei Liu, Kewei Tu

Also, most of current ERE models do not take into account higher-order interactions between multiple entities and relations, while higher-order modeling could be beneficial. In this work, we propose HyperGraph neural network for ERE ($\hgnn{}$), which is built upon the PL-marker (a state-of-the-art marker-based pipleline model).

Joint Entity and Relation Extraction NER +1

Paper
Code

Simple Hardware-Efficient PCFGs with Independent Left and Right Productions

1 code implementation • 23 Oct 2023 • Wei Liu, Songlin Yang, Yoon Kim, Kewei Tu

Scaling dense PCFGs to thousands of nonterminals via a low-rank parameterization of the rule probability tensor has been shown to be beneficial for unsupervised parsing.

Constituency Grammar Induction Language Modelling

Paper
Code

Benchmarking a foundation LLM on its ability to re-label structure names in accordance with the AAPM TG-263 report

no code implementations • 5 Oct 2023 • Jason Holmes, Lian Zhang, Yuzhen Ding, Hongying Feng, Zhengliang Liu, Tianming Liu, William W. Wong, Sujay A. Vora, Jonathan B. Ashman, Wei Liu

Conclusions: Given the accuracy of GPT-4 in re-labeling structure names of both target volumes and normal tissues as presented in this work, LLMs are poised to be the preferred method for standardizing structure names in radiation oncology, especially considering the rapid advancements in LLM capabilities that are likely to continue.

Benchmarking

Paper
Add Code

LanguageBind: Extending Video-Language Pretraining to N-modality by Language-based Semantic Alignment

4 code implementations • 3 Oct 2023 • Bin Zhu, Bin Lin, Munan Ning, Yang Yan, Jiaxi Cui, Hongfa Wang, Yatian Pang, Wenhao Jiang, Junwu Zhang, Zongwei Li, Wancai Zhang, Zhifeng Li, Wei Liu, Li Yuan

We thus propose VIDAL-10M with Video, Infrared, Depth, Audio and their corresponding Language, naming as VIDAL-10M.

Ranked #1 on Zero-shot Audio Classification on VGG-Sound (using extra training data)

Audio Classification Contrastive Learning +11

2,417

Paper
Code

D-Separation for Causal Self-Explanation

1 code implementation • NeurIPS 2023 • Wei Liu, Jun Wang, Haozhao Wang, Ruixuan Li, Zhiying Deng, Yuankai Zhang, Yang Qiu

Instead of attempting to rectify the issues of the MMI criterion, we propose a novel criterion to uncover the causal rationale, termed the Minimum Conditional Dependence (MCD) criterion, which is grounded on our finding that the non-causal features and the target label are \emph{d-separated} by the causal rationale.

Paper
Code

Sparsely Shared LoRA on Whisper for Child Speech Recognition

no code implementations • 21 Sep 2023 • Wei Liu, Ying Qin, Zhiyuan Peng, Tan Lee

Child speech, as a representative type of low-resource speech, is leveraged for adaptation.

Automatic Speech Recognition Automatic Speech Recognition (ASR) +2

Paper
Add Code

CoMFLP: Correlation Measure based Fast Search on ASR Layer Pruning

1 code implementation • 21 Sep 2023 • Wei Liu, Zhiyuan Peng, Tan Lee

The search process is carried out in two steps: (1) coarse search: to determine top $K$ candidates by pruning the most redundant layers based on the correlation matrix; (2) fine search: to select the best pruning proposal among $K$ candidates using a task-specific evaluation metric.

speech-recognition Speech Recognition

Paper
Code

PolicyGPT: Automated Analysis of Privacy Policies with Large Language Models

no code implementations • 19 Sep 2023 • Chenhao Tang, Zhengliang Liu, Chong Ma, Zihao Wu, Yiwei Li, Wei Liu, Dajiang Zhu, Quanzheng Li, Xiang Li, Tianming Liu, Lei Fan

In this study, we investigate a privacy policy text analysis framework PolicyGPT based on the LLM.

Sentence Zero-Shot Learning

Paper
Add Code

RadOnc-GPT: A Large Language Model for Radiation Oncology

no code implementations • 18 Sep 2023 • Zhengliang Liu, Peilong Wang, Yiwei Li, Jason Holmes, Peng Shu, Lian Zhang, Chenbin Liu, Ninghao Liu, Dajiang Zhu, Xiang Li, Quanzheng Li, Samir H. Patel, Terence T. Sio, Tianming Liu, Wei Liu

This paper presents RadOnc-GPT, a large language model specialized for radiation oncology through advanced tuning methods.

Language Modelling Large Language Model +1

Paper
Add Code

Towards Better Data Exploitation in Self-Supervised Monocular Depth Estimation

1 code implementation • 11 Sep 2023 • Jinfeng Liu, Lingtong Kong, Jie Yang, Wei Liu

Additionally, we introduce the detail-enhanced DepthNet with an extra full-scale branch in the encoder and a grid decoder to enhance the restoration of fine details in depth maps.

Data Augmentation Monocular Depth Estimation

Paper
Code

DiffusionEngine: Diffusion Model is Scalable Data Engine for Object Detection

no code implementations • 7 Sep 2023 • Manlin Zhang, Jie Wu, Yuxi Ren, Ming Li, Jie Qin, Xuefeng Xiao, Wei Liu, Rui Wang, Min Zheng, Andy J. Ma

This paper reveals that the recently developed Diffusion Model is a scalable data engine for object detection.

Data Augmentation object-detection +1

Paper
Add Code

MathAttack: Attacking Large Language Models Towards Math Solving Ability

no code implementations • 4 Sep 2023 • ZiHao Zhou, Qiufeng Wang, Mingyu Jin, Jie Yao, Jianan Ye, Wei Liu, Wei Wang, Xiaowei Huang, Kaizhu Huang

Instead of attacking prompts in the use of LLMs, we propose a MathAttack model to attack MWP samples which are closer to the essence of security in solving math problems.

Adversarial Attack GSM8K +1

Paper
Add Code

Radiology-Llama2: Best-in-Class Large Language Model for Radiology

no code implementations • 29 Aug 2023 • Zhengliang Liu, Yiwei Li, Peng Shu, Aoxiao Zhong, Longtao Yang, Chao Ju, Zihao Wu, Chong Ma, Jie Luo, Cheng Chen, Sekeun Kim, Jiang Hu, Haixing Dai, Lin Zhao, Dajiang Zhu, Jun Liu, Wei Liu, Dinggang Shen, Tianming Liu, Quanzheng Li, Xiang Li

This paper introduces Radiology-Llama2, a large language model specialized for radiology through a process known as instruction tuning.

Language Modelling Large Language Model

Paper
Add Code

Master-slave Deep Architecture for Top-K Multi-armed Bandits with Non-linear Bandit Feedback and Diversity Constraints

1 code implementation • 24 Aug 2023 • Hanchi Huang, Li Shen, Deheng Ye, Wei Liu

We propose a novel master-slave architecture to solve the top-$K$ combinatorial multi-armed bandits problem with non-linear bandit feedback and diversity constraints, which, to the best of our knowledge, is the first combinatorial bandits setting considering diversity constraints under bandit feedback.

Multi-Armed Bandits

Paper
Code

SeqGPT: An Out-of-the-box Large Language Model for Open Domain Sequence Understanding

1 code implementation • 21 Aug 2023 • Tianyu Yu, Chengyue Jiang, Chao Lou, Shen Huang, Xiaobin Wang, Wei Liu, Jiong Cai, Yangning Li, Yinghui Li, Kewei Tu, Hai-Tao Zheng, Ningyu Zhang, Pengjun Xie, Fei Huang, Yong Jiang

However, LLMs are sometimes too footloose for natural language understanding (NLU) tasks which always have restricted output and input format.

Entity Typing Event Extraction +3

189

Paper
Code

Recurrent Multi-scale Transformer for High-Resolution Salient Object Detection

1 code implementation • 7 Aug 2023 • Xinhao Deng, Pingping Zhang, Wei Liu, Huchuan Lu

To address above issues, in this work, we first propose a new HRS10K dataset, which contains 10, 500 high-quality annotated images at 2K-8K resolution.

2k 8k +3

Paper
Code

LLDiffusion: Learning Degradation Representations in Diffusion Models for Low-Light Image Enhancement

1 code implementation • 27 Jul 2023 • Tao Wang, Kaihao Zhang, Ziqian Shao, Wenhan Luo, Bjorn Stenger, Tae-Kyun Kim, Wei Liu, Hongdong Li

In this paper, we address this limitation by proposing a degradation-aware learning scheme for LLIE using diffusion models, which effectively integrates degradation and image priors into the diffusion process, resulting in improved image enhancement.

Image Generation Low-Light Image Enhancement

Paper
Code

Evaluating Large Language Models for Radiology Natural Language Processing

1 code implementation • 25 Jul 2023 • Zhengliang Liu, Tianyang Zhong, Yiwei Li, Yutong Zhang, Yi Pan, Zihao Zhao, Peixin Dong, Chao Cao, Yuxiao Liu, Peng Shu, Yaonai Wei, Zihao Wu, Chong Ma, Jiaqi Wang, Sheng Wang, Mengyue Zhou, Zuowei Jiang, Chunlin Li, Jason Holmes, Shaochen Xu, Lu Zhang, Haixing Dai, Kai Zhang, Lin Zhao, Yuanhao Chen, Xu Liu, Peilong Wang, Pingkun Yan, Jun Liu, Bao Ge, Lichao Sun, Dajiang Zhu, Xiang Li, Wei Liu, Xiaoyan Cai, Xintao Hu, Xi Jiang, Shu Zhang, Xin Zhang, Tuo Zhang, Shijie Zhao, Quanzheng Li, Hongtu Zhu, Dinggang Shen, Tianming Liu

The rise of large language models (LLMs) has marked a pivotal shift in the field of natural language processing (NLP).

802

Paper
Code

Semi-supervised Cycle-GAN for face photo-sketch translation in the wild

no code implementations • 18 Jul 2023 • Chaofeng Chen, Wei Liu, Xiao Tan, Kwan-Yee K. Wong

Experiments show that SCG achieves competitive performance on public benchmarks and superior results on photos in the wild.

Translation

Paper
Add Code

A Novel Multi-Task Model Imitating Dermatologists for Accurate Differential Diagnosis of Skin Diseases in Clinical Images

no code implementations • 17 Jul 2023 • Yan-Jie Zhou, Wei Liu, Yuan Gao, Jing Xu, Le Lu, Yuping Duan, Hao Cheng, Na Jin, Xiaoyong Man, Shuang Zhao, Yu Wang

Skin diseases are among the most prevalent health issues, and accurate computer-aided diagnosis methods are of importance for both dermatologists and patients.

Multi-Task Learning

Paper
Add Code

Communicative Agents for Software Development

1 code implementation • 16 Jul 2023 • Chen Qian, Xin Cong, Wei Liu, Cheng Yang, Weize Chen, Yusheng Su, Yufan Dang, Jiahao Li, Juyuan Xu, Dahai Li, Zhiyuan Liu, Maosong Sun

At the core of this paradigm lies ChatDev, a virtual chat-powered software development company that mirrors the established waterfall model, meticulously dividing the development process into four distinct chronological stages: designing, coding, testing, and documenting.

Decision Making

23,147

Paper
Code

First-order Methods for Affinely Constrained Composite Non-convex Non-smooth Problems: Lower Complexity Bound and Near-optimal Methods

no code implementations • 14 Jul 2023 • Wei Liu, Qihang Lin, Yangyang Xu

In this paper, we make the first attempt to establish lower complexity bounds of FOMs for solving a class of composite non-convex non-smooth optimization with linear constraints.

Paper
Add Code

SAMAug: Point Prompt Augmentation for Segment Anything Model

1 code implementation • 3 Jul 2023 • Haixing Dai, Chong Ma, Zhiling Yan, Zhengliang Liu, Enze Shi, Yiwei Li, Peng Shu, Xiaozheng Wei, Lin Zhao, Zihao Wu, Fang Zeng, Dajiang Zhu, Wei Liu, Quanzheng Li, Lichao Sun, Shu Zhang Tianming Liu, Xiang Li

Starting with an initial point prompt, SAM produces an initial mask, which is then fed into our proposed SAMAug to generate augmented point prompts.

Image Segmentation Prompt Engineering +2

Paper
Code

DreamIdentity: Improved Editability for Efficient Face-identity Preserved Image Generation

no code implementations • 1 Jul 2023 • Zhuowei Chen, Shancheng Fang, Wei Liu, Qian He, Mengqi Huang, Yongdong Zhang, Zhendong Mao

While large-scale pre-trained text-to-image models can synthesize diverse and high-quality human-centric images, an intractable problem is how to preserve the face identity for conditioned face images.

Image Generation

Paper
Add Code

CMATH: Can Your Language Model Pass Chinese Elementary School Math Test?

no code implementations • 29 Jun 2023 • Tianwen Wei, Jian Luan, Wei Liu, Shuang Dong, Bin Wang

We present the Chinese Elementary School Math Word Problems (CMATH) dataset, comprising 1. 7k elementary school-level math word problems with detailed annotations, source from actual Chinese workbooks and exams.

Language Modelling Math +1

Paper
Add Code

Segment Anything Model (SAM) for Radiation Oncology

no code implementations • 20 Jun 2023 • Lian Zhang, Zhengliang Liu, Lu Zhang, Zihao Wu, Xiaowei Yu, Jason Holmes, Hongying Feng, Haixing Dai, Xiang Li, Quanzheng Li, Dajiang Zhu, Tianming Liu, Wei Liu

Given that SAM, a model pre-trained purely on natural images, can handle the delineation of OARs from medical images with clinically acceptable accuracy, these results highlight SAM's robust generalization capabilities with consistent accuracy in automatic segmentation for radiotherapy.

Segmentation

Paper
Add Code

UniMC: A Unified Framework for Long-Term Memory Conversation via Relevance Representation Learning

no code implementations • 18 Jun 2023 • Kang Zhao, Wei Liu, Jian Luan, Minglei Gao, Li Qian, Hanlin Teng, Bin Wang

In this paper, we propose a Unified framework for Long-term Memory Conversations (UniMC), which increases the connection between different stages by learning relevance representation.

Representation Learning Retrieval

Paper
Add Code

Radiology-GPT: A Large Language Model for Radiology

no code implementations • 14 Jun 2023 • Zhengliang Liu, Aoxiao Zhong, Yiwei Li, Longtao Yang, Chao Ju, Zihao Wu, Chong Ma, Peng Shu, Cheng Chen, Sekeun Kim, Haixing Dai, Lin Zhao, Lichao Sun, Dajiang Zhu, Jun Liu, Wei Liu, Dinggang Shen, Xiang Li, Quanzheng Li, Tianming Liu

We introduce Radiology-GPT, a large language model for radiology.

Language Modelling Large Language Model

Paper
Add Code

Deep learning radiomics for assessment of gastroesophageal varices in people with compensated advanced chronic liver disease

no code implementations • 13 Jun 2023 • Lan Wang, Ruiling He, Lili Zhao, Jia Wang, Zhengzi Geng, Tao Ren, Guo Zhang, Peng Zhang, Kaiqiang Tang, Chaofei Gao, Fei Chen, Liting Zhang, Yonghe Zhou, Xin Li, Fanbin He, Hui Huan, Wenjuan Wang, Yunxiao Liang, Juan Tang, Fang Ai, Tingyu Wang, Liyun Zheng, Zhongwei Zhao, Jiansong Ji, Wei Liu, Jiaojiao Xu, Bo Liu, Xuemei Wang, Yao Zhang, Qiong Yan, Muhan Lv, Xiaomei Chen, Shuhua Zhang, Yihua Wang, Yang Liu, Li Yin, Yanni Liu, Yanqing Huang, Yunfang Liu, Kun Wang, Meiqin Su, Li Bian, Ping An, Xin Zhang, Linxue Qian, Shao Li, Xiaolong Qi

Validation analysis revealed that the AUCs of DLRP were 0. 91 for GEV (95% CI 0. 90 to 0. 93, p < 0. 05) and 0. 88 for HRV (95% CI 0. 86 to 0. 89, p < 0. 01), which were significantly and robustly better than canonical risk indicators, including the value of LSM and SSM.

Paper
Add Code

Global and Local Semantic Completion Learning for Vision-Language Pre-training

1 code implementation • 12 Jun 2023 • Rong-Cheng Tu, Yatai Ji, Jie Jiang, Weijie Kong, Chengfei Cai, Wenzhe Zhao, Hongfa Wang, Yujiu Yang, Wei Liu

MGSC promotes learning more representative global features, which have a great impact on the performance of downstream tasks, while MLTC reconstructs modal-fusion local tokens, further enhancing accurate comprehension of multimodal data.

Language Modelling Masked Language Modeling +5

Paper
Code

Annotation-Inspired Implicit Discourse Relation Classification with Auxiliary Discourse Connective Generation

1 code implementation • 10 Jun 2023 • Wei Liu, Michael Strube

Implicit discourse relation classification is a challenging task due to the absence of discourse connectives.

Implicit Discourse Relation Classification Relation

Paper
Code

Modeling Structural Similarities between Documents for Coherence Assessment with Graph Convolutional Networks

1 code implementation • 10 Jun 2023 • Wei Liu, Xiyan Fu, Michael Strube

Coherence is an important aspect of text quality, and various approaches have been applied to coherence modeling.

Automated Essay Scoring

Paper
Code

Artificial General Intelligence for Medical Imaging

no code implementations • 8 Jun 2023 • Xiang Li, Lu Zhang, Zihao Wu, Zhengliang Liu, Lin Zhao, Yixuan Yuan, Jun Liu, Gang Li, Dajiang Zhu, Pingkun Yan, Quanzheng Li, Wei Liu, Tianming Liu, Dinggang Shen

In this review, we explore the potential applications of Artificial General Intelligence (AGI) models in healthcare, focusing on foundational Large Language Models (LLMs), Large Vision Models, and Large Multimodal Models.

Paper
Add Code

GridFormer: Residual Dense Transformer with Grid Structure for Image Restoration in Adverse Weather Conditions

no code implementations • 29 May 2023 • Tao Wang, Kaihao Zhang, Ziqian Shao, Wenhan Luo, Bjorn Stenger, Tong Lu, Tae-Kyun Kim, Wei Liu, Hongdong Li

Second, we introduce a residual dense transformer block (RDTB) as the final GridFormer layer.

Image Restoration Rain Removal

Paper
Add Code

Exploring the Compositional Generalization in Context Dependent Text-to-SQL Parsing

no code implementations • 29 May 2023 • Aiwei Liu, Wei Liu, Xuming Hu, Shuang Li, Fukun Ma, Yawen Yang, Lijie Wen

Based on these observations, we propose a method named \texttt{p-align} to improve the compositional generalization of Text-to-SQL models.

SQL Parsing Text-To-SQL

Paper
Add Code

BiomedGPT: A Unified and Generalist Biomedical Generative Pre-trained Transformer for Vision, Language, and Multimodal Tasks

1 code implementation • 26 May 2023 • Kai Zhang, Jun Yu, Eashan Adhikarla, Rong Zhou, Zhiling Yan, Yixin Liu, Zhengliang Liu, Lifang He, Brian Davison, Xiang Li, Hui Ren, Sunyang Fu, James Zou, Wei Liu, Jing Huang, Chen Chen, Yuyin Zhou, Tianming Liu, Xun Chen, Yong Chen, Quanzheng Li, Hongfang Liu, Lichao Sun

Conventional task- and modality-specific artificial intelligence (AI) models are inflexible in real-world deployment and maintenance for biomedicine.

Ranked #1 on Text Summarization on MeQSum

Image Captioning Medical Visual Question Answering +5

287

Paper
Code

KeyPosS: Plug-and-Play Facial Landmark Detection through GPS-Inspired True-Range Multilateration

1 code implementation • 25 May 2023 • Xu Bao, Zhi-Qi Cheng, Jun-Yan He, Chenyang Li, Wangmeng Xiang, Jingdong Sun, Hanbing Liu, Wei Liu, Bin Luo, Yifeng Geng, Xuansong Xie

By spearheading the integration of Multilateration with facial analysis, KeyPosS marks a paradigm shift in facial landmark detection.

Benchmarking Face Recognition +3

Paper
Code

Decoupled Rationalization with Asymmetric Learning Rates: A Flexible Lipschitz Restraint

1 code implementation • 23 May 2023 • Wei Liu, Jun Wang, Haozhao Wang, Ruixuan Li, Yang Qiu, Yuankai Zhang, Jie Han, Yixiong Zou

However, such a cooperative game may incur the degeneration problem where the predictor overfits to the uninformative pieces generated by a not yet well-trained generator and in turn, leads the generator to converge to a sub-optimal model that tends to select senseless pieces.

Paper
Code

Diffusion-Based Mel-Spectrogram Enhancement for Personalized Speech Synthesis with Found Data

1 code implementation • 18 May 2023 • Yusheng Tian, Wei Liu, Tan Lee

One way to address this problem is to pre-enhance the speech with an enhancement model and then use the enhanced data for text-to-speech (TTS) model training.

Speech Enhancement Speech Synthesis

Paper
Code

MGR: Multi-generator Based Rationalization

1 code implementation • 8 May 2023 • Wei Liu, Haozhao Wang, Jun Wang, Ruixuan Li, Xinyang Li, Yuankai Zhang, Yang Qiu

Rationalization is to employ a generator and a predictor to construct a self-explaining NLP model in which the generator selects a subset of human-intelligible pieces of the input text to the following predictor.

Paper
Code

Instruction-ViT: Multi-Modal Prompts for Instruction Learning in ViT

no code implementations • 29 Apr 2023 • Zhenxiang Xiao, Yuzhong Chen, Lu Zhang, Junjie Yao, Zihao Wu, Xiaowei Yu, Yi Pan, Lin Zhao, Chong Ma, Xinyu Liu, Wei Liu, Xiang Li, Yixuan Yuan, Dinggang Shen, Dajiang Zhu, Tianming Liu, Xi Jiang

Prompts have been proven to play a crucial role in large language models, and in recent years, vision models have also been using prompts to improve scalability for multiple downstream tasks.

Image Classification

Paper
Add Code

Img2Vec: A Teacher of High Token-Diversity Helps Masked AutoEncoders

no code implementations • 25 Apr 2023 • Heng Pan, Chenyang Liu, Wenxiao Wang, Li Yuan, Hongfa Wang, Zhifeng Li, Wei Liu

To study which type of deep features is appropriate for MIM as a learning target, we propose a simple MIM framework with serials of well-trained self-supervised models to convert an Image to a feature Vector as the learning target of MIM, where the feature extractor is also known as a teacher model.

Attribute Vocal Bursts Intensity Prediction

Paper
Add Code

Towards Effective and Interpretable Human-Agent Collaboration in MOBA Games: A Communication Perspective

no code implementations • 23 Apr 2023 • Yiming Gao, Feiyu Liu, Liang Wang, Zhenjie Lian, Weixuan Wang, Siqin Li, Xianliang Wang, Xianhan Zeng, Rundong Wang, Jiawei Wang, Qiang Fu, Wei Yang, Lanxiao Huang, Wei Liu

MOBA games, e. g., Dota2 and Honor of Kings, have been actively used as the testbed for the recent AI research on games, and various AI systems have been developed at the human level so far.

Paper
Add Code

Deep-Learning-based Fast and Accurate 3D CT Deformable Image Registration in Lung Cancer

no code implementations • 21 Apr 2023 • Yuzhen Ding, Hongying Feng, Yunze Yang, Jason Holmes, Zhengliang Liu, David Liu, William W. Wong, Nathan Y. Yu, Terence T. Sio, Steven E. Schild, Baoxin Li, Wei Liu

Conclusion: A patient-specific vision-transformer-based network was developed and shown to be accurate and efficient to reconstruct 3D CT images from kV images.

Anatomy Image Registration

Paper
Add Code

Exploring the Trade-Offs: Unified Large Language Models vs Local Fine-Tuned Models for Highly-Specific Radiology NLI Task

no code implementations • 18 Apr 2023 • Zihao Wu, Lu Zhang, Chao Cao, Xiaowei Yu, Haixing Dai, Chong Ma, Zhengliang Liu, Lin Zhao, Gang Li, Wei Liu, Quanzheng Li, Dinggang Shen, Xiang Li, Dajiang Zhu, Tianming Liu

To this end, in this study, we evaluate the performance of ChatGPT/GPT-4 on a radiology NLI task and compare it to other models fine-tuned specifically on task-related data samples.

Specificity Task 2

Paper
Add Code

Evaluating Large Language Models on a Highly-specialized Topic, Radiation Oncology Physics

no code implementations • 1 Apr 2023 • Jason Holmes, Zhengliang Liu, Lian Zhang, Yuzhen Ding, Terence T. Sio, Lisa A. McGee, Jonathan B. Ashman, Xiang Li, Tianming Liu, Jiajian Shen, Wei Liu

We present the first study to investigate Large Language Models (LLMs) in answering radiation oncology physics questions.

Paper
Add Code

SoftCLIP: Softer Cross-modal Alignment Makes CLIP Stronger

no code implementations • 30 Mar 2023 • Yuting Gao, Jinfeng Liu, Zihan Xu, Tong Wu Enwei Zhang, Wei Liu, Jie Yang, Ke Li, Xing Sun

During the preceding biennium, vision-language pre-training has achieved noteworthy success on several downstream tasks.

Zero-Shot Learning

Paper
Add Code

Plug-and-Play Regulators for Image-Text Matching

1 code implementation • 23 Mar 2023 • Haiwen Diao, Ying Zhang, Wei Liu, Xiang Ruan, Huchuan Lu

Exploiting fine-grained correspondence and visual-semantic alignments has shown great potential in image-text matching.

Ranked #2 on Image Retrieval on Flickr30K 1K test

Image Retrieval Image-text matching +1

Paper
Code

DeID-GPT: Zero-shot Medical Text De-Identification by GPT-4

1 code implementation • 20 Mar 2023 • Zhengliang Liu, Yue Huang, Xiaowei Yu, Lu Zhang, Zihao Wu, Chao Cao, Haixing Dai, Lin Zhao, Yiwei Li, Peng Shu, Fang Zeng, Lichao Sun, Wei Liu, Dinggang Shen, Quanzheng Li, Tianming Liu, Dajiang Zhu, Xiang Li

The digitization of healthcare has facilitated the sharing and re-using of medical data but has also raised concerns about confidentiality and privacy.

Benchmarking De-identification +4

Paper
Code

STGIC: a graph and image convolution-based method for spatial transcriptomic clustering

no code implementations • 19 Mar 2023 • Chen Zhang, Junhui Gao, Lingxin Kong, Guangshuo cao, Xiangyu Guo, Wei Liu

Spatial transcriptomic (ST) clustering employs spatial and transcription information to group spots spatially coherent and transcriptionally similar together into the same spatial domain.

Clustering Contrastive Learning +1

Paper
Add Code

CrossFormer++: A Versatile Vision Transformer Hinging on Cross-scale Attention

1 code implementation • 13 Mar 2023 • Wenxiao Wang, Wei Chen, Qibo Qiu, Long Chen, Boxi Wu, Binbin Lin, Xiaofei He, Wei Liu

On the one hand, CEL blends each token with multiple patches of different scales, providing the self-attention module itself with cross-scale features.

Image Classification Instance Segmentation +3

319

Paper
Code

OmniForce: On Human-Centered, Large Model Empowered and Cloud-Edge Collaborative AutoML System

no code implementations • 1 Mar 2023 • Chao Xue, Wei Liu, Shuai Xie, Zhenfang Wang, Jiaxing Li, Xuyang Peng, Liang Ding, Shanshan Zhao, Qiong Cao, Yibo Yang, Fengxiang He, Bohua Cai, Rongcheng Bian, Yiyan Zhao, Heliang Zheng, Xiangyang Liu, Dongkai Liu, Daqing Liu, Li Shen, Chang Li, Shijin Zhang, Yukang Zhang, Guanpu Chen, Shixiang Chen, Yibing Zhan, Jing Zhang, Chaoyue Wang, DaCheng Tao

Automated machine learning (AutoML) seeks to build ML models with minimal human effort.

AutoML

Paper
Add Code

Frauds Bargain Attack: Generating Adversarial Text Samples via Word Manipulation Process

1 code implementation • 1 Mar 2023 • Mingze Ni, Zhensu Sun, Wei Liu

In response, this study proposes a new method called the Fraud's Bargain Attack (FBA), which uses a randomization mechanism to expand the search space and produce high-quality adversarial examples with a higher probability of success.

Adversarial Text Sentence

Paper
Code

Q-Cogni: An Integrated Causal Reinforcement Learning Framework

no code implementations • 26 Feb 2023 • Cris Cunha, Wei Liu, Tim French, Ajmal Mian

We present Q-Cogni, an algorithmically integrated causal reinforcement learning framework that redesigns Q-Learning with an autonomous causal structure discovery method to improve the learning process with causal inference.

Causal Inference Decision Making +3

Paper
Add Code

AugGPT: Leveraging ChatGPT for Text Data Augmentation

no code implementations • 25 Feb 2023 • Haixing Dai, Zhengliang Liu, Wenxiong Liao, Xiaoke Huang, Yihan Cao, Zihao Wu, Lin Zhao, Shaochen Xu, Wei Liu, Ninghao Liu, Sheng Li, Dajiang Zhu, Hongmin Cai, Lichao Sun, Quanzheng Li, Dinggang Shen, Tianming Liu, Xiang Li

Text data augmentation is an effective strategy for overcoming the challenge of limited sample sizes in many natural language processing (NLP) tasks.

Data Augmentation Few-Shot Learning +3

Paper
Add Code

Mask-guided BERT for Few Shot Text Classification

no code implementations • 21 Feb 2023 • Wenxiong Liao, Zhengliang Liu, Haixing Dai, Zihao Wu, Yiyang Zhang, Xiaoke Huang, Yuzhong Chen, Xi Jiang, Wei Liu, Dajiang Zhu, Tianming Liu, Sheng Li, Xiang Li, Hongmin Cai

The main challenge of FSL is the difficulty of training robust models on small amounts of samples, which frequently leads to overfitting.

Contrastive Learning Few-Shot Learning +2

Paper
Add Code

Leveraging phone-level linguistic-acoustic similarity for utterance-level pronunciation scoring

no code implementations • 21 Feb 2023 • Wei Liu, Kaiqi Fu, Xiaohai Tian, Shuju Shi, Wei Li, Zejun Ma, Tan Lee

Recent studies on pronunciation scoring have explored the effect of introducing phone embeddings as reference pronunciation, but mostly in an implicit manner, i. e., addition or concatenation of reference phone embedding and actual pronunciation of the target phone as the phone-level pronunciation quality representation.

Paper
Add Code

An ASR-free Fluency Scoring Approach with Self-Supervised Learning

no code implementations • 20 Feb 2023 • Wei Liu, Kaiqi Fu, Xiaohai Tian, Shuju Shi, Wei Li, Zejun Ma, Tan Lee

A typical fluency scoring system generally relies on an automatic speech recognition (ASR) system to obtain time stamps in input speech for either the subsequent calculation of fluency-related features or directly modeling speech fluency with an end-to-end approach.

Automatic Speech Recognition Automatic Speech Recognition (ASR) +3

Paper
Add Code

Sample Dropout: A Simple yet Effective Variance Reduction Technique in Deep Policy Optimization

1 code implementation • 5 Feb 2023 • Zichuan Lin, Xiapeng Wu, Mingfei Sun, Deheng Ye, Qiang Fu, Wei Yang, Wei Liu

Recent success in Deep Reinforcement Learning (DRL) methods has shown that policy optimization with respect to an off-policy distribution via importance sampling is effective for sample reuse.

Paper
Code

Design Booster: A Text-Guided Diffusion Model for Image Translation with Spatial Layout Preservation

no code implementations • 5 Feb 2023 • Shiqi Sun, Shancheng Fang, Qian He, Wei Liu

Specifically, our method co-encodes images and text into a new domain during the training phase.

Translation

Paper
Add Code

CFFT-GAN: Cross-domain Feature Fusion Transformer for Exemplar-based Image Translation

no code implementations • 3 Feb 2023 • Tianxiang Ma, Bingchuan Li, Wei Liu, Miao Hua, Jing Dong, Tieniu Tan

In this paper, we propose a more general learning approach by considering two domain features as a whole and learning both inter-domain correspondence and intra-domain potential information interactions.

Translation

Paper
Add Code

HDFormer: High-order Directed Transformer for 3D Human Pose Estimation

1 code implementation • 3 Feb 2023 • Hanyuan Chen, Jun-Yan He, Wangmeng Xiang, Zhi-Qi Cheng, Wei Liu, Hanbing Liu, Bin Luo, Yifeng Geng, Xuansong Xie

Human pose estimation is a challenging task due to its structured data sequence nature.

Ranked #74 on 3D Human Pose Estimation on Human3.6M

3D Human Pose Estimation 3D Pose Estimation +1

Paper
Code

ReGANIE: Rectifying GAN Inversion Errors for Accurate Real Image Editing

no code implementations • 31 Jan 2023 • Bingchuan Li, Tianxiang Ma, Peng Zhang, Miao Hua, Wei Liu, Qian He, Zili Yi

Specifically, in Phase I, a W-space-oriented StyleGAN inversion network is trained and used to perform image inversion and editing, which assures the editability but sacrifices reconstruction quality.

Image Generation

Paper
Add Code

Planning and Tracking Control of Full Drive-by-Wire Electric Vehicles in Unstructured Scenario

no code implementations • 7 Jan 2023 • Guoying Chen, Min Hua, Wei Liu, Jinhai Wang, Shunhui Song, Changsheng Liu

Full drive-by-wire electric vehicles (FDWEV) with X-by-wire technology can achieve independent driving, braking, and steering of each wheel, providing a good application platform for autonomous driving technology.

Autonomous Driving Model Predictive Control

Paper
Add Code

Heterogeneous Diversity Driven Active Learning for Multi-Object Tracking

no code implementations • ICCV 2023 • Rui Li, Baopeng Zhang, Jun Liu, Wei Liu, Jian Zhao, Zhu Teng

HD-AMOT defines the diversified informative representation by encoding the geometric and semantic information, and formulates the frame inference strategy as a Markov decision process to learn an optimal sampling policy based on the designed informative representation.

Active Learning Multi-Object Tracking

Paper
Add Code

MoralDial: A Framework to Train and Evaluate Moral Dialogue Systems via Moral Discussions

1 code implementation • 21 Dec 2022 • Hao Sun, Zhexin Zhang, Fei Mi, Yasheng Wang, Wei Liu, Jianwei Cui, Bin Wang, Qun Liu, Minlie Huang

In this paper, we propose a framework, MoralDial to train and evaluate moral dialogue systems.

Paper
Code

Tencent AVS: A Holistic Ads Video Dataset for Multi-modal Scene Segmentation

no code implementations • 9 Dec 2022 • Jie Jiang, Zhimin Li, Jiangfeng Xiong, Rongwei Quan, Qinglin Lu, Wei Liu

Therefore, TAVS is distinguished from previous temporal segmentation datasets due to its multi-modal information, holistic view of categories, and hierarchical granularities.

Multi-Label Classification Scene Segmentation +3

Paper
Add Code

Seeing What You Miss: Vision-Language Pre-training with Semantic Completion Learning

1 code implementation • CVPR 2023 • Yatai Ji, RongCheng Tu, Jie Jiang, Weijie Kong, Chengfei Cai, Wenzhe Zhao, Hongfa Wang, Yujiu Yang, Wei Liu

Cross-modal alignment is essential for vision-language pre-training (VLP) models to learn the correct corresponding information across different modalities.

Ranked #8 on Zero-Shot Video Retrieval on LSMDC

Language Modelling Masked Language Modeling +6

Paper
Code

PointCA: Evaluating the Robustness of 3D Point Cloud Completion Models Against Adversarial Examples

no code implementations • 22 Nov 2022 • Shengshan Hu, Junwei Zhang, Wei Liu, Junhui Hou, Minghui Li, Leo Yu Zhang, Hai Jin, Lichao Sun

In addition, existing attack approaches towards point cloud classifiers cannot be applied to the completion models due to different output forms and attack purposes.

Adversarial Attack Point Cloud Classification +2

Paper
Add Code

Curriculum-based Asymmetric Multi-task Reinforcement Learning

1 code implementation • 7 Nov 2022 • Hanchi Huang, Deheng Ye, Li Shen, Wei Liu

To mitigate the negative influence of customizing the one-off training order in curriculum-based AMTL, CAMRL switches its training mode between parallel single-task RL and asymmetric multi-task RL (MTRL), according to an indicator regarding the training time, the overall performance, and the performance gap among tasks.

Multi-Task Learning reinforcement-learning +1

Paper
Code

A Survey of Deep Face Restoration: Denoise, Super-Resolution, Deblur, Artifact Removal

1 code implementation • 5 Nov 2022 • Tao Wang, Kaihao Zhang, Xuanxi Chen, Wenhan Luo, Jiankang Deng, Tong Lu, Xiaochun Cao, Wei Liu, Hongdong Li, Stefanos Zafeiriou

Second, we discuss the challenges of face restoration.

Image Restoration Super-Resolution

367

Paper
Code

Model Compression for DNN-based Speaker Verification Using Weight Quantization

no code implementations • 31 Oct 2022 • Jingyu Li, Wei Liu, Zhaoyang Zhang, Jiong Wang, Tan Lee

Experimental results on VoxCeleb show that weight quantization is effective for compressing SV models.

Model Compression Quantization +1

Paper
Add Code

A Bibliometric Analysis and Review on Reinforcement Learning for Transportation Applications

no code implementations • 26 Oct 2022 • Can Li, Lei Bai, Lina Yao, S. Travis Waller, Wei Liu

Transportation is the backbone of the economy and urban development.

reinforcement-learning Reinforcement Learning (RL)

Paper
Add Code

Online LiDAR-Camera Extrinsic Parameters Self-checking

1 code implementation • 19 Oct 2022 • Pengjin Wei, Guohang Yan, Yikang Li, Kun Fang, Jie Yang, Wei Liu

This calibration task is multi-modal, where the rich color and texture information captured by the camera and the accurate three-dimensional spatial information from the LiDAR is incredibly significant for downstream tasks.

Autonomous Driving Binary Classification

Paper
Code

Neural Extended Kalman Filters for Learning and Predicting Dynamics of Structural Systems

1 code implementation • 9 Oct 2022 • Wei Liu, Zhilu Lai, Kiran Bacsa, Eleni Chatzi

Typically, conventional variational inference models are parameterized by neural networks independent of the latent dynamics models.

Variational Inference

Paper
Code

FR: Folded Rationalization with a Unified Encoder

1 code implementation • 17 Sep 2022 • Wei Liu, Haozhao Wang, Jun Wang, Ruixuan Li, Chao Yue, Yuankai Zhang

Conventional works generally employ a two-phase model in which a generator selects the most important pieces, followed by a predictor that makes predictions based on the selected pieces.

Paper
Code

Towards In-distribution Compatibility in Out-of-distribution Detection

no code implementations • 29 Aug 2022 • Boxi Wu, Jie Jiang, Haidong Ren, Zifan Du, Wenxiao Wang, Zhifeng Li, Deng Cai, Xiaofei He, Binbin Lin, Wei Liu

Various training criteria for these auxiliary outliers are proposed based on heuristic intuitions.

Out-of-Distribution Detection

Paper
Add Code

Symbolic Replay: Scene Graph as Prompt for Continual Learning on VQA Task

1 code implementation • 24 Aug 2022 • Stan Weixian Lei, Difei Gao, Jay Zhangjie Wu, Yuxuan Wang, Wei Liu, Mengmi Zhang, Mike Zheng Shou

However, CL on VQA involves not only the expansion of label sets (new Answer sets).

Continual Learning Question Answering +1

Paper
Code

SWEM: Towards Real-Time Video Object Segmentation with Sequential Weighted Expectation-Maximization

1 code implementation • CVPR 2022 • Zhihui Lin, Tianyu Yang, Maomao Li, Ziyu Wang, Chun Yuan, Wenhao Jiang, Wei Liu

Matching-based methods, especially those based on space-time memory, are significantly ahead of other solutions in semi-supervised video object segmentation (VOS).

Ranked #6 on Semi-Supervised Video Object Segmentation on DAVIS (no YouTube-VOS training)

Semantic Segmentation Semi-Supervised Video Object Segmentation +1

Paper
Code

DPTNet: A Dual-Path Transformer Architecture for Scene Text Detection

no code implementations • 21 Aug 2022 • Jingyu Lin, Jie Jiang, Yan Yan, Chunchao Guo, Hongfa Wang, Wei Liu, Hanzi Wang

We further propose a parallel design that integrates the convolutional network with a powerful self-attention mechanism to provide complementary clues between the attention path and convolutional path.

Scene Text Detection Text Detection

Paper
Add Code

CircuitNet: An Open-Source Dataset for Machine Learning Applications in Electronic Design Automation (EDA)

no code implementations • 1 Aug 2022 • Zhuomin Chai, Yuxiang Zhao, Yibo Lin, Wei Liu, Runsheng Wang, Ru Huang

The electronic design automation (EDA) community has been actively exploring machine learning (ML) for very large-scale integrated computer-aided design (VLSI CAD).

BIG-bench Machine Learning

Paper
Add Code

NICEST: Noisy Label Correction and Training for Robust Scene Graph Generation

no code implementations • 27 Jul 2022 • Lin Li, Long Chen, Hanrong Shi, Hanwang Zhang, Yi Yang, Wei Liu, Jun Xiao

To this end, we propose a novel NoIsy label CorrEction and Sample Training strategy for SGG: NICEST.

Graph Generation Knowledge Distillation +1

Paper
Add Code

Hardly Perceptible Trojan Attack against Neural Networks with Bit Flips

1 code implementation • 27 Jul 2022 • Jiawang Bai, Kuofeng Gao, Dihong Gong, Shu-Tao Xia, Zhifeng Li, Wei Liu

The security of deep neural networks (DNNs) has attracted increasing attention due to their widespread use in various applications.

Paper
Code

Towards Efficient Adversarial Training on Vision Transformers

no code implementations • 21 Jul 2022 • Boxi Wu, Jindong Gu, Zhifeng Li, Deng Cai, Xiaofei He, Wei Liu

Vision Transformer (ViT), as a powerful alternative to Convolutional Neural Network (CNN), has received much attention.

Paper
Add Code

Neural modal ordinary differential equations: Integrating physics-based modeling with neural ordinary differential equations for modeling high-dimensional monitored structures

1 code implementation • 16 Jul 2022 • Zhilu Lai, Wei Liu, Xudong Jian, Kiran Bacsa, Limin Sun, Eleni Chatzi

In the scope of physics-informed machine learning, this paper proposes a framework -- termed Neural Modal ODEs -- to integrate physics-based modeling with deep learning for modeling the dynamics of monitored and high-dimensional engineered systems.

Physics-informed machine learning

Paper
Code

Boosting Multi-Modal E-commerce Attribute Value Extraction via Unified Learning Scheme and Dynamic Range Minimization

no code implementations • 15 Jul 2022 • Mengyin Liu, Chao Zhu, Hongyu Gao, Weibo Gu, Hongfa Wang, Wei Liu, Xu-Cheng Yin

2) Secondly, a text-guided information range minimization method is proposed to adaptively encode descriptive parts of each modality into an identical space with a powerful pretrained linguistic model.

Attribute Attribute Value Extraction +2

Paper
Add Code

Egocentric Video-Language Pretraining @ Ego4D Challenge 2022

1 code implementation • 4 Jul 2022 • Kevin Qinghong Lin, Alex Jinpeng Wang, Mattia Soldan, Michael Wray, Rui Yan, Eric Zhongcong Xu, Difei Gao, RongCheng Tu, Wenzhe Zhao, Weijie Kong, Chengfei Cai, Hongfa Wang, Dima Damen, Bernard Ghanem, Wei Liu, Mike Zheng Shou

In this report, we propose a video-language pretraining (VLP) based solution \cite{kevin2022egovlp} for four Ego4D challenge tasks, including Natural Language Query (NLQ), Moment Query (MQ), Object State Change Classification (OSCC), and PNR Localization (PNR).

Language Modelling Object State Change Classification

205

Paper
Code

Egocentric Video-Language Pretraining @ EPIC-KITCHENS-100 Multi-Instance Retrieval Challenge 2022

1 code implementation • 4 Jul 2022 • Kevin Qinghong Lin, Alex Jinpeng Wang, Rui Yan, Eric Zhongcong Xu, RongCheng Tu, Yanru Zhu, Wenzhe Zhao, Weijie Kong, Chengfei Cai, Hongfa Wang, Wei Liu, Mike Zheng Shou

In this report, we propose a video-language pretraining (VLP) based solution \cite{kevin2022egovlp} for the EPIC-KITCHENS-100 Multi-Instance Retrieval (MIR) challenge.

Language Modelling Multi-Instance Retrieval +1

205

Paper
Code

Hybridization of evolutionary algorithm and deep reinforcement learning for multi-objective orienteering optimization

no code implementations • 21 Jun 2022 • Wei Liu, Rui Wang, Tao Zhang, Kaiwen Li, Wenhua Li, Hisao Ishibuchi

Multi-objective orienteering problems (MO-OPs) are classical multi-objective routing problems and have received a lot of attention in the past decades.

Problem Decomposition reinforcement-learning +1

Paper
Add Code

Towards Generalizable Person Re-identification with a Bi-stream Generative Model

no code implementations • 19 Jun 2022 • Xin Xu, Wei Liu, Zheng Wang, Ruiming Hu, Qi Tian

Guided by original pedestrian images, one stream is employed to learn a camera-invariant global feature for the CC problem via filtering cross-camera interference factors.

Domain Generalization Generalizable Person Re-identification

Paper
Add Code

EDITnet: A Lightweight Network for Unsupervised Domain Adaptation in Speaker Verification

no code implementations • 15 Jun 2022 • Jingyu Li, Wei Liu, Tan Lee

This paper proposes a domain transfer network, named EDITnet, to alleviate the language-mismatch problem on speaker embeddings without requiring speaker labels.

Self-Supervised Learning Speaker Verification +1

Paper
Add Code

Unsupervised Knowledge Adaptation for Passenger Demand Forecasting

no code implementations • 8 Jun 2022 • Can Li, Lei Bai, Wei Liu, Lina Yao, S Travis Waller

These multimodal forecasting models can improve accuracy but be less practical when different parts of multimodal datasets are owned by different institutions who cannot directly share data among them.

Paper
Add Code

Egocentric Video-Language Pretraining

2 code implementations • 3 Jun 2022 • Kevin Qinghong Lin, Alex Jinpeng Wang, Mattia Soldan, Michael Wray, Rui Yan, Eric Zhongcong Xu, Difei Gao, RongCheng Tu, Wenzhe Zhao, Weijie Kong, Chengfei Cai, Hongfa Wang, Dima Damen, Bernard Ghanem, Wei Liu, Mike Zheng Shou

Video-Language Pretraining (VLP), which aims to learn transferable representation to advance a wide range of video-text downstream tasks, has recently received increasing attention.

Ranked #2 on Video Summarization on Query-Focused Video Summarization Dataset

Action Recognition Contrastive Learning +11

205

Paper
Code

Efficient-Adam: Communication-Efficient Distributed Adam

no code implementations • 28 May 2022 • Congliang Chen, Li Shen, Wei Liu, Zhi-Quan Luo

Distributed adaptive stochastic gradient methods have been widely used for large-scale nonconvex optimization, such as training deep learning models.

Quantization

Paper
Add Code

An Investigation on Applying Acoustic Feature Conversion to ASR of Adult and Child Speech

no code implementations • 25 May 2022 • Wei Liu, Jingyu Li, Tan Lee

The performance of child speech recognition is generally less satisfactory compared to adult speech due to limited amount of training data.

Attribute Automatic Speech Recognition +4

Paper
Add Code

Demand Response Method Considering Multiple Types of Flexible Loads in Industrial Parks

no code implementations • 24 May 2022 • Jia Cui, Mingze Gao, Xiaoming Zhou, Yang Li, Wei Liu, Jiazheng Tian, XiMing Zhang

With the rapid development of the energy internet, the proportion of flexible loads in smart grid is getting much higher than before.

Paper
Add Code

An Inexact Augmented Lagrangian Algorithm for Training Leaky ReLU Neural Network with Group Sparsity

no code implementations • 11 May 2022 • Wei Liu, Xin Liu, Xiaojun Chen

Moreover, we show the relationship between the new model and the original problem.

Paper
Add Code

Dynamic Programming in Rank Space: Scaling Structured Inference with Low-Rank HMMs and PCFGs

2 code implementations • NAACL 2022 • Songlin Yang, Wei Liu, Kewei Tu

Recent research found it beneficial to use large state spaces for HMMs and PCFGs.

Ranked #4 on Constituency Grammar Induction on PTB Diagnostic ECG Database

Constituency Grammar Induction Language Modelling

Paper
Code

Improving Visual Grounding with Visual-Linguistic Verification and Iterative Reasoning

1 code implementation • CVPR 2022 • Li Yang, Yan Xu, Chunfeng Yuan, Wei Liu, Bing Li, Weiming Hu

They base the visual grounding on the features from pre-generated proposals or anchors, and fuse these features with the text embeddings to locate the target mentioned by the text.

Attribute object-detection +2

Paper
Code

Deep Reinforcement Learning for Orienteering Problems Based on Decomposition

no code implementations • 25 Apr 2022 • Wei Liu, Tao Zhang, Rui Wang, Kaiwen Li, Wenhua Li, Kang Yang

A dynamic pointer network (DYPN) is introduced as the TSP solver, which takes city locations as inputs and immediately outputs a permutation of nodes.

reinforcement-learning Reinforcement Learning (RL) +1

Paper
Add Code

ChildPredictor: A Child Face Prediction Framework with Disentangled Learning

1 code implementation • 21 Apr 2022 • Yuzhi Zhao, Lai-Man Po, Xuehui Wang, Qiong Yan, Wei Shen, Yujia Zhang, Wei Liu, Chun-Kit Wong, Chiu-Sing Pang, Weifeng Ou, Wing-Yin Yu, Buhua Liu

On this basis, we formulate predictions as a mapping from parents' genetic factors to children's genetic factors, and disentangle them from external and variety factors.

Age-Invariant Face Recognition Image-to-Image Translation +2

Paper
Code

XMP-Font: Self-Supervised Cross-Modality Pre-training for Few-Shot Font Generation

no code implementations • CVPR 2022 • Wei Liu, Fangyue Liu, Fei Ding, Qian He, Zili Yi

The cross-modality encoder is pre-trained in a self-supervised manner to allow effective capture of cross- and intra-modality correlations, which facilitates the content-style disentanglement and modeling style representations of all scales (stroke-level, component-level and character-level).

Disentanglement Font Generation

Paper
Add Code

Tencent Text-Video Retrieval: Hierarchical Cross-Modal Interactions with Multi-Level Representations

no code implementations • 7 Apr 2022 • Jie Jiang, Shaobo Min, Weijie Kong, Dihong Gong, Hongfa Wang, Zhifeng Li, Wei Liu

With multi-level representations for video and text, hierarchical contrastive learning is designed to explore fine-grained cross-modal relationships, i. e., frame-word, clip-phrase, and video-sentence, which enables HCMI to achieve a comprehensive semantic comparison between video and text modalities.

Ranked #1 on Video Retrieval on MSR-VTT-1kA (using extra training data)

Contrastive Learning Denoising +4

Paper
Add Code

Improving Vision Transformers by Revisiting High-frequency Components

1 code implementation • 3 Apr 2022 • Jiawang Bai, Li Yuan, Shu-Tao Xia, Shuicheng Yan, Zhifeng Li, Wei Liu

Inspired by this finding, we first investigate the effects of existing techniques for improving ViT models from a new frequency perspective, and find that the success of some techniques (e. g., RandAugment) can be attributed to the better usage of the high-frequency components.

Ranked #2 on Domain Generalization on Stylized-ImageNet

Domain Generalization Image Classification +1

Paper
Code

Masked Autoencoders for Point Cloud Self-supervised Learning

3 code implementations • 13 Mar 2022 • Yatian Pang, Wenxiao Wang, Francis E. H. Tay, Wei Liu, Yonghong Tian, Li Yuan

Then, a standard Transformer based autoencoder, with an asymmetric design and a shifting mask tokens operation, learns high-level latent features from unmasked point patches, aiming to reconstruct the masked point patches.

Ranked #2 on Point Cloud Segmentation on PointCloud-C

3D Part Segmentation Few-Shot 3D Point Cloud Classification +2

394

Paper
Code

CROON: Automatic Multi-LiDAR Calibration and Refinement Method in Road Scene

1 code implementation • 7 Mar 2022 • Pengjin Wei, Guohang Yan, Yikang Li, Kun Fang, Xinyu Cai, Jie Yang, Wei Liu

Sensor-based environmental perception is a crucial part of the autonomous driving system.

Autonomous Driving

Paper
Code

CLIP-GEN: Language-Free Training of a Text-to-Image Generator with CLIP

2 code implementations • 1 Mar 2022 • ZiHao Wang, Wei Liu, Qian He, Xinglong Wu, Zili Yi

Once trained, the transformer can generate coherent image tokens based on the text embedding extracted from the text encoder of CLIP upon an input text.

Text-to-Image Generation

126

Paper
Code

Deep Single Image Deraining using An Asymetric Cycle Generative and Adversarial Framework

no code implementations • 19 Feb 2022 • Wei Liu, Rui Jiang, Cheng Chen, Tao Lu, Zixiang Xiong

The former consists of parallel rain removal path and rain-fog feature extraction path by the rain and derain-fog network and the attention rain-fog feature extraction network (ARFE) , while the latter only contains a synthetic rain transformation path.

Single Image Deraining

Paper
Add Code

Unpaired Quad-Path Cycle Consistent Adversarial Networks for Single Image Defogging

no code implementations • 19 Feb 2022 • Wei Liu, Cheng Chen, Rui Jiang, Tao Lu, Zixiang Xiong

To address these issues, we develop a novel generative adversarial network, called quad-path cycle consistent adversarial network (QPC-Net), for single image defogging.

Generative Adversarial Network

Paper
Add Code

Exploring Structural Sparsity in Neural Image Compression

no code implementations • 9 Feb 2022 • Shanzhi Yin, Chao Li, Wen Tan, Youneng Bao, Yongsheng Liang, Wei Liu

Neural image compression have reached or out-performed traditional methods (such as JPEG, BPG, WebP).

Image Compression

Paper
Add Code

Constrained Variational Policy Optimization for Safe Reinforcement Learning

2 code implementations • 28 Jan 2022 • Zuxin Liu, Zhepeng Cen, Vladislav Isenbaev, Wei Liu, Zhiwei Steven Wu, Bo Li, Ding Zhao

Safe reinforcement learning (RL) aims to learn policies that satisfy certain constraints before deploying them to safety-critical applications.

reinforcement-learning Reinforcement Learning (RL) +1

Paper
Code

DynaMixer: A Vision MLP Architecture with Dynamic Mixing

2 code implementations • 28 Jan 2022 • Ziyu Wang, Wenhao Jiang, Yiming Zhu, Li Yuan, Yibing Song, Wei Liu

In contrast with vision transformers and CNNs, the success of MLP-like models shows that simple information fusion operations among tokens and channels can yield a good representation power for deep recognition models.

Image Classification

161

Paper
Code

DrugOOD: Out-of-Distribution (OOD) Dataset Curator and Benchmark for AI-aided Drug Discovery -- A Focus on Affinity Prediction Problems with Noise Annotations

1 code implementation • 24 Jan 2022 • Yuanfeng Ji, Lu Zhang, Jiaxiang Wu, Bingzhe Wu, Long-Kai Huang, Tingyang Xu, Yu Rong, Lanqing Li, Jie Ren, Ding Xue, Houtim Lai, Shaoyong Xu, Jing Feng, Wei Liu, Ping Luo, Shuigeng Zhou, Junzhou Huang, Peilin Zhao, Yatao Bian

AI-aided drug discovery (AIDD) is gaining increasing popularity due to its promise of making the search for new pharmaceuticals quicker, cheaper and more efficient.

Benchmarking Drug Discovery +1

143

Paper
Code

Spatio-Temporal Graph Representation Learning for Fraudster Group Detection

no code implementations • 7 Jan 2022 • Saeedreza Shehnepoor, Roberto Togneri, Wei Liu, Mohammed Bennamoun

Then we use an RNN on the spatial relations to predict the spatio-temporal relations of reviewers in the group.

Graph Representation Learning

Paper
Add Code

Coherent Point Drift Revisited for Non-Rigid Shape Matching and Registration

no code implementations • CVPR 2022 • Aoxiang Fan, Jiayi Ma, Xin Tian, Xiaoguang Mei, Wei Liu

In this paper, we explore a new type of extrinsic method to directly align two geometric shapes with point-to-point correspondences in ambient space by recovering a deformation, which allows more continuous and smooth maps to be obtained.

Paper
Add Code

RFNet: Unsupervised Network for Mutually Reinforcing Multi-Modal Image Registration and Fusion

no code implementations • CVPR 2022 • Han Xu, Jiayi Ma, Jiteng Yuan, Zhuliang Le, Wei Liu

Specifically, for image registration, we solve the bottlenecks of defining registration metrics applicable for multi-modal images and facilitating the network convergence.

Image Registration

Paper
Add Code

Triangle Attack: A Query-efficient Decision-based Adversarial Attack

1 code implementation • 13 Dec 2021 • Xiaosen Wang, Zeliang Zhang, Kangheng Tong, Dihong Gong, Kun He, Zhifeng Li, Wei Liu

Decision-based attack poses a severe threat to real-world applications since it regards the target model as a black box and only accesses the hard prediction label.

Adversarial Attack Dimensionality Reduction

Paper
Code

DGL-GAN: Discriminator Guided Learning for GAN Compression

1 code implementation • 13 Dec 2021 • Yuesong Tian, Li Shen, Xiang Tian, DaCheng Tao, Zhifeng Li, Wei Liu, Yaowu Chen

Moreover, DGL-GAN is also effective in boosting the performance of original uncompressed GANs.

Paper
Code

CO2Sum:Contrastive Learning for Factual-Consistent Abstractive Summarization

no code implementations • 2 Dec 2021 • Wei Liu, Huanqin Wu, Wenjing Mu, Zhen Li, Tao Chen, Dan Nie

We propose CO2Sum (Contrastive for Consistency), a contrastive learning scheme that can be easily applied on sequence-to-sequence models for factual-consistent abstractive summarization, proving that the model can be fact-aware without modifying the architecture.

Abstractive Text Summarization Contrastive Learning

Paper
Add Code

Generalized and Discriminative Few-Shot Object Detection via SVD-Dictionary Enhancement

1 code implementation • NeurIPS 2021 • Aming Wu, Suqi Zhao, Cheng Deng, Wei Liu

To alleviate the impact of few samples, enhancing the generalization and discrimination abilities of detectors on new objects plays an important role.

Dictionary Learning Few-Shot Object Detection +1

Paper
Code

Neural Routing by Memory

no code implementations • NeurIPS 2021 • Kaipeng Zhang, Zhenqiang Li, Zhifeng Li, Wei Liu, Yoichi Sato

However, they use the same procedure sequence for all inputs, regardless of the intermediate features. This paper proffers a simple yet effective idea of constructing parallel procedures and assigning similar intermediate features to the same specialized procedures in a divide-and-conquer fashion.

Paper
Add Code

MC-Blur: A Comprehensive Benchmark for Image Deblurring

2 code implementations • 1 Dec 2021 • Kaihao Zhang, Tao Wang, Wenhan Luo, Boheng Chen, Wenqi Ren, Bjorn Stenger, Wei Liu, Hongdong Li, Ming-Hsuan Yang

Blur artifacts can seriously degrade the visual quality of images, and numerous deblurring methods have been proposed for specific scenarios.

Benchmarking Deblurring +1

143

Paper
Code

PlantStereo: A Stereo Matching Benchmark for Plant Surface Dense Reconstruction

1 code implementation • 30 Nov 2021 • Qingyu Wang, Baojian Ma, Wei Liu, Mingzhao Lou, Mingchuan Zhou, Huanyu Jiang, Yibin Ying

In this paper, we aim to address the issue between datasets and models and propose a large scale stereo dataset with high accuracy disparity ground truth named PlantStereo.

Camera Calibration Image Registration +1

Paper
Code

Social Fraud Detection Review: Methods, Challenges and Analysis

no code implementations • 10 Nov 2021 • Saeedreza Shehnepoor, Roberto Togneri, Wei Liu, Mohammed Bennamoun

Many studies proposed approaches based on user behaviors and review text to address the challenges of fraud detection.

Decision Making Fraud Detection

Paper
Add Code

SpineOne: A One-Stage Detection Framework for Degenerative Discs and Vertebrae

no code implementations • 28 Oct 2021 • Jiabo He, Wei Liu, Yu Wang, Xingjun Ma, Xian-Sheng Hua

Spinal degeneration plagues many elders, office workers, and even the younger generations.

Medical Diagnosis Medical Object Detection

Paper
Add Code

Meter-Range Wireless Motor Drive for Pipeline Transportation

no code implementations • 26 Oct 2021 • Wei Liu, K. T. Chau, Hui Wang, Tengbo Yang

This paper proposes and implements a meter-range wireless motor drive (WMD) system for promising applications of underground pipeline transportations or in-pipe robots.

Paper
Add Code

Physics-guided Deep Markov Models for Learning Nonlinear Dynamical Systems with Uncertainty

1 code implementation • 16 Oct 2021 • Wei Liu, Zhilu Lai, Kiran Bacsa, Eleni Chatzi

To address this, we bridge physics-based state space models with Deep Markov Models, thus delivering a hybrid modeling framework for unsupervised learning and identification of nonlinear dynamical systems.

Variational Inference

Paper
Code

Rethinking the Spatial Route Prior in Vision-and-Language Navigation

no code implementations • 12 Oct 2021 • Xinzhe Zhou, Wei Liu, Yadong Mu

In a most information-rich case of knowing environment maps and admitting shortest-path prior, we observe that given an origin-destination node pair, the internal route can be uniquely determined.

Navigate Vision and Language Navigation

Paper
Add Code

A Simplified System Model for Optical Camera Communication

no code implementations • Conference 2021 • Anqi Liu, Wenxiao Shi, Wei Liu, Zhuo Wang

Data rate and communication distance are two important criteria for measuring the performance of optical camera communication (OCC) systems.

Paper
Add Code

EDFace-Celeb-1M: Benchmarking Face Hallucination with a Million-scale Dataset

1 code implementation • 11 Oct 2021 • Kaihao Zhang, Dongxu Li, Wenhan Luo, Jingyu Liu, Jiankang Deng, Wei Liu, Stefanos Zafeiriou

It is thus unclear how these algorithms perform on public face hallucination datasets.

Ranked #1 on Image Super-Resolution on WLFW

Benchmarking Face Hallucination +2

Paper
Code

Exploiting Pre-Trained ASR Models for Alzheimer's Disease Recognition Through Spontaneous Speech

no code implementations • 4 Oct 2021 • Ying Qin, Wei Liu, Zhiyuan Peng, Si-Ioi Ng, Jingyu Li, Haibo Hu, Tan Lee

Input to these classifiers are speech transcripts produced by automatic speech recognition (ASR) models.

Automatic Speech Recognition Automatic Speech Recognition (ASR) +1

Paper
Add Code

DyStyle: Dynamic Neural Network for Multi-Attribute-Conditioned Style Editing

1 code implementation • 22 Sep 2021 • Bingchuan Li, Shaofei Cai, Wei Liu, Peng Zhang, Qian He, Miao Hua, Zili Yi

To address these limitations, we design a Dynamic Style Manipulation Network (DyStyle) whose structure and parameters vary by input samples, to perform nonlinear and adaptive manipulation of latent codes for flexible and precise attribute control.

Attribute Contrastive Learning

Paper
Code

Utterance-level neural confidence measure for end-to-end children speech recognition

no code implementations • 16 Sep 2021 • Wei Liu, Tan Lee

The investigation is focused on evaluating and comparing the efficacies of predictor features that are derived from different internal and external modules of the E2E system.

Automatic Speech Recognition Automatic Speech Recognition (ASR) +2

Paper
Add Code

Semantic-Preserving Adversarial Text Attacks

2 code implementations • 23 Aug 2021 • Xinghao Yang, Weifeng Liu, James Bailey, DaCheng Tao, Wei Liu

In this paper, we propose a Bigram and Unigram based adaptive Semantic Preservation Optimization (BU-SPO) method to examine the vulnerability of deep models.

Adversarial Text Semantic Similarity +4

Paper
Code

End2End Occluded Face Recognition by Masking Corrupted Features

1 code implementation • 21 Aug 2021 • Haibo Qiu, Dihong Gong, Zhifeng Li, Wei Liu, DaCheng Tao

However, the state-of-the-art general face recognition models do not generalize well to occluded face images, which are exactly the common cases in real-world scenarios.

Face Recognition

Paper
Code

SynFace: Face Recognition with Synthetic Data

1 code implementation • ICCV 2021 • Haibo Qiu, Baosheng Yu, Dihong Gong, Zhifeng Li, Wei Liu, DaCheng Tao

We then analyze the underlying causes behind the performance gap, e. g., the poor intra-class variations and the domain gap between synthetic and real face images.

Face Generation Face Recognition

Paper
Code

Structure-Aware Feature Generation for Zero-Shot Learning

no code implementations • 16 Aug 2021 • Lianbo Zhang, Shaoli Huang, Xinchao Wang, Wei Liu, DaCheng Tao

In this paper, we introduce a novel structure-aware feature generation scheme, termed as SA-GAN, to explicitly account for the topological structure in learning both the latent space and the generative networks.

Attribute Generative Adversarial Network +1

Paper
Add Code

CrossFormer: A Versatile Vision Transformer Hinging on Cross-scale Attention

3 code implementations • ICLR 2022 • Wenxiao Wang, Lu Yao, Long Chen, Binbin Lin, Deng Cai, Xiaofei He, Wei Liu

On the one hand, CEL blends each embedding with multiple patches of different scales, providing the self-attention module itself with cross-scale features.

Ranked #42 on Semantic Segmentation on ADE20K val