Search Results for author: Xinyue Zhang

Found 25 papers, 9 papers with code

InternLM-XComposer2-4KHD: A Pioneering Large Vision-Language Model Handling Resolutions from 336 Pixels to 4K HD

2 code implementations • 9 Apr 2024 • Xiaoyi Dong, Pan Zhang, Yuhang Zang, Yuhang Cao, Bin Wang, Linke Ouyang, Songyang Zhang, Haodong Duan, Wenwei Zhang, Yining Li, Hang Yan, Yang Gao, Zhe Chen, Xinyue Zhang, Wei Li, Jingwen Li, Wenhai Wang, Kai Chen, Conghui He, Xingcheng Zhang, Jifeng Dai, Yu Qiao, Dahua Lin, Jiaqi Wang

The Large Vision-Language Model (LVLM) field has seen significant advancements, yet its progression has been hindered by challenges in comprehending fine-grained visual content due to limited resolution.

Ranked #12 on Visual Question Answering on MM-Vet

4k Language Modelling +1

1,690


InternLM2 Technical Report

1 code implementation • 26 Mar 2024 • Zheng Cai, Maosong Cao, Haojiong Chen, Kai Chen, Keyu Chen, Xin Chen, Xun Chen, Zehui Chen, Zhi Chen, Pei Chu, Xiaoyi Dong, Haodong Duan, Qi Fan, Zhaoye Fei, Yang Gao, Jiaye Ge, Chenya Gu, Yuzhe Gu, Tao Gui, Aijia Guo, Qipeng Guo, Conghui He, Yingfan Hu, Ting Huang, Tao Jiang, Penglong Jiao, Zhenjiang Jin, Zhikai Lei, Jiaxing Li, Jingwen Li, Linyang Li, Shuaibin Li, Wei Li, Yining Li, Hongwei Liu, Jiangning Liu, Jiawei Hong, Kaiwen Liu, Kuikun Liu, Xiaoran Liu, Chengqi Lv, Haijun Lv, Kai Lv, Li Ma, Runyuan Ma, Zerun Ma, Wenchang Ning, Linke Ouyang, Jiantao Qiu, Yuan Qu, FuKai Shang, Yunfan Shao, Demin Song, Zifan Song, Zhihao Sui, Peng Sun, Yu Sun, Huanze Tang, Bin Wang, Guoteng Wang, Jiaqi Wang, Jiayu Wang, Rui Wang, Yudong Wang, Ziyi Wang, Xingjian Wei, Qizhen Weng, Fan Wu, Yingtong Xiong, Chao Xu, Ruiliang Xu, Hang Yan, Yirong Yan, Xiaogui Yang, Haochen Ye, Huaiyuan Ying, JIA YU, Jing Yu, Yuhang Zang, Chuyu Zhang, Li Zhang, Pan Zhang, Peng Zhang, Ruijie Zhang, Shuo Zhang, Songyang Zhang, Wenjian Zhang, Wenwei Zhang, Xingcheng Zhang, Xinyue Zhang, Hui Zhao, Qian Zhao, Xiaomeng Zhao, Fengzhe Zhou, Zaida Zhou, Jingming Zhuo, Yicheng Zou, Xipeng Qiu, Yu Qiao, Dahua Lin

The evolution of Large Language Models (LLMs) like ChatGPT and GPT-4 has sparked discussions on the advent of Artificial General Intelligence (AGI).

Ranked #5 on Long-Context Understanding on Ada-LEval (BestAnswer)

4k Long-Context Understanding

5,247


Driving Style Alignment for LLM-powered Driver Agent

2 code implementations • 17 Mar 2024 • Ruoxuan Yang, Xinyue Zhang, Anais Fernandez-Laaksonen, Xin Ding, Jiangtao Gong

Recently, LLM-powered driver agents have demonstrated considerable potential in the field of autonomous driving, showcasing human-like reasoning and decision-making abilities. However, current research on aligning driver agent behaviors with human driving styles remains limited, partly due to the scarcity of high-quality natural language data from human driving behaviors. To address this research gap, we propose a multi-alignment framework designed to align driver agents with human driving styles through demonstrations and feedback.

Autonomous Driving Decision Making


InternLM-XComposer2: Mastering Free-form Text-Image Composition and Comprehension in Vision-Language Large Model

1 code implementation • 29 Jan 2024 • Xiaoyi Dong, Pan Zhang, Yuhang Zang, Yuhang Cao, Bin Wang, Linke Ouyang, Xilin Wei, Songyang Zhang, Haodong Duan, Maosong Cao, Wenwei Zhang, Yining Li, Hang Yan, Yang Gao, Xinyue Zhang, Wei Li, Jingwen Li, Kai Chen, Conghui He, Xingcheng Zhang, Yu Qiao, Dahua Lin, Jiaqi Wang

We introduce InternLM-XComposer2, a cutting-edge vision-language model excelling in free-form text-image composition and comprehension.

Ranked #17 on Visual Question Answering on MM-Vet

Language Modelling Visual Question Answering

1,690


Audiobox: Unified Audio Generation with Natural Language Prompts

no code implementations • 25 Dec 2023 • Apoorv Vyas, Bowen Shi, Matthew Le, Andros Tjandra, Yi-Chiao Wu, Baishan Guo, Jiemin Zhang, Xinyue Zhang, Robert Adkins, William Ngan, Jeff Wang, Ivan Cruz, Bapi Akula, Akinniyi Akinyemi, Brian Ellis, Rashel Moritz, Yael Yungster, Alice Rakotoarison, Liang Tan, Chris Summers, Carleigh Wood, Joshua Lane, Mary Williamson, Wei-Ning Hsu

Research communities have made great progress over the past year advancing the performance of large scale audio generative models for a single modality (speech, sound, or music) through adopting more powerful generative models and scaling data.

Ranked #1 on Audio Generation on AudioCaps

AudioCaps Audio Generation +1


Harnessing Inherent Noises for Privacy Preservation in Quantum Machine Learning

no code implementations • 18 Dec 2023 • Keyi Ju, Xiaoqi Qin, Hui Zhong, Xinyue Zhang, Miao Pan, Baoling Liu

Quantum computing revolutionizes the way of solving complex problems and handling vast datasets, which shows great potential to accelerate the machine learning process.

Binary Classification Quantum Machine Learning


Optimised Storage for Datalog Reasoning

no code implementations • 18 Dec 2023 • Xinyue Zhang, Pan Hu, Yavor Nenov, Ian Horrocks

Materialisation facilitates Datalog reasoning by precomputing all consequences of the facts and the rules so that queries can be directly answered over the materialised facts.


InternLM-XComposer: A Vision-Language Large Model for Advanced Text-image Comprehension and Composition

1 code implementation • 26 Sep 2023 • Pan Zhang, Xiaoyi Dong, Bin Wang, Yuhang Cao, Chao Xu, Linke Ouyang, Zhiyuan Zhao, Haodong Duan, Songyang Zhang, Shuangrui Ding, Wenwei Zhang, Hang Yan, Xinyue Zhang, Wei Li, Jingwen Li, Kai Chen, Conghui He, Xingcheng Zhang, Yu Qiao, Dahua Lin, Jiaqi Wang

We propose InternLM-XComposer, a vision-language large model that enables advanced image-text comprehension and composition.

Ranked #9 on Visual Question Answering (VQA) on InfiMM-Eval

Image Comprehension Reading Comprehension +1

1,690


Ultrafast-and-Ultralight ConvNet-Based Intelligent Monitoring System for Diagnosing Early-Stage Mpox Anytime and Anywhere

no code implementations • 25 Aug 2023 • Yubiao Yue, Xiaoqiang Shi, Li Qin, Xinyue Zhang, Yanmei Chen, Jialong Xu, Zipei Zheng, Yujun Cao, Di Liu, Zhenzhang Li, Yang Li

Due to the lack of more efficient diagnostic tools for monkeypox, its spread remains unchecked, presenting a formidable challenge to global health.

Data Augmentation MonkeyPox Diagnosis +1


Enhancing Datalog Reasoning with Hypertree Decompositions

no code implementations • 11 May 2023 • Xinyue Zhang, Pan Hu, Yavor Nenov, Ian Horrocks

In this paper, we provide algorithms that exploit hypertree decompositions for the materialisation and incremental evaluation of Datalog programs.


Accelerating Globally Optimal Consensus Maximization in Geometric Vision

no code implementations • 11 Apr 2023 • Xinyue Zhang, Liangzu Peng, Wanting Xu, Laurent Kneip

Branch-and-bound-based consensus maximization stands out due to its important ability of retrieving the globally optimal solution to outlier-affected geometric problems.

Pose Estimation


Mpox-AISM: AI-Mediated Super Monitoring for Mpox and Like-Mpox

no code implementations • 17 Mar 2023 • Yubiao Yue, Minghua Jiang, Xinyue Zhang, Jialong Xu, Huacong Ye, Fan Zhang, Zhenzhang Li, Yang Li

With the help of the Internet and communication terminal, Mpox-AISM can perform a real-time, low-cost, and convenient diagnosis for earlier-stage mpox in various real-world settings, thereby effectively curbing the spread of mpox virus.

Data Augmentation Decision Making +2


Data-based Polymer-Unit Fingerprint (PUFp): A Newly Accessible Expression of Polymer Organic Semiconductors for Machine Learning

no code implementations • 3 Nov 2022 • Xinyue Zhang, Genwang Wei, Ye Sheng, Jiong Yang, Caichao Ye, Wenqing Zhang

By investigating the combinations of polymer units with mobility performance, a scheme for designing polymer OSC materials by combining ML approaches and PUFp information is proposed to not only passively predict OSC mobility but also actively provide structural guidance for new high-mobility OSC material design.


CheXplaining in Style: Counterfactual Explanations for Chest X-rays using StyleGAN

1 code implementation • 15 Jul 2022 • Matan Atad, Vitalii Dmytrenko, Yitong Li, Xinyue Zhang, Matthias Keicher, Jan Kirschke, Bene Wiestler, Ashkan Khakzar, Nassir Navab

Deep learning models used in medical image analysis are prone to raising reliability concerns due to their black-box nature.

counterfactual


Two New Stenosis Detection Methods of Coronary Angiograms

no code implementations • 12 Dec 2021 • Yaofang Liu, Xinyue Zhang, Wenlong Wan, Shaoyu Liu, Yingdi Liu, Hu Liu, Xueying Zeng, Qing Zhang

Two vascular stenosis detection methods are proposed to assist the diagnosis.

Vocal Bursts Valence Prediction


Boosting RANSAC via Dual Principal Component Pursuit

no code implementations • 6 Oct 2021 • Yunchen Yang, Xinyue Zhang, Tianjiao Ding, Daniel P. Robinson, Rene Vidal, Manolis C. Tsakiris

In this paper, we revisit the problem of local optimization in RANSAC.


Open Set Domain Adaptation with Zero-shot Learning on Graph

no code implementations • 29 Sep 2021 • Xinyue Zhang, Xu Yang, Zhi-Yong Liu

Thus the classification ability of the source domain is transferred to the target domain and the model can distinguish the unknown classes with prior knowledge.

Domain Adaptation Zero-Shot Learning


Two New Stenosis Detection Methods of Coronary Angiograms

no code implementations • 3 Aug 2021 • Yaofang Liu, Xinyue Zhang, Wenlong Wan, Shaoyu Liu, Yingdi Liu, Hu Liu, Xueying Zeng, Qing Zhang

Two vascular stenosis detection methods are proposed to assist the diagnosis.

Vocal Bursts Valence Prediction


Diverse Melody Generation from Chinese Lyrics via Mutual Information Maximization

no code implementations • 7 Dec 2020 • Ruibin Yuan, Ge Zhang, Anqiao Yang, Xinyue Zhang

In this paper, we propose to adapt the method of mutual information maximization into the task of Chinese lyrics conditioned melody generation to improve the generation quality and diversity.


Evaluation of Inference Attack Models for Deep Learning on Medical Data

no code implementations • 31 Oct 2020 • Maoqiang Wu, Xinyue Zhang, Jiahao Ding, Hien Nguyen, Rong Yu, Miao Pan, Stephen T. Wong

This paper aims to attract interest from researchers in the medical deep learning community to this important problem.

Attribute Inference Attack


Revealing Secrets in SPARQL Session Level

1 code implementation • 13 Sep 2020 • Xinyue Zhang, Meng Wang, Muhammad Saleem, Axel-Cyrille Ngonga Ngomo, Guilin Qi, Haofen Wang

Based on Semantic Web technologies, knowledge graphs help users to discover information of interest by using live SPARQL services.

Knowledge Graphs


Differentially Private and Fair Classification via Calibrated Functional Mechanism

no code implementations • 14 Jan 2020 • Jiahao Ding, Xinyue Zhang, Xiaohuan Li, Junyi Wang, Rong Yu, Miao Pan

In order to enforce $\epsilon$-differential privacy and fairness, we leverage the functional mechanism to add different amounts of Laplace noise regarding different attributes to the polynomial coefficients of the objective function in consideration of fairness constraint.

Autonomous Driving BIG-bench Machine Learning +4


From Dark Matter to Galaxies with Convolutional Neural Networks

1 code implementation • 17 Oct 2019 • Jacky H. T. Yip, Xinyue Zhang, Yanfang Wang, Wei Zhang, Yueqiu Sun, Gabriella Contardo, Francisco Villaescusa-Navarro, Siyu He, Shy Genel, Shirley Ho

Cosmological simulations play an important role in the interpretation of astronomical data, in particular in comparing observed data to our theoretical expectations.


From Dark Matter to Galaxies with Convolutional Networks

1 code implementation • 15 Feb 2019 • Xinyue Zhang, Yanfang Wang, Wei Zhang, Yueqiu Sun, Siyu He, Gabriella Contardo, Francisco Villaescusa-Navarro, Shirley Ho

In combination with current and upcoming data from cosmological observations, our method has the potential to answer fundamental questions about our Universe with the highest accuracy.


Image Registration Based Flicker Solving in Video Face Replacement and Analysis Based Sub-pixel Image Registration

no code implementations • 9 Mar 2018 • Xiaofang Wang, Guoqiang Xiang, Xinyue Zhang, Wei Wei

In this paper, a framework of video face replacement is proposed and it deals with the flicker of swapped face in video sequence.

Image Registration

