Search Results for author: Qihang Zhao

Found 8 papers, 4 papers with code

Redefining Information Retrieval of Structured Database via Large Language Models

no code implementations • 9 May 2024 • Mingzhu Wang, Yuzhe Zhang, Qihang Zhao, Juanyi Yang, Hong Zhang

Retrieval augmentation is critical when Language Models (LMs) exploit non-parametric knowledge related to the query through external knowledge bases before reasoning.

Information Retrieval Question Answering +1

Paper
Add Code

Eagle and Finch: RWKV with Matrix-Valued States and Dynamic Recurrence

4 code implementations • 8 Apr 2024 • Bo Peng, Daniel Goldstein, Quentin Anthony, Alon Albalak, Eric Alcaide, Stella Biderman, Eugene Cheah, Xingjian Du, Teddy Ferdinan, Haowen Hou, Przemysław Kazienko, Kranthi Kiran GV, Jan Kocoń, Bartłomiej Koptyra, Satyapriya Krishna, Ronald McClelland Jr., Niklas Muennighoff, Fares Obeid, Atsushi Saito, Guangyu Song, Haoqin Tu, Stanisław Woźniak, Ruichong Zhang, Bingchen Zhao, Qihang Zhao, Peng Zhou, Jian Zhu, Rui-Jie Zhu

We present Eagle (RWKV-5) and Finch (RWKV-6), sequence models improving upon the RWKV (RWKV-4) architecture.

11,768

Paper
Code

Highly Accurate Disease Diagnosis and Highly Reproducible Biomarker Identification with PathFormer

no code implementations • 11 Feb 2024 • Zehao Dong, Qihang Zhao, Philip R. O. Payne, Michael A Province, Carlos Cruchaga, Muhan Zhang, Tianyu Zhao, Yixin Chen, Fuhai Li

However, we found two major limitations of existing GNNs in omics data analysis, i. e., limited-prediction (diagnosis) accuracy and limited-reproducible biomarker identification capacity across multiple datasets.

Paper
Add Code

RWKV: Reinventing RNNs for the Transformer Era

5 code implementations • 22 May 2023 • Bo Peng, Eric Alcaide, Quentin Anthony, Alon Albalak, Samuel Arcadinho, Stella Biderman, Huanqi Cao, Xin Cheng, Michael Chung, Matteo Grella, Kranthi Kiran GV, Xuzheng He, Haowen Hou, Jiaju Lin, Przemyslaw Kazienko, Jan Kocon, Jiaming Kong, Bartlomiej Koptyra, Hayden Lau, Krishna Sri Ipsit Mantri, Ferdinand Mom, Atsushi Saito, Guangyu Song, Xiangru Tang, Bolun Wang, Johan S. Wind, Stanislaw Wozniak, Ruichong Zhang, Zhenyuan Zhang, Qihang Zhao, Peng Zhou, Qinghua Zhou, Jian Zhu, Rui-Jie Zhu

This work presents a significant step towards reconciling trade-offs between computational efficiency and model performance in sequence processing tasks.

Ranked #22 on Natural Language Inference on WNLI

Computational Efficiency Natural Language Inference

11,768

Paper
Code

Both Efficiency and Effectiveness! A Large Scale Pre-ranking Framework in Search System

no code implementations • 5 Apr 2023 • Qihang Zhao, Rui-Jie Zhu, Liu Yang, He Yongming, Bo Zhou, Luo Cheng

In the realm of search systems, multi-stage cascade architecture is a prevalent method, typically consisting of sequential modules such as matching, pre-ranking, and ranking.

feature selection

Paper
Add Code

SpikeGPT: Generative Pre-trained Language Model with Spiking Neural Networks

1 code implementation • 27 Feb 2023 • Rui-Jie Zhu, Qihang Zhao, Guoqi Li, Jason K. Eshraghian

As a result, their performance lags behind modern deep learning, and we are yet to see the effectiveness of SNNs in language generation.

Language Modelling Text Generation

696

Paper
Code

TCJA-SNN: Temporal-Channel Joint Attention for Spiking Neural Networks

1 code implementation • 21 Jun 2022 • Rui-Jie Zhu, Malu Zhang, Qihang Zhao, Haoyu Deng, Yule Duan, Liang-Jian Deng

Given the critical role of attention mechanisms in enhancing neural network performance, the integration of SNNs and attention mechanisms exhibits potential to deliver energy-efficient and high-performance computing paradigms.

Image Classification Image Generation

Paper
Code

Utilizing Citation Network Structure to Predict Citation Counts: A Deep Learning Approach

no code implementations • 6 Sep 2020 • Qihang Zhao

Therefore, it is very important to be able to accurately predict the citation counts of academic papers.

Decision Making

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.