Search Results for author: Zheyang Xiong

Found 3 papers, 2 papers with code

Can Mamba Learn How to Learn? A Comparative Study on In-Context Learning Tasks

2 code implementations6 Feb 2024 Jongho Park, Jaeseung Park, Zheyang Xiong, Nayoung Lee, Jaewoong Cho, Samet Oymak, Kangwook Lee, Dimitris Papailiopoulos

State-space models (SSMs), such as Mamba (Gu & Dao, 2023), have been proposed as alternatives to Transformer networks in language modeling, by incorporating gating, convolutions, and input-dependent token selection to mitigate the quadratic cost of multi-head attention.

In-Context Learning Language Modelling +1

Strong Lottery Ticket Hypothesis with $\varepsilon$--perturbation

no code implementations29 Oct 2022 Zheyang Xiong, Fangshuo Liao, Anastasios Kyrillidis

The strong Lottery Ticket Hypothesis (LTH) claims the existence of a subnetwork in a sufficiently large, randomly initialized neural network that approximates some target neural network without the need of training.

Frame Difference-Based Temporal Loss for Video Stylization

2 code implementations11 Feb 2021 Jianjin Xu, Zheyang Xiong, Xiaolin Hu

To ensure temporal inconsistency between the frames of the stylized video, a common approach is to estimate the optic flow of the pixels in the original video and make the generated pixels match the estimated optical flow.

Optical Flow Estimation Style Transfer

Cannot find the paper you are looking for? You can Submit a new open access paper.