Search Results for author: Zongyang Du

Found 6 papers, 0 papers with code

Exploring speech style spaces with language models: Emotional TTS without emotion labels

no code implementations • 18 May 2024 • Shreeram Suresh Chandra, Zongyang Du, Berrak Sisman

Many frameworks for emotional text-to-speech (E-TTS) rely on human-annotated emotion labels that are often inaccurate and difficult to obtain.

Transfer Learning

Revealing Emotional Clusters in Speaker Embeddings: A Contrastive Learning Strategy for Speech Emotion Recognition

no code implementations • 19 Jan 2024 • Ismail Rasim Ulgen, Zongyang Du, Carlos Busso, Berrak Sisman

In order to leverage the emotional clusters that emerge in speaker embeddings, we introduce a novel contrastive pretraining approach applied to emotion-unlabeled data for speech emotion recognition.

Contrastive Learning, Speech Emotion Recognition
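To make the contrastive-pretraining idea above concrete, here is a minimal PyTorch sketch of an NT-Xent-style objective over two augmented views of unlabeled utterance embeddings. The embedding size, temperature, and augmentation scheme are illustrative assumptions, not the paper's exact setup.

# Minimal sketch: contrastive pretraining on emotion-unlabeled utterance
# embeddings (details here are assumptions, not the paper's exact objective).
import torch
import torch.nn.functional as F

def nt_xent_loss(z1, z2, temperature=0.1):
    """z1, z2: (batch, dim) embeddings of two views of the same utterances."""
    z1, z2 = F.normalize(z1, dim=1), F.normalize(z2, dim=1)
    z = torch.cat([z1, z2], dim=0)                 # (2B, dim)
    sim = z @ z.t() / temperature                  # scaled cosine similarities
    sim.fill_diagonal_(float('-inf'))              # mask self-similarity
    batch = z1.size(0)
    # positives: view i in z1 pairs with view i + B in z2, and vice versa
    targets = torch.cat([torch.arange(batch) + batch, torch.arange(batch)])
    return F.cross_entropy(sim, targets)

# usage with hypothetical speaker-encoder outputs (e.g. x-vector-sized)
z1 = torch.randn(8, 192)   # embeddings of view 1 of 8 utterances
z2 = torch.randn(8, 192)   # embeddings of augmented view 2
loss = nt_xent_loss(z1, z2)

After pretraining with such an objective, the encoder would typically be fine-tuned on the labeled emotion-recognition task.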

Disentanglement of Emotional Style and Speaker Identity for Expressive Voice Conversion

no code implementations • 20 Oct 2021 • Zongyang Du, Berrak Sisman, Kun Zhou, Haizhou Li

Expressive voice conversion performs identity conversion for emotional speakers by jointly converting speaker identity and emotional style.

Disentanglement, Voice Conversion
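As a rough illustration of the disentanglement setup described above, the following PyTorch sketch uses separate speaker and emotional-style encoders whose embeddings condition a decoder, so that swapping the speaker reference at inference converts identity while keeping emotional style. The architecture, dimensions, and the ExpressiveVC name are assumptions for illustration, not the paper's actual model.

# Minimal sketch of speaker/emotion disentanglement for expressive voice
# conversion (architecture details are assumptions, not the paper's model).
import torch
import torch.nn as nn

class ExpressiveVC(nn.Module):
    def __init__(self, feat_dim=80, content_dim=128, spk_dim=64, emo_dim=64):
        super().__init__()
        self.content_enc = nn.GRU(feat_dim, content_dim, batch_first=True)
        self.spk_enc = nn.Sequential(nn.Linear(feat_dim, spk_dim), nn.ReLU())
        self.emo_enc = nn.Sequential(nn.Linear(feat_dim, emo_dim), nn.ReLU())
        self.decoder = nn.GRU(content_dim + spk_dim + emo_dim, feat_dim, batch_first=True)

    def forward(self, mel, spk_ref, emo_ref):
        content, _ = self.content_enc(mel)           # frame-level content
        spk = self.spk_enc(spk_ref.mean(dim=1))      # utterance-level speaker embedding
        emo = self.emo_enc(emo_ref.mean(dim=1))      # utterance-level emotion embedding
        cond = torch.cat([spk, emo], dim=-1).unsqueeze(1).expand(-1, mel.size(1), -1)
        out, _ = self.decoder(torch.cat([content, cond], dim=-1))
        return out

# usage: feed a target speaker's reference while keeping the source emotion reference
model = ExpressiveVC()
mel = torch.randn(2, 100, 80)                         # source mel-spectrograms
converted = model(mel, torch.randn(2, 100, 80), mel)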

Spectrum and Prosody Conversion for Cross-lingual Voice Conversion with CycleGAN

no code implementations • 11 Aug 2020 • Zongyang Du, Kun Zhou, Berrak Sisman, Haizhou Li

Cross-lingual voice conversion relies on non-parallel training data from two different languages and is therefore more challenging than mono-lingual voice conversion.

Voice Conversion
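The CycleGAN idea named in the title can be sketched as follows: two generators map spectral features between a source and a target speaker of different languages, trained with an adversarial term plus a cycle-consistency term so that no parallel utterances are needed. The networks, feature dimension, and loss weight below are illustrative assumptions, and only one conversion direction is shown.

# Minimal sketch of the CycleGAN training signal for non-parallel conversion
# (hypothetical simple MLPs stand in for the actual generator/discriminator).
import torch
import torch.nn as nn

feat_dim = 24  # e.g. mel-cepstral coefficients per frame (assumption)
G_xy = nn.Sequential(nn.Linear(feat_dim, 128), nn.ReLU(), nn.Linear(128, feat_dim))
G_yx = nn.Sequential(nn.Linear(feat_dim, 128), nn.ReLU(), nn.Linear(128, feat_dim))
D_y  = nn.Sequential(nn.Linear(feat_dim, 128), nn.ReLU(), nn.Linear(128, 1))

l1, mse = nn.L1Loss(), nn.MSELoss()

x = torch.randn(32, feat_dim)   # frames from the source-language speaker
y = torch.randn(32, feat_dim)   # unpaired frames from the target-language speaker

fake_y  = G_xy(x)                                  # source -> target
cycle_x = G_yx(fake_y)                             # target -> source (cycle)
adv_loss = mse(D_y(fake_y), torch.ones(32, 1))     # LSGAN-style adversarial term
cyc_loss = l1(cycle_x, x)                          # cycle consistency: x -> y -> x
gen_loss = adv_loss + 10.0 * cyc_loss              # weighted sum (weight is illustrative)

A full CycleGAN would add the symmetric y -> x adversarial term and, per the title, apply such losses to both spectrum and prosody features.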
