no code implementations • 31 Mar 2022 • Bence Mark Halpern, Teja Rebernik, Thomas Tienkamp, Rob van Son, Michiel van den Brekel, Martijn Wieling, Max Witjes, Odette Scharenborg
We present an articulatory synthesis framework for the synthesis and manipulation of oral cancer speech for clinical decision making and alleviation of patient stress.
no code implementations • 13 Jan 2022 • Luke Prananta, Bence Mark Halpern, Siyuan Feng, Odette Scharenborg
In this paper, we investigate several existing and a new state-of-the-art generative adversarial network-based (GAN) voice conversion method for enhancing dysarthric speech for improved dysarthric speech recognition.
no code implementations • 15 Oct 2021 • Wen-Chin Huang, Bence Mark Halpern, Lester Phillip Violeta, Odette Scharenborg, Tomoki Toda
We present a voice conversion framework that converts normal speech into dysarthric speech while preserving the speaker identity.
no code implementations • 1 Jul 2021 • Bence Mark Halpern, Julian Fritsch, Enno Hermann, Rob van Son, Odette Scharenborg, Mathew Magimai. -Doss
The development of pathological speech systems is currently hindered by the lack of a standardised objective evaluation framework.
no code implementations • 15 Jun 2021 • Marc Illa, Bence Mark Halpern, Rob van Son, Laureano Moro-Velazquez, Odette Scharenborg
This approach alleviates the evaluation problem one normally has when converting typical speech to pathological speech, as in our approach, the voice conversion (VC) model does not need to be optimised for speech degradation but only for the speaker change.
1 code implementation • 28 Mar 2021 • Siyuan Feng, Olya Kudina, Bence Mark Halpern, Odette Scharenborg
Practice and recent evidence suggests that the state-of-the-art (SotA) ASRs struggle with the large variation in speech due to e. g., gender, age, speech impairment, race, and accents.
Automatic Speech Recognition Automatic Speech Recognition (ASR) +1
1 code implementation • 28 Jul 2020 • Bence Mark Halpern, Rob van Son, Michiel van den Brekel, Odette Scharenborg
3) We set baselines for an oral cancer speech detection task on this dataset.