1 code implementation • 30 May 2023 • Theodoros Kouzelis, Georgios Paraskevopoulos, Athanasios Katsamanis, Vassilis Katsouros
The study of speech disorders can benefit greatly from time-aligned data.
no code implementations • 6 Apr 2023 • Thodoris Kouzelis, Grigoris Bastas, Athanasios Katsamanis, Alexandros Potamianos
The results show that the proposed techniques improve the performance of our system and while reducing the computational complexity.
no code implementations • 3 Apr 2023 • Nikolaos Antoniou, Athanasios Katsamanis, Theodoros Giannakopoulos, Shrikanth Narayanan
There is an imminent need for guidelines and standard test sets to allow direct and fair comparisons of speech emotion recognition (SER).
no code implementations • 31 Dec 2022 • Georgios Paraskevopoulos, Theodoros Kouzelis, Georgios Rouvalis, Athanasios Katsamanis, Vassilis Katsouros, Alexandros Potamianos
Modern speech recognition systems exhibits rapid performance degradation under domain shift.
1 code implementation • 22 Jul 2022 • Panagiotis P. Filntisis, George Retsinas, Foivos Paraperas-Papantoniou, Athanasios Katsamanis, Anastasios Roussos, Petros Maragos
The recent state of the art on monocular 3D face reconstruction from image data has made some impressive advancements, thanks to the advent of Deep Learning.
no code implementations • 28 Apr 2022 • Efthymios Georgiou, Kosmas Kritsis, Georgios Paraskevopoulos, Athanasios Katsamanis, Vassilis Katsouros, Alexandros Potamianos
Recent deep learning Text-to-Speech (TTS) systems have achieved impressive performance by generating speech close to human parity.
no code implementations • 1 Apr 2022 • Gerasimos Chatzoudis, Manos Plitsis, Spyridoula Stamouli, Athanasia-Lida Dimou, Athanasios Katsamanis, Vassilis Katsouros
Like in many medical applications, aphasic speech data is scarce and the problem is exacerbated in so-called "low resource" languages, which are, for this task, most languages excluding English.
Automatic Speech Recognition Automatic Speech Recognition (ASR) +2
no code implementations • 30 Oct 2021 • Emmanouil Zaranis, Georgios Paraskevopoulos, Athanasios Katsamanis, Alexandros Potamianos
Specifically, during finetuning we propose to use three objectives: response language modeling, sentiment understanding, and empathy forcing.
no code implementations • 18 Feb 2021 • Efthymios Georgiou, Athanasios Katsamanis
This brief literature review studies the problem of audiovisual speech synthesis, which is the problem of generating an animated talking head given a text as input.
no code implementations • LREC 2012 • Priti Aggarwal, Ron artstein, Jillian Gerten, Athanasios Katsamanis, Shrikanth Narayanan, Angela Nazarian, David Traum
In addition to speech recordings, the corpus contains the outputs of speech recognition performed at the time of utterance as well as the system interpretation of the utterances.