no code implementations • 14 Sep 2023 • William Aris, François Grondin
However, computer vision tends to involve a large computational load due to the amount of data (i. e. pixels) that needs to be processed in a short amount of time.
1 code implementation • 10 Sep 2023 • François Grondin, Caleb Rascón
Spatial filters can exploit deep-learning-based speech enhancement models to increase their reliability in scenarios with multiple speech sources scenarios.
no code implementations • 2 Mar 2023 • Jacob Kealey, Anthony Gosselin, Étienne Deshaies-Samson, Francis Cardinal, Félix Ducharme-Turcotte, Olivier Bergeron, Amélie Rioux-Joyal, Jérémy Bélec, François Grondin
Results demonstrate the feasibility of the approach, and opens the door to the exploration and validation of a wide range of beamformer and speech enhancement methods for real-time speech enhancement.
no code implementations • 1 Mar 2023 • Pierre-Olivier Lagacé, François Ferland, François Grondin
The performance of speech and events recognition systems significantly improved recently thanks to deep learning methods.
1 code implementation • 19 Jun 2022 • Luca Della Libera, Cem Subakan, Mirco Ravanelli, Samuele Cornell, Frédéric Lepoutre, François Grondin
Transformers have recently achieved state-of-the-art performance in speech separation.
no code implementations • 28 Apr 2022 • François Grondin, Marc-Antoine Maheux, Jean-Samuel Lauzon, Jonathan Vincent, François Michaud
This paper introduces the Fast Cross-Correlation (FCC) method for Time Difference of Arrival (TDoA) Estimation for pairs of microphones on a small aperture microphone array.
no code implementations • 18 Feb 2022 • Sahar Bahrami, Jérémy Moriot, Patrice Masson, François Grondin
Classification and regression employing a simple Deep Neural Network (DNN) are investigated to perform touch localization on a tactile surface using ultrasonic guided waves.
no code implementations • 8 Nov 2021 • Samuele Cornell, Manuel Pariente, François Grondin, Stefano Squartini
We perform a detailed analysis using the recent Clarity Challenge data and show that by using learnt filterbanks it is possible to surpass oracle-mask based beamforming for short windows.
1 code implementation • 20 Oct 2021 • Cem Subakan, Mirco Ravanelli, Samuele Cornell, François Grondin
First, we release the REAL-M dataset, a crowd-sourced corpus of real-life mixtures.
no code implementations • 6 Oct 2021 • Thomas Bernard, François Grondin
This paper introduces a new method referred to as KISS-GEV (for Keep It Super Simple Generalized eigenvalue) beamforming.
no code implementations • 17 Sep 2021 • Simon Chamorro, Jack Collier, François Grondin
Pose estimates are predicted from lidar scans using a Convolutional Neural Network trained using an existing stereo-based pose estimation system.
4 code implementations • 8 Jun 2021 • Mirco Ravanelli, Titouan Parcollet, Peter Plantinga, Aku Rouhe, Samuele Cornell, Loren Lugosch, Cem Subakan, Nauman Dawalatabad, Abdelwahab Heba, Jianyuan Zhong, Ju-chieh Chou, Sung-Lin Yeh, Szu-Wei Fu, Chien-Feng Liao, Elena Rastorgueva, François Grondin, William Aris, Hwidong Na, Yan Gao, Renato de Mori, Yoshua Bengio
SpeechBrain is an open-source and all-in-one speech toolkit.
no code implementations • 3 Apr 2021 • Nauman Dawalatabad, Mirco Ravanelli, François Grondin, Jenthe Thienpondt, Brecht Desplanques, Hwidong Na
Learning robust speaker embeddings is a crucial step in speaker diarization.
1 code implementation • 5 Mar 2021 • François Grondin, Dominic Létourneau, Cédric Godin, Jean-Samuel Lauzon, Jonathan Vincent, Simon Michaud, Samuel Faucher, François Michaud
Artificial audition aims at providing hearing capabilities to machines, computers and robots.
1 code implementation • 19 Oct 2020 • François Grondin, Jean-Samuel Lauzon, Simon Michaud, Mirco Ravanelli, François Michaud
This paper introduces BIRD, the Big Impulse Response Dataset.
Sound Audio and Speech Processing
1 code implementation • 31 Jul 2020 • Jonathan Vincent, Mathieu Labbé, Jean-Samuel Lauzon, François Grondin, Pier-Marc Comtois-Rivet, François Michaud
In dynamic environments, performance of visual SLAM techniques can be impaired by visual features taken from moving objects.