no code implementations • 14 Jul 2023 • Varun Krishna, Tarun Sai, Sriram Ganapathy
The input to the model consists of audio samples that are windowed and processed with 1-D convolutional layers.
Automatic Speech Recognition Automatic Speech Recognition (ASR) +3