no code implementations • 4 Apr 2022 • Muhammad Huzaifah, Ivan Kukanov
Embeddings play an important role in end-to-end solutions for multi-modal language processing problems.
Automatic Speech Recognition Automatic Speech Recognition (ASR) +5
no code implementations • 26 Mar 2019 • Lonce Wyse, Muhammad Huzaifah
A recurrent Neural Network (RNN) is trained to predict sound samples based on audio input augmented by control parameter information for pitch, volume, and instrument identification.