1 code implementation • 4 Jul 2021 • Xuejiao Tang, Xin Huang, Wenbin Zhang, Travers B. Child, Qiong Hu, Zhen Liu, Ji Zhang
Moreover, the proposed model provides an intuitive interpretation of visual commonsense reasoning.
no code implementations • 13 Jan 2021 • Qiong Hu, Tobias Bleisch, Petko Petkov, Tuomo Raitio, Erik Marchi, Varun Lakshminarasimhan
2) Although our speaker verification (SV) model is not explicitly trained to discriminate between speaking styles, and no Lombard or whispered speech is used to pre-train it, the SV model can serve as a style encoder that generates style embeddings as input to the Tacotron system.
no code implementations • 9 Sep 2016 • Xi Peng, Qiong Hu, Junzhou Huang, Dimitris N. Metaxas
Our approach takes advantage of part-based representation and cascade regression for robust and efficient alignment on each frame.