no code implementations • 24 Jun 2020 • Chelhwon Kim, Andrew Port, Mitesh Patel
Further, we discover that the distance preservation constraint in the generative adversarial model leads to reduced diversity in the translated audio samples, and propose the use of an auxiliary discriminator to enhance the diversity of the translations while using the distance preservation constraint.
no code implementations • 27 May 2020 • Andrew Port, Chelhwon Kim, Mitesh Patel
A generative adversarial network (GAN) is then used to find a distance-preserving map from this metric space of feature vectors into the metric space defined by a target audio dataset equipped with either the Euclidean metric or a mel-frequency cepstrum-based psychoacoustic distance metric.
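As a rough illustration of what "distance preserving" means here, the sketch below measures how far a map is from preserving pairwise distances on a small batch. It is not the authors' loss: the function names, the toy data, and the choice of a mean squared gap are all assumptions made for illustration, and the abstract does not say how the constraint is actually weighted or combined with the adversarial objective.

```python
import math
from itertools import combinations


def euclidean(u, v):
    """Euclidean distance between two equal-length vectors."""
    return math.sqrt(sum((a - b) ** 2 for a, b in zip(u, v)))


def distance_preservation_penalty(sources, targets, metric=euclidean):
    """Mean squared gap between pairwise distances before and after mapping.

    sources[i] is a feature vector and targets[i] is its image under the map.
    A penalty of 0 means the map preserves every pairwise distance in the batch.
    (Hypothetical helper; the same metric is used on both sides for simplicity.)
    """
    pairs = list(combinations(range(len(sources)), 2))
    if not pairs:
        return 0.0
    gaps = [
        (metric(sources[i], sources[j]) - metric(targets[i], targets[j])) ** 2
        for i, j in pairs
    ]
    return sum(gaps) / len(gaps)


# Hypothetical toy batch: three 2-D feature vectors.
features = [(0.0, 0.0), (3.0, 4.0), (6.0, 8.0)]
shifted = [(x + 1.0, y - 2.0) for x, y in features]  # translation keeps distances
scaled = [(2.0 * x, 2.0 * y) for x, y in features]   # scaling doubles distances

print(distance_preservation_penalty(features, shifted))  # 0.0
print(distance_preservation_penalty(features, scaled))   # 50.0
```

A translation leaves every pairwise distance unchanged, so its penalty is zero, while doubling every coordinate doubles the distances and is penalized. To mimic the second setting in the abstract, a different `metric` could be passed for the target side; that would require splitting the single `metric` argument into two.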
no code implementations • 11 May 2020 • Jingwei Song, Mitesh Patel, Andreas Girgensohn, Chelhwon Kim
Considering these, we propose a novel approach that combines a deep learning (DL) method with a traditional feature-based approach to achieve better localization with small training data.