ESPnet: End-to-end speech processing toolkit | S Watanabe, T Hori, S Karita, T Hayashi, J Nishitoba, Y Unno, NEY Soplin, ... | arXiv preprint arXiv:1804.00015, 2018 | 1464 | 2018 |
Multilingual sequence-to-sequence speech recognition: architecture, transfer learning, and language modeling | J Cho, MK Baskar, R Li, M Wiesner, SH Mallidi, N Yalta, M Karafiat, ... | 2018 IEEE Spoken Language Technology Workshop (SLT), 521-527, 2018 | 134 | 2018 |
The Multilingual TEDx corpus for speech recognition and translation | E Salesky, M Wiesner, J Bremerman, R Cattoni, M Negri, M Turchi, ... | arXiv preprint arXiv:2102.01757, 2021 | 115 | 2021 |
Findings of the IWSLT 2022 Evaluation Campaign | A Anastasopoulos, L Barrault, L Bentivogli, MZ Boito, O Bojar, R Cattoni, ... | Proceedings of the 19th International Conference on Spoken Language …, 2022 | 89 | 2022 |
Massively multilingual adversarial speech recognition | O Adams, M Wiesner, S Watanabe, D Yarowsky | arXiv preprint arXiv:1904.02210, 2019 | 82 | 2019 |
Multi-modal data augmentation for end-to-end ASR | A Renduchintala, S Ding, M Wiesner, S Watanabe | arXiv preprint arXiv:1803.10299, 2018 | 64 | 2018 |
The Kaldi OpenKWS System: Improving Low Resource Keyword Search | J Trmal, M Wiesner, V Peddinti, X Zhang, P Ghahremani, Y Wang, ... | Interspeech, 3597-3601, 2017 | 47 | 2017 |
A corpus for large-scale phonetic typology | E Salesky, E Chodroff, T Pimentel, M Wiesner, R Cotterell, AW Black, ... | arXiv preprint arXiv:2005.13962, 2020 | 25 | 2020 |
The CHiME-7 DASR challenge: Distant meeting transcription with multiple devices in diverse scenarios | S Cornell, M Wiesner, S Watanabe, D Raj, X Chang, P Garcia, ... | arXiv preprint arXiv:2306.13734, 2023 | 24 | 2023 |
Topic identification for speech without ASR | C Liu, J Trmal, M Wiesner, C Harman, S Khudanpur | arXiv preprint arXiv:1703.07476, 2017 | 21 | 2017 |
Automatic speech recognition and topic identification for almost-zero-resource languages | M Wiesner, C Liu, L Ondel, C Harman, V Manohar, J Trmal, Z Huang, ... | arXiv preprint arXiv:1802.08731, 2018 | 16 | 2018 |
Pretraining by backtranslation for end-to-end ASR in low-resource settings | M Wiesner, A Renduchintala, S Watanabe, C Liu, N Dehak, S Khudanpur | arXiv preprint arXiv:1812.03919, 2018 | 15* | 2018 |
Analysis of multilingual sequence-to-sequence speech recognition systems | M Karafiát, MK Baskar, S Watanabe, T Hori, M Wiesner, J Černocký | arXiv preprint arXiv:1811.03451, 2018 | 13 | 2018 |
Towards zero-shot code-switched speech recognition | B Yan, M Wiesner, O Klejch, P Jyothi, S Watanabe | ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and …, 2023 | 8 | 2023 |
Injecting text and cross-lingual supervision in few-shot learning from self-supervised models | M Wiesner, D Raj, S Khudanpur | ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022 | 8 | 2022 |
End-to-end ASR to jointly predict transcriptions and linguistic annotations | M Omachi, Y Fujita, S Watanabe, M Wiesner | Proceedings of the 2021 Conference of the North American Chapter of the …, 2021 | 8 | 2021 |
Towards machines that know when they do not know: Summary of work done at 2014 Frederick Jelinek Memorial Workshop | H Hermansky, L Burget, J Cohen, E Dupoux, N Feldman, J Godfrey, ... | 2015 IEEE International Conference on Acoustics, Speech and Signal …, 2015 | 7 | 2015 |
JHU IWSLT 2022 dialect speech translation system description | J Yang, A Hussein, M Wiesner, S Khudanpur | Proceedings of the 19th International Conference on Spoken Language …, 2022 | 6 | 2022 |
Zero-shot pronunciation lexicons for cross-language acoustic model transfer | M Wiesner, O Adams, D Yarowsky, J Trmal, S Khudanpur | 2019 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU …, 2019 | 6 | 2019 |
Training Hybrid Models on Noisy Transliterated Transcripts for Code-Switched Speech Recognition | M Wiesner, M Sarma, A Arora, D Raj, D Gao, R Huang, S Preet, ... | Interspeech, 2906-2910, 2021 | 4 | 2021 |