Hubert: Self-supervised speech representation learning by masked prediction of hidden units WN Hsu, B Bolte, YHH Tsai, K Lakhotia, R Salakhutdinov, A Mohamed IEEE/ACM transactions on audio, speech, and language processing 29, 3451-3460, 2021 | 2503 | 2021 |
Superb: Speech processing universal performance benchmark S Yang, PH Chi, YS Chuang, CIJ Lai, K Lakhotia, YY Lin, AT Liu, J Shi, ... arXiv preprint arXiv:2105.01051, 2021 | 850 | 2021 |
XLS-R: Self-supervised cross-lingual speech representation learning at scale A Babu, C Wang, A Tjandra, K Lakhotia, Q Xu, N Goyal, K Singh, ... arXiv preprint arXiv:2111.09296, 2021 | 604 | 2021 |
On generative spoken language modeling from raw audio K Lakhotia, E Kharitonov, WN Hsu, Y Adi, A Polyak, B Bolte, TA Nguyen, ... Transactions of the Association for Computational Linguistics 9, 1336-1354, 2021 | 310 | 2021 |
Speech resynthesis from discrete disentangled self-supervised representations A Polyak, Y Adi, J Copet, E Kharitonov, K Lakhotia, WN Hsu, A Mohamed, ... arXiv preprint arXiv:2104.00355, 2021 | 282 | 2021 |
Learning audio-visual speech representation by masked multimodal cluster prediction B Shi, WN Hsu, K Lakhotia, A Mohamed arXiv preprint arXiv:2201.02184, 2022 | 266 | 2022 |
The llama 3 herd of models A Dubey, A Jauhri, A Pandey, A Kadian, A Al-Dahle, A Letman, A Mathur, ... arXiv preprint arXiv:2407.21783, 2024 | 220 | 2024 |
Text-free prosody-aware generative spoken language modeling E Kharitonov, A Lee, A Polyak, Y Adi, J Copet, K Lakhotia, TA Nguyen, ... arXiv preprint arXiv:2109.03264, 2021 | 108 | 2021 |
SUPERB-SG: Enhanced speech processing universal performance benchmark for semantic and generative capabilities HS Tsai, HJ Chang, WC Huang, Z Huang, K Lakhotia, S Yang, S Dong, ... arXiv preprint arXiv:2203.06849, 2022 | 93 | 2022 |
Domain-matched pre-training tasks for dense retrieval B Oğuz, K Lakhotia, A Gupta, P Lewis, V Karpukhin, A Piktus, X Chen, ... arXiv preprint arXiv:2107.13602, 2021 | 57 | 2021 |
Salient phrase aware dense retrieval: can a dense retriever imitate a sparse one? X Chen, K Lakhotia, B Oğuz, A Gupta, P Lewis, S Peshterliev, Y Mehdad, ... arXiv preprint arXiv:2110.06918, 2021 | 55 | 2021 |
Fid-ex: Improving sequence-to-sequence models for extractive rationale generation K Lakhotia, B Paranjape, A Ghoshal, W Yih, Y Mehdad, S Iyer arXiv preprint arXiv:2012.15482, 2020 | 23 | 2020 |
K.-t. Lee, D SW Yang, PH Chi, YS Chuang, CIJ Lai, K Lakhotia, YY Lin, AT Liu, J Shi, ... R. Liu, Z. Huang, S. Dong, S.-W. Li, S. Watanabe, A. Mohamed, and H.-y. Lee …, 2021 | 21 | 2021 |
Pytext: A seamless path from nlp research to production A Aly, K Lakhotia, S Zhao, M Mohit, B Oguz, A Arora, S Gupta, C Dewan, ... arXiv preprint arXiv:1812.08729, 2018 | 16 | 2018 |
textless-lib: A library for textless spoken language processing E Kharitonov, J Copet, K Lakhotia, TA Nguyen, P Tomasello, A Lee, ... arXiv preprint arXiv:2202.07359, 2022 | 12 | 2022 |
A Large-Scale Evaluation of Speech Foundation Models S Yang, HJ Chang, Z Huang, AT Liu, CI Lai, H Wu, J Shi, X Chang, ... IEEE/ACM Transactions on Audio, Speech, and Language Processing, 2024 | 8 | 2024 |
Quaser: Question answering with scalable extractive rationalization A Ghoshal, S Iyer, B Paranjape, K Lakhotia, SW Yih, Y Mehdad Proceedings of the 45th International ACM SIGIR Conference on Research and …, 2022 | 4 | 2022 |
Virtual Assistant That Provides Answers Based On Past Conversations K Lakhotia, D Savenkov, S Gupta | | 2023 |
Speech Resynthesis from Disentangled Self-Supervised Discrete Representations A Polyak, Y Adi, J Copet, E Kharitonov, K Lakhotia, WN Hsu, A Mohamed, ... | | |
SUPERB: Speech Understanding and PERformance Benchmark SYPH Chi, YS Chuang, CI Lai, K Lakhotia, YY Lin, AT Liu, J Shi, XCD Lin, ... | | |