Follow
Guoli Ye
Guoli Ye
Microsoft
No verified email
Title
Cited by
Cited by
Year
Advancing acoustic-to-word CTC model
J Li, G Ye, A Das, R Zhao, Y Gong
2018 IEEE International Conference on Acoustics, Speech and Signal …, 2018
1162018
Deep Convolutional Neural Networks with Layer-Wise Context Expansion and Attention.
D Yu, W Xiong, J Droppo, A Stolcke, G Ye, J Li, G Zweig
Interspeech, 17-21, 2016
1072016
Towards code-switching ASR for end-to-end CTC models
K Li, J Li, G Ye, R Zhao, Y Gong
ICASSP 2019-2019 IEEE International Conference on Acoustics, Speech and …, 2019
862019
Developing far-field speaker system via teacher-student learning
J Li, R Zhao, Z Chen, C Liu, X Xiao, G Ye, Y Gong
2018 IEEE International Conference on Acoustics, Speech and Signal …, 2018
672018
Acoustic-to-word model without OOV
J Li, G Ye, R Zhao, J Droppo, Y Gong
2017 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU …, 2017
582017
Low latency end-to-end streaming speech recognition with a scout network
C Wang, Y Wu, S Liu, J Li, L Lu, G Ye, M Zhou
arXiv preprint arXiv:2003.10369, 2020
572020
Semantic mask for transformer based end-to-end speech recognition
C Wang, Y Wu, Y Du, J Li, S Liu, L Lu, S Ren, G Ye, S Zhao, M Zhou
arXiv preprint arXiv:1912.03010, 2019
472019
Large-scale pre-training of end-to-end multi-talker ASR for meeting transcription with single distant microphone
N Kanda, G Ye, Y Wu, Y Gaur, X Wang, Z Meng, Z Chen, T Yoshioka
arXiv preprint arXiv:2103.16776, 2021
332021
End-to-end speaker-attributed ASR with transformer
N Kanda, G Ye, Y Gaur, X Wang, Z Meng, Z Chen, T Yoshioka
arXiv preprint arXiv:2104.02128, 2021
302021
Advancing acoustic-to-word CTC model with attention and mixed-units
A Das, J Li, G Ye, R Zhao, Y Gong
IEEE/ACM Transactions on Audio, Speech, and Language Processing 27 (12 …, 2019
302019
Adaptation of rnn transducer with text-to-speech technology for keyword spotting
E Sharma, G Ye, W Wei, R Zhao, Y Tian, J Wu, L He, E Lin, Y Gong
ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and …, 2020
192020
Minimum word error rate training with language model fusion for end-to-end speech recognition
Z Meng, Y Wu, N Kanda, L Lu, X Chen, G Ye, E Sun, J Li, Y Gong
arXiv preprint arXiv:2106.02302, 2021
162021
Rapid Speaker Adaptation for Conformer Transducer: Attention and Bias Are All You Need.
Y Huang, G Ye, J Li, Y Gong
Interspeech, 1309-1313, 2021
142021
Fast GMM computation for speaker verification using scalar quantization and discrete densities.
G Ye, B Mak, MW Mak
INTERSPEECH, 2327-2330, 2009
102009
Have best of both worlds: Two-pass hybrid and E2E cascading framework for speech recognition
G Ye, V Mazalov, J Li, Y Gong
ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022
82022
Exploring sequential characteristics in speaker bottleneck feature for text-dependent speaker verification
L Chen, Y Zhao, SX Zhang, J Li, G Ye, F Soong
2018 IEEE International Conference on Acoustics, Speech and Signal …, 2018
62018
Adapting large language model with speech for fully formatted end-to-end speech recognition
S Ling, Y Hu, S Qian, G Ye, Y Qian, Y Gong, E Lin, M Zeng
arXiv preprint arXiv:2307.08234, 2023
52023
Wake word selection assistance architectures and methods
E Stoimenov, K Shahid, YE Guoli, HA Khalil, Y Gong
US Patent 11,222,622, 2022
52022
Geo-location dependent deep neural network acoustic model for speech recognition
G Ye, C Liu, Y Gong
2016 IEEE International Conference on Acoustics, Speech and Signal …, 2016
32016
Transition probabilities are more important than we once thought
G Ye, D Chen, B Mak
2012 IEEE International Conference on Acoustics, Speech and Signal …, 2012
32012
The system can't perform the operation now. Try again later.
Articles 1–20