Follow
Yanzhang He
Title
Cited by
Cited by
Year
Streaming end-to-end speech recognition for mobile devices
Y He, TN Sainath, R Prabhavalkar, I McGraw, R Alvarez, D Zhao, ...
ICASSP 2019-2019 IEEE International Conference on Acoustics, Speech and …, 2019
4412019
Lingvo: a modular and scalable framework for sequence-to-sequence modeling
J Shen, P Nguyen, Y Wu, Z Chen, MX Chen, Y Jia, A Kannan, T Sainath, ...
arXiv preprint arXiv:1902.08295, 2019
1352019
A streaming on-device end-to-end model surpassing server-side conventional model quality and latency
TN Sainath, Y He, B Li, A Narayanan, R Pang, A Bruguier, S Chang, W Li, ...
ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and …, 2020
1192020
Two-pass end-to-end speech recognition
TN Sainath, R Pang, D Rybach, Y He, R Prabhavalkar, W Li, M Visontai, ...
arXiv preprint arXiv:1908.10992, 2019
882019
Streaming small-footprint keyword spotting using sequence-to-sequence models
Y He, R Prabhavalkar, K Rao, W Li, A Bakhtin, I McGraw
2017 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU …, 2017
722017
Towards fast and accurate streaming end-to-end ASR
B Li, S Chang, TN Sainath, R Pang, Y He, T Strohman, Y Wu
ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and …, 2020
652020
Deep Neural Network Based Spectral Feature Mapping for Robust Speech Recognition
K Han, Y He, D Bagchi, E Fosler-Lussier, DL Wang
INTERSPEECH 2015, 2015
642015
Combining spectral feature mapping and multi-channel model-based source separation for noise-robust automatic speech recognition
D Bagchi, MI Mandel, Z Wang, Y He, A Plummer, E Fosler-Lussier
2015 IEEE Workshop on Automatic Speech Recognition and Understanding (ASRU …, 2015
482015
Conditional random fields in speech, audio, and language processing
E Fosler-Lussier, Y He, P Jyothi, R Prabhavalkar
Proceedings of the IEEE 101 (5), 1054-1075, 2013
422013
A better and faster end-to-end model for streaming asr
B Li, A Gulati, J Yu, TN Sainath, CC Chiu, A Narayanan, SY Chang, ...
ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021
362021
Subword-based modeling for handling OOV words in keyword spotting
Y He, B Hutchinson, P Baumann, M Ostendorf, E Fosler-Lussier, ...
Acoustics, Speech and Signal Processing (ICASSP), 2014 IEEE International …, 2014
342014
Fastemit: Low-latency streaming asr with sequence-level emission regularization
J Yu, CC Chiu, B Li, S Chang, TN Sainath, Y He, A Narayanan, W Han, ...
ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021
292021
Voicefilter-lite: Streaming targeted voice separation for on-device speech recognition
Q Wang, IL Moreno, M Saglam, K Wilson, A Chiao, R Liu, Y He, W Li, ...
arXiv preprint arXiv:2009.04323, 2020
282020
Efficient Segmental Conditional Random Fields for Phone Recognition
Y He, E Fosler-Lussier
13th Annual Conference of the International Speech Communication Association …, 2012
222012
Joint endpointing and decoding with end-to-end models
SY Chang, R Prabhavalkar, Y He, TN Sainath, G Simko
ICASSP 2019-2019 IEEE International Conference on Acoustics, Speech and …, 2019
212019
Learning word-level confidence for subword end-to-end ASR
D Qiu, Q Li, Y He, Y Zhang, B Li, L Cao, R Prabhavalkar, D Bhatia, W Li, ...
ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021
182021
Confidence estimation for attention-based sequence-to-sequence models for speech recognition
Q Li, D Qiu, Y Zhang, B Li, Y He, PC Woodland, L Cao, T Strohman
ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021
172021
Using pronunciation-based morphological subword units to improve OOV handling in keyword search
Y He, P Baumann, H Fang, B Hutchinson, A Jaech, M Ostendorf, ...
IEEE/ACM Transactions on Audio, Speech, and Language Processing 24 (1), 79-92, 2015
142015
An efficient streaming non-recurrent on-device end-to-end model with improvements to rare-word modeling
TN Sainath, YR He, A Narayanan, R Botros, R Pang, DJ Rybach, ...
112021
Segmental Conditional Random Fields with Deep Neural Networks as Acoustic Models for First-Pass Word Recognition
Y He, E Fosler-Lussier
INTERSPEECH 2015, 2015
112015
The system can't perform the operation now. Try again later.
Articles 1–20