Follow
Zhi-Jie Yan
Zhi-Jie Yan
iDST, Alibaba Inc.
Verified email at alibaba-inc.com
Title
Cited by
Cited by
Year
I-Vector Based Clustering Training Data in Speech Recognition
Q Huo, ZJ Yan, Y Zhang, J Xu
US Patent App. 13/640,804, 2015
2302015
Deep-FSMN for large vocabulary continuous speech recognition
S Zhang, M Lei, Z Yan, L Dai
2018 IEEE International Conference on Acoustics, Speech and Signal …, 2018
1242018
A scalable approach to using DNN-derived features in GMM-HMM based acoustic modeling for LVCSR.
ZJ Yan, Q Huo, J Xu
Interspeech, 104-108, 2013
632013
A unified trajectory tiling approach to high quality speech rendering
Y Qian, FK Soong, ZJ Yan
IEEE transactions on audio, speech, and language processing 21 (2), 280-290, 2012
632012
Improving latency-controlled BLSTM acoustic models for online speech recognition
S Xue, Z Yan
2017 IEEE International Conference on Acoustics, Speech and Signal …, 2017
602017
M2MeT: The ICASSP 2022 multi-channel multi-party meeting transcription challenge
F Yu, S Zhang, Y Fu, L Xie, S Zheng, Z Du, W Huang, P Guo, Z Yan, B Ma, ...
ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022
572022
A context-sensitive-chunk BPTT approach to training deep LSTM/BLSTM recurrent neural networks for offline handwriting recognition
K Chen, ZJ Yan, Q Huo
2015 13th International Conference on Document Analysis and Recognition …, 2015
432015
Rich-context unit selection (RUS) approach to high quality TTS
ZJ Yan, Y Qian, FK Soong
2010 IEEE International Conference on Acoustics, Speech and Signal …, 2010
372010
Rich context modeling for high quality HMM-based TTS
ZJ Yan, Y Qian, FK Soong
Tenth Annual Conference of the International Speech Communication Association, 2009
362009
Trajectory Tiling Approach for Text-to-Speech
Y Qian, ZJ Yan, YJ Wu, FKP Soong
US Patent App. 12/962,543, 2012
332012
Improved modeling for F0 generation and V/U decision in HMM-based TTS
Q Zhang, F Soong, Y Qian, Z Yan, J Pan, Y Yan
2010 IEEE International Conference on Acoustics, Speech and Signal …, 2010
332010
Investigation of Transformer Based Spelling Correction Model for CTC-Based End-to-End Mandarin Speech Recognition.
S Zhang, M Lei, Z Yan
Interspeech, 2180-2184, 2019
322019
Prosospeech: Enhancing prosody with quantized vector pre-training in text-to-speech
Y Ren, M Lei, Z Huang, S Zhang, Q Chen, Z Yan, Z Zhao
ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022
312022
Streaming chunk-aware multihead attention for online end-to-end speech recognition
S Zhang, Z Gao, H Luo, M Lei, J Gao, Z Yan, L Xie
arXiv preprint arXiv:2006.01712, 2020
302020
Paraformer: Fast and accurate parallel transformer for non-autoregressive end-to-end speech recognition
Z Gao, S Zhang, I McLoughlin, Z Yan
arXiv preprint arXiv:2206.08317, 2022
282022
An i-vector based approach to training data clustering for improved speech recognition
Y Zhang, J Xu, ZJ Yan, Q Huo
Twelfth Annual Conference of the International Speech Communication Association, 2011
282011
Method and apparatus for initiating an operation using voice data
XU Minqiang, Z Yan, J Gao, M Chu
US Patent App. 15/292,632, 2017
272017
Summary on the ICASSP 2022 multi-channel multi-party meeting transcription grand challenge
F Yu, S Zhang, P Guo, Y Fu, Z Du, S Zheng, W Huang, L Xie, ZH Tan, ...
ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022
222022
A real-time speaker diarization system based on spatial spectrum
S Zheng, W Huang, X Wang, H Suo, J Feng, Z Yan
ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021
222021
Tip tap tones: mobile microtraining of mandarin sounds
D Edge, KY Cheng, M Whitney, Y Qian, Z Yan, F Soong
Proceedings of the 14th international conference on Human-computer …, 2012
222012
The system can't perform the operation now. Try again later.
Articles 1–20