Shaofei Zhang
Senior Software Engineer, Microsoft
Conversational end-to-end TTS for voice agents
H Guo, S Zhang, FK Soong, L He, L Xie
2021 IEEE Spoken Language Technology Workshop (SLT), 403-409, 2021
Exemplar-based sparse representation of timbre and prosody for voice conversion
H Ming, D Huang, L Xie, S Zhang, M Dong, H Li
2016 IEEE International Conference on Acoustics, Speech and Signal …, 2016
Fundamental frequency modeling using wavelets for emotional voice conversion
H Ming, D Huang, M Dong, H Li, L Xie, S Zhang
2015 International Conference on Affective Computing and Intelligent …, 2015
ParaTTS: Learning linguistic and prosodic cross-sentence information in paragraph-based TTS
L Xue, FK Soong, S Zhang, L Xie
IEEE/ACM Transactions on Audio, Speech, and Language Processing 30, 2854-2864, 2022
Self-supervised context-aware style representation for expressive speech synthesis
Y Wu, X Wang, S Zhang, L He, R Song, JY Nie
arXiv preprint arXiv:2206.12559, 2022
Non-negative matrix factorization using stable alternating direction method of multipliers for source separation
S Zhang, D Huang, L Xie, ES Chng, H Li, M Dong
2015 Asia-Pacific Signal and Information Processing Association Annual …, 2015
An automatic voice conversion evaluation strategy based on perceptual background noise distortion and speaker similarity
DY Huang, L Xie, S Zhang, YSW Lee, J Wu, H Ming, X Tian, C Ding, M Li, ...
A hybrid virtual bass system with improved phase vocoder and high efficiency
S Zhang, L Xie, ZH Fu, Y Yuan
The 9th International Symposium on Chinese Spoken Language Processing, 401-405, 2014
StyleSpeech: Self-supervised style enhancing with VQ-VAE-based pre-training for expressive audiobook speech synthesis
X Chen, X Wang, S Zhang, L He, Z Wu, X Wu, H Meng
ICASSP 2024-2024 IEEE International Conference on Acoustics, Speech and …, 2024
MuLanTTS: The Microsoft Speech Synthesis System for Blizzard Challenge 2023
Z Xu, S Zhang, X Wang, J Zhang, W Wei, L He, S Zhao
arXiv preprint arXiv:2309.02743, 2023
ContextSpeech: Expressive and Efficient Text-to-Speech for Paragraph Reading
Y Xiao, S Zhang, X Wang, X Tan, L He, S Zhao, FK Soong, T Lee
arXiv preprint arXiv:2307.00782, 2023
Paragraph synthesis with cross utterance features for neural TTS
S Zhang, L He
US Patent App. 17/631,695, 2022
Regularized non-negative matrix factorization using alternating direction method of multipliers and its application to source separation
S Zhang, DY Huang, L Xie, E Chng, H Li, M Dong
INTERSPEECH, 1498-1502, 2015
Large-Scale Automatic Audiobook Creation
B Walsh, M Hamilton, G Newby, X Wang, S Ruan, S Zhao, L He, S Zhang, ...
arXiv preprint arXiv:2309.03926, 2023