Kaizhi Qian

Cited by

	All	Since 2019
Citations	1147	1120
h-index	12	12
i10-index	12	12

Citations per year
Year	Citations
2018	26
2019	33
2020	98
2021	182
2022	287
2023	405
2024	109

Public access

3 articles available
0 articles not available

Based on funding mandates

Co-authors

Name	Affiliation	Verified email
Yang Zhang	MIT-IBM Watson AI Lab	ibm.com
Mark Hasegawa-Johnson	Professor of Electrical and Computer Engineering, University of Illinois	illinois.edu
Shiyu Chang	University of California, Santa Barbara	cs.ucsb.edu
David Cox	VP, AI Models; IBM Director, MIT-IBM Watson AI Lab, IBM Research	ibm.com
Xuesong Yang	NVIDIA	nvidia.com
Dinei Florencio	Microsoft Research	microsoft.com
Zeyu Jin	Adobe Research	adobe.com
Gautham J. Mysore	Senior Principal Scientist, Adobe Research	adobe.com
Jenelle Feather	Flatiron Institute	flatironinstitute.org
Masato Akagi	Professor of Japan Advanced Institute of Science and Technology	jaist.ac.jp

Kaizhi Qian

MIT-IBM Watson AI Lab

Verified email at ibm.com

speech processing, deep learning


Title	Authors	Venue	Cited by	Year
AutoVC: Zero-shot voice style transfer with only autoencoder loss	K Qian, Y Zhang, S Chang, X Yang, M Hasegawa-Johnson	International Conference on Machine Learning, 5210-5219, 2019	457	2019
Unsupervised speech decomposition via triple information bottleneck	K Qian, Y Zhang, S Chang, M Hasegawa-Johnson, D Cox	International Conference on Machine Learning, 7836-7846, 2020	169	2020
F0-consistent many-to-many non-parallel voice conversion via conditional autoencoder	K Qian, Z Jin, M Hasegawa-Johnson, GJ Mysore	ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and …, 2020	105	2020
Speech Enhancement Using Bayesian WaveNet	K Qian, Y Zhang, S Chang, X Yang, D Florêncio, M Hasegawa-Johnson	Interspeech, 2013-2017, 2017	100	2017
ContentVec: An improved self-supervised speech representation by disentangling speakers	K Qian, Y Zhang, H Gao, J Ni, CI Lai, D Cox, M Hasegawa-Johnson, ...	International Conference on Machine Learning, 18003-18017, 2022	60	2022
PARP: Prune, adjust and re-prune for self-supervised speech recognition	CIJ Lai, Y Zhang, AH Liu, S Chang, YL Liao, YS Chuang, K Qian, ...	Advances in Neural Information Processing Systems 34, 21256-21272, 2021	54	2021
Deep learning based speech beamforming	K Qian, Y Zhang, S Chang, X Yang, D Florencio, M Hasegawa-Johnson	2018 IEEE International Conference on Acoustics, Speech and Signal …, 2018	43	2018
Global prosody style transfer without text transcriptions	K Qian, Y Zhang, S Chang, J Xiong, C Gan, D Cox, M Hasegawa-Johnson	International Conference on Machine Learning, 8650-8660, 2021	36	2021
Unsupervised text-to-speech synthesis by unsupervised automatic speech recognition	J Ni, L Wang, H Gao, K Qian, Y Zhang, S Chang, M Hasegawa-Johnson	arXiv preprint arXiv:2203.15796, 2022	28	2022
SpeechSplit2.0: Unsupervised speech disentanglement for voice conversion without tuning autoencoder bottlenecks	CH Chan, K Qian, Y Zhang, M Hasegawa-Johnson	ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022	25	2022
WavPrompt: Towards few-shot spoken language understanding with frozen language models	H Gao, J Ni, K Qian, Y Zhang, S Chang, M Hasegawa-Johnson	arXiv preprint arXiv:2203.15863, 2022	20	2022
Zero-Shot Cross-Lingual Phonetic Recognition with External Language Embedding	H Gao, J Ni, Y Zhang, K Qian, S Chang, M Hasegawa-Johnson	Interspeech, 1304-1308, 2021	13	2021
Physics-driven diffusion models for impact sound synthesis from videos	K Su, K Qian, E Shlizerman, A Torralba, C Gan	Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2023	8	2023
Continuous CNN for nonuniform time series	H Shi, Y Zhang, H Wu, S Chang, K Qian, M Hasegawa-Johnson, J Zhao	ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021	7*	2021
Speech denoising with auditory models	MR Saddler, A Francl, J Feather, K Qian, Y Zhang, JH McDermott	arXiv preprint arXiv:2011.10706, 2020	7*	2020
On the interplay between sparsity, naturalness, intelligibility, and prosody in speech synthesis	CIJ Lai, E Cooper, Y Zhang, S Chang, K Qian, YL Liao, YS Chuang, ...	ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022	4	2022
Application of local binary patterns for SVM based stop consonant detection	K Qian, Y Zhang, M Hasegawa-Johnson	Proc. Speech Prosody, 1114-1118, 2016	4	2016
Losses can be blessings: Routing self-supervised speech representations towards efficient multilingual and multitask speech processing	Y Fu, Y Zhang, K Qian, Z Ye, Z Yu, CIJ Lai, C Lin	Advances in Neural Information Processing Systems 35, 20902-20920, 2022	3	2022
Master-ASR: achieving multilingual scalability and low-resource adaptation in ASR with modular learning	Z Yu, Y Zhang, K Qian, C Wan, Y Fu, Y Zhang, YC Lin	International Conference on Machine Learning, 40475-40487, 2023	2	2023
Deep generative models for speech editing	K Qian	University of Illinois at Urbana-Champaign, 2020	1	2020
