Kyu Jeong Han
Amazon Web Services (AWS)
A review of speaker diarization: Recent advances with deep learning
TJ Park, N Kanda, D Dimitriadis, KJ Han, S Watanabe, S Narayanan
Computer Speech & Language 72, 101317, 2022
Automatic speaker age and gender recognition using acoustic and prosodic level information fusion
M Li, KJ Han, S Narayanan
Computer Speech & Language 27 (1), 151-167, 2013
Auto-tuning spectral clustering for speaker diarization using normalized maximum eigengap
TJ Park, KJ Han, M Kumar, S Narayanan
IEEE Signal Processing Letters 27, 381-385, 2019
The CAPIO 2017 conversational speech recognition system
KJ Han, A Chandrashekaran, J Kim, I Lane
arXiv preprint arXiv:1801.00059, 2017
Strategies to improve the robustness of agglomerative hierarchical clustering under data source variation for speaker diarization
KJ Han, S Kim, SS Narayanan
IEEE Transactions on Audio, Speech, and Language Processing 16 (8), 1590-1601, 2008
State-of-the-art speech recognition using multi-stream self-attention with dilated 1D convolutions
KJ Han, R Prieto, T Ma
2019 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU), 54-61, 2019
E-Branchformer: Branchformer with enhanced merging for speech recognition
K Kim, F Wu, Y Peng, J Pan, P Sridhar, KJ Han, S Watanabe
2022 IEEE Spoken Language Technology Workshop (SLT), 84-91, 2023
Robust language identification using convolutional neural network features.
S Ganapathy, KJ Han, S Thomas, MK Omar, M Van Segbroeck, ...
Interspeech, 1846-1850, 2014
A robust stopping criterion for agglomerative hierarchical clustering in a speaker diarization system.
KJ Han, SS Narayanan
Interspeech, 1853-1856, 2007
SLUE: New benchmark tasks for spoken language understanding evaluation on natural speech
S Shon, A Pasad, F Wu, P Brusco, Y Artzi, K Livescu, KJ Han
ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022
Combining five acoustic level modeling methods for automatic speaker age and gender recognition.
M Li, CS Jung, KJ Han
INTERSPEECH, 2826-2829, 2010
Multistream CNN for robust acoustic modeling
KJ Han, J Pan, VKN Tadala, T Ma, D Povey
ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021
Deep Learning-Based Telephony Speech Recognition in the Wild
KJ Han, S Hahm, BH Kim, J Kim, IR Lane
INTERSPEECH, 1323-1327, 2017
Performance-efficiency trade-offs in unsupervised pre-training for speech recognition
F Wu, K Kim, J Pan, KJ Han, KQ Weinberger, Y Artzi
ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022
Speaker diarization with lexical information
TJ Park, KJ Han, J Huang, X He, B Zhou, P Georgiou, S Narayanan
arXiv preprint arXiv:2004.06756, 2020
ASAPP-ASR: Multistream CNN and self-attentive SRU for SOTA speech recognition
J Pan, J Shapiro, J Wohlwend, KJ Han, T Lei, T Ma
arXiv preprint arXiv:2005.10469, 2020
Identifying a driver of a vehicle
SV Myers, S Elwart, WJ Talamonti, JT Mullen, ZD Nelson, T Smith, ...
US Patent 9,707,911, 2017
Wav2Seq: Pre-training speech-to-text encoder-decoder models using pseudo languages
F Wu, K Kim, S Watanabe, KJ Han, R McDonald, KQ Weinberger, Y Artzi
ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and …, 2023
Agglomerative hierarchical speaker clustering using incremental Gaussian mixture cluster modeling.
KJ Han, SS Narayanan
Interspeech, 20-23, 2008
Novel inter-cluster distance measure combining GLR and ICR for improved agglomerative hierarchical speaker clustering
KJ Han, SS Narayanan
2008 IEEE International Conference on Acoustics, Speech and Signal …, 2008