Gary Wang
Verified email at google.com
Title
Cited by
Year
A long-short term memory recurrent neural network based reinforcement learning controller for office heating ventilation and air conditioning systems
Y Wang, K Velswamy, B Huang
Processes 5 (3), 46, 2017
Cited by 178, 2017
Google USM: Scaling Automatic Speech Recognition Beyond 100 Languages
Y Zhang, W Han, J Qin, Y Wang, A Bapna, Z Chen, N Chen, B Li, ...
arXiv preprint arXiv:2303.01037, 2023
Cited by 109, 2023
Improving Speech Recognition Using Consistent Predictions on Synthesized Speech
G Wang, A Rosenberg, Z Chen, Y Zhang, B Ramabhadran, Y Wu, ...
ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and …, 2020
Cited by 54, 2020
A Novel Approach to Feedback Control with Deep Reinforcement Learning
Y Wang, K Velswamy, B Huang
IFAC-PapersOnLine 51 (18), 31-36, 2018
Cited by 54, 2018
Injecting text in self-supervised speech pretraining
Z Chen, Y Zhang, A Rosenberg, B Ramabhadran, G Wang, P Moreno
2021 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU …, 2021
Cited by 32, 2021
Improving Speech Recognition Using GAN-Based Speech Synthesis and Contrastive Unspoken Text Selection.
Z Chen, A Rosenberg, Y Zhang, G Wang, B Ramabhadran, PJ Moreno
INTERSPEECH, 556-560, 2020
Cited by 32, 2020
Tts4pretrain 2.0: Advancing the use of Text and Speech in ASR Pretraining with Consistency and Contrastive Losses
Z Chen, Y Zhang, A Rosenberg, B Ramabhadran, P Moreno, G Wang
ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022
Cited by 20, 2022
Modular Hybrid Autoregressive Transducer
Z Meng, T Chen, R Prabhavalkar, Y Zhang, G Wang, K Audhkhasi, ...
arXiv preprint arXiv:2210.17049, 2022
Cited by 18, 2022
Accented Speech Recognition: Benchmarking, Pre-training, and Diverse Data
A Aksėnova, Z Chen, CC Chiu, D van Esch, P Golik, W Han, L King, ...
arXiv preprint arXiv:2205.08014, 2022
Cited by 16, 2022
Semi-supervision in ASR: Sequential MixMatch and factorized TTS-based augmentation
Z Chen, A Rosenberg, Y Zhang, H Zen, M Ghodsi, Y Huang, J Emond, ...
Cited by 13, 2021
SCADA: Stochastic, Consistent and Adversarial Data Augmentation to Improve ASR.
G Wang, A Rosenberg, Z Chen, Y Zhang, B Ramabhadran, PJ Moreno
INTERSPEECH, 2832-2836, 2020
Cited by 9, 2020
Deep text-to-speech system with seq2seq model
G Wang
arXiv preprint arXiv:1903.07398, 2019
Cited by 9, 2019
Understanding Shared Speech-Text Representations
G Wang, K Kastner, A Bapna, Z Chen, A Rosenberg, B Ramabhadran, ...
ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and …, 2023
Cited by 7, 2023
Virtuoso: Massive Multilingual Speech-Text Joint Semi-Supervised Learning for Text-To-Speech
T Saeki, H Zen, Z Chen, N Morioka, G Wang, Y Zhang, A Bapna, ...
arXiv preprint arXiv:2210.15447, 2022
Cited by 7, 2022
Non-Parallel Voice Conversion for ASR Augmentation
G Wang, A Rosenberg, B Ramabhadran, F Biadsy, Y Huang, J Emond, ...
arXiv preprint arXiv:2209.06987, 2022
Cited by 2, 2022
Using Text Injection to Improve Recognition of Personal Identifiers in Speech
Y Blau, R Agrawal, L Madmony, G Wang, A Rosenberg, Z Chen, ...
arXiv preprint arXiv:2308.07393, 2023
Cited by 1, 2023
G-Augment: Searching for the Meta-Structure of Data Augmentation Policies for ASR
G Wang, ED Cubuk, A Rosenberg, S Cheng, RJ Weiss, B Ramabhadran, ...
2022 IEEE Spoken Language Technology Workshop (SLT), 23-30, 2023
Cited by 1, 2023
Supervised and Unsupervised Training with Contrastive Loss Over Sequences
A Rosenberg, B Ramabhadran, Z Chen, G Wang, Y Zhang, J Emond
US Patent App. 17/655,903, 2022
Cited by 1, 2022
Extending Multilingual Speech Synthesis to 100+ Languages without Transcribed Data
T Saeki, G Wang, N Morioka, I Elias, K Kastner, A Rosenberg, ...
arXiv preprint arXiv:2402.18932, 2024
2024
High-precision Voice Search Query Correction via Retrievable Speech-text Embedings
C Li, G Wang, K Kastner, H Su, A Chen, A Rosenberg, Z Chen, Z Wu, ...
arXiv preprint arXiv:2401.04235, 2024
2024