Kamil Pokora
Kamil Pokora
Applied Scientist at Amazon
Verified email at
Cited by
Cited by
Non-autoregressive TTS with explicit duration modelling for low-resource highly expressive speech
R Shah, K Pokora, A Ezzerg, V Klimkov, G Huybrechts, B Putrycz, ...
arXiv preprint arXiv:2106.12896, 2021
Creating new voices using normalizing flows
P Bilinski, T Merritt, A Ezzerg, K Pokora, S Cygert, K Yanagisawa, ...
arXiv preprint arXiv:2312.14569, 2023
Text-free non-parallel many-to-many voice conversion using normalising flow
T Merritt, A Ezzerg, P Biliński, M Proszewska, K Pokora, R Barra-Chicote, ...
ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022
Varying speaking styles with neural textto-speech
T Wood, T Merritt
Amazon Science, 2018
Enhancing audio quality for expressive Neural Text-to-Speech
A Ezzerg, A Gabrys, B Putrycz, D Korzekwa, D Saez-Trigueros, ...
arXiv preprint arXiv:2108.06270, 2021
Remap, warp and attend: Non-parallel many-to-many accent conversion with normalizing flows
A Ezzerg, T Merritt, K Yanagisawa, P Bilinski, M Proszewska, K Pokora, ...
2022 IEEE Spoken Language Technology Workshop (SLT), 984-990, 2023
On granularity of prosodic representations in expressive text-to-speech
M Babiański, K Pokora, R Shah, R Sienkiewicz, D Korzekwa, V Klimkov
2022 IEEE Spoken Language Technology Workshop (SLT), 892-899, 2023
Cross-lingual knowledge distillation via flow-based voice conversion for robust polyglot text-to-speech
D Piotrowski, R Korzeniowski, A Falai, S Cygert, K Pokora, G Tinchev, ...
International Conference on Neural Information Processing, 252-264, 2023
Comparing normalizing flows and diffusion models for prosody and acoustic modelling in text-to-speech
G Zhang, T Merritt, MS Ribeiro, B Tura-Vecino, K Yanagisawa, K Pokora, ...
arXiv preprint arXiv:2307.16679, 2023
The system can't perform the operation now. Try again later.
Articles 1–9