Olivier Pietquin
Olivier Pietquin
Google Brain (On leave of Professor at University of Lille - CRIStAL - SequeL team)
Adresse e-mail validée de univ-lille.fr - Page d'accueil
Titre
Citée par
Citée par
Année
Deep q-learning from demonstrations
T Hester, M Vecerik, O Pietquin, M Lanctot, T Schaul, B Piot, D Horgan, ...
Thirty-second AAAI conference on artificial intelligence, 2018
645*2018
Noisy networks for exploration
M Fortunato, MG Azar, B Piot, J Menick, I Osband, A Graves, V Mnih, ...
arXiv preprint arXiv:1706.10295, 2017
4742017
Leveraging demonstrations for deep reinforcement learning on robotics problems with sparse rewards
M Vecerik, T Hester, J Scholz, F Wang, O Pietquin, B Piot, N Heess, ...
arXiv preprint arXiv:1707.08817, 2017
3082017
Modulating early visual processing by language
H De Vries, F Strub, J Mary, H Larochelle, O Pietquin, A Courville
arXiv preprint arXiv:1707.00683, 2017
2802017
Guesswhat?! visual object discovery through multi-modal dialogue
H De Vries, F Strub, S Chandar, O Pietquin, H Larochelle, A Courville
Proceedings of the IEEE Conference on Computer Vision and Pattern …, 2017
2802017
A probabilistic framework for dialog simulation and optimal strategy learning
O Pietquin, T Dutoit
IEEE Transactions on Audio, Speech, and Language Processing 14 (2), 589-599, 2006
1942006
Listen and translate: A proof of concept for end-to-end speech-to-text translation
A Bérard, O Pietquin, C Servan, L Besacier
arXiv preprint arXiv:1612.01744, 2016
1522016
A framework for unsupervised learning of dialogue strategies
O Pietquin
Presses univ. de Louvain, 2005
1412005
Machine learning for spoken dialogue systems
O Lemon, O Pietquin
Eighth Annual Conference of the International Speech Communication Association, 2007
1312007
End-to-end automatic speech translation of audiobooks
A Bérard, L Besacier, AC Kocabiyikoglu, O Pietquin
2018 IEEE International Conference on Acoustics, Speech and Signal …, 2018
1182018
A survey on metrics for the evaluation of user simulations
O Pietquin, H Hastie
The knowledge engineering review 28 (1), 59-73, 2013
1182013
Sample-efficient batch reinforcement learning for dialogue management optimization
O Pietquin, M Geist, S Chandramohan, H Frezza-Buet
ACM Transactions on Speech and Language Processing (TSLP) 7 (3), 1-21, 2011
1012011
A theory of regularized markov decision processes
M Geist, B Scherrer, O Pietquin
International Conference on Machine Learning, 2160-2169, 2019
1002019
Kalman temporal differences
M Geist, O Pietquin
Journal of artificial intelligence research 39, 483-532, 2010
982010
Algorithmic Survey of Parametric Value Function Approximation
M Geist, O Pietquin
Transactions on Neural Networks and Learning Systems 24 (6), 845-867, 2013
97*2013
User simulation in dialogue systems using inverse reinforcement learning
S Chandramohan, M Geist, F Lefevre, O Pietquin
Twelfth annual conference of the international speech communication association, 2011
972011
End-to-end optimization of goal-driven and visually grounded dialogue systems
F Strub, H De Vries, J Mary, B Piot, A Courville, O Pietquin
arXiv preprint arXiv:1703.05423, 2017
922017
Inverse reinforcement learning through structured classification
E Klein, M Geist, B Piot, O Pietquin
NIPS 2012, 1-9, 2012
912012
Data-driven methods for adaptive spoken dialogue systems: Computational learning for conversational interfaces
O Lemon, O Pietquin
Springer Science & Business Media, 2012
862012
Laugh-aware virtual agent and its impact on user amusement
R Niewiadomski, J Hofmann, J Urbain, T Platt, J Wagner, P Bilal, T Ito, ...
University of Zurich, 2013
642013
Le système ne peut pas réaliser cette opération maintenant. Veuillez réessayer plus tard.
Articles 1–20