Yasin Abbasi Yadkori

Citée par

	Toutes	Depuis 2019
Citations	4746	3956
indice h	26	25
indice i10	48	43

900

450

225

675

201220132014201520162017201820192020202120222023202440 54 83 84 135 153 219 379 647 820 881 866 361

Accès public

Tout afficher

12 articles

0 article

disponibles

non disponibles

Sur la base des exigences liées au financement

Coauteurs

Csaba SzepesvariDeepMind & University of AlbertaAdresse e-mail validée de cs.ualberta.ca
Peter BartlettProfessor, EECS and Statistics, UC BerkeleyAdresse e-mail validée de cs.berkeley.edu
Nevena LazicDeepMindAdresse e-mail validée de google.com
Zheng WenGoogle DeepMindAdresse e-mail validée de google.com
Dávid PálStaff Machine Learning Engineer, InstacartAdresse e-mail validée de instacart.com
Anup B. RaoGeorgia Institute of TechnologyAdresse e-mail validée de gatech.edu
Branislav KvetonAmazonAdresse e-mail validée de amazon.com
Alan MalekMITAdresse e-mail validée de mit.edu
Mohammad GhavamzadehAmazonAdresse e-mail validée de amazon.com
Botao HaoDeepmindAdresse e-mail validée de google.com
Yevgeny SeldinProfessor, Department of Computer Science, University of CopenhagenAdresse e-mail validée de di.ku.dk
Gellért WeiszDeepMind, UCL, gellert@google.comAdresse e-mail validée de google.com
Vishwa VinayCanvaAdresse e-mail validée de acm.org
Ryan A. RossiAdobe ResearchAdresse e-mail validée de adobe.com
Varun KanadeUniversity of OxfordAdresse e-mail validée de cs.ox.ac.uk
Hong ZhangChair Professor, SUSTech; Professor Emeritus, University of AlbertaAdresse e-mail validée de ualberta.ca
Kiana HajebiSenior Applied Scientist, AmazonAdresse e-mail validée de amazon.com
Aldo PacchianoBroad Institute of MIT and HarvardAdresse e-mail validée de broadinstitute.org
S MuthukrishnanRutgers UnivAdresse e-mail validée de cs.rutgers.edu
Tor LattimoreDeepMindAdresse e-mail validée de google.com

Suivre

Yasin Abbasi Yadkori

DeepMind

Adresse e-mail validée de google.com - Page d'accueil

Artificial Intelligence Machine Learning Sequential Decision Problems


Titre Trier par citations Trier par année Trier par titre	Citée par Citée par	Année
Improved algorithms for linear stochastic bandits Y Abbasi-Yadkori, C Szepesvári, D Pal Advances in Neural Information Processing Systems, 2312-2320, 2011	1879	2011
Regret Bounds for the Adaptive Control of Linear Quadratic Systems. Y Abbasi-Yadkori, C Szepesvári COLT, 1-26, 2011	410	2011
Fast approximate nearest-neighbor search with k-nearest neighbor graph K Hajebi, Y Abbasi-Yadkori, H Shahbazi, H Zhang Twenty-Second International Joint Conference on Artificial Intelligence, 2011	273	2011
Online-to-Confidence-Set Conversions and Application to Sparse Stochastic Bandits. Y Abbasi-Yadkori, D Pal, C Szepesvari AISTATS 22, 1-9, 2012	181	2012
Sharp Convergence Rates for Langevin Dynamics in the Nonconvex Setting X Cheng, NS Chatterji, Y Abbasi-Yadkori, PL Bartlett, MI Jordan arXiv preprint arXiv:1805.01648, 2018	176	2018
POLITEX: Regret bounds for policy iteration using expert prediction Y Abbasi-Yadkori, P Bartlett, K Bhatia, N Lazic, C Szepesvári, G Weisz Proceedings of the 36th International Conference on Machine Learning 97 …, 2019	132	2019
POLITEX: Regret Bounds for Policy Iteration Using Expert Prediction Y Abbasi-Yadkori, PL Bartlett, K Bhatia, N Lazic, C Szepesvári, G Weisz	132	2019
Conservative contextual linear bandits A Kazerouni, M Ghavamzadeh, YA Yadkori, B Van Roy Advances in Neural Information Processing Systems, 3910-3919, 2017	114	2017
Model-Free Linear Quadratic Control via Reduction to Expert Prediction Y Abbasi-Yadkori, N Lazic, C Szepesvari The 22nd International Conference on Artificial Intelligence and Statistics, 2019	111*	2019
Online Learning in Markov Decision Processes with Adversarially Chosen Transition Probability Distributions Y Abbasi-Yadkori, P Bartlett, V Kanade, Y Seldin, C Szepesvari Neural Information Processing Systems, 2013	97	2013
Model selection in contextual stochastic bandit problems A Pacchiano, M Phan, Y Abbasi Yadkori, A Rao, J Zimmert, T Lattimore, ... Advances in Neural Information Processing Systems 33, 10328-10337, 2020	90	2020
Online Learning for Linearly Parametrized Control Problems Y Abbasi-Yadkori University of Alberta, 2012	79	2012
Offline Evaluation of Ranking Policies with Click Models S Li, Y Abbasi-Yadkori, B Kveton, S Muthukrishnan, V Vinay, Z Wen Proceedings of the 24th ACM SIGKDD International Conference on Knowledge …, 2018	68	2018
Prediction with limited advice and multiarmed bandits with paid observations Y Seldin, P Bartlett, K Crammer, Y Abbasi-Yadkori International Conference on Machine Learning, 280-287, 2014	68	2014
Bayesian Optimal Control of Smoothly Parameterized Systems Y Abbasi-Yadkori, C Szepesvári Proceedings of the Conference on Uncertainty in Artificial Intelligence, 2015	65*	2015
Online least squares estimation with self-normalized processes: An application to bandit problems Y Abbasi-Yadkori, D Pál, C Szepesvári arXiv preprint arXiv:1102.2670, 2011	65	2011
Evaluation and Analysis of the Performance of the EXP3 Algorithm in Stochastic Environments. Y Seldin, C Szepesvári, P Auer, Y Abbasi-Yadkori EWRL, 103-116, 2012	62	2012
Bootstrapping upper confidence bound B Hao, YA Yadkori, Z Wen, G Cheng Advances in Neural Information Processing Systems, 12123-12133, 2019	56	2019
Bootstrapping upper confidence bound B Hao, YA Yadkori, Z Wen, G Cheng Advances in Neural Information Processing Systems, 12123-12133, 2019	56	2019
Linear Programming for Large-Scale Markov Decision Problems Y Abbasi-Yadkori, P Bartlett, A Malek Proceedings of the 31st International Conference on Machine Learning (ICML …, 2014	54*	2014

Le système ne peut pas réaliser cette opération maintenant. Veuillez réessayer plus tard.

Articles 1–20

Nombre de citations par an

Citations en double

Citations fusionnées

Ajouter les coauteursCoauteurs

Suivre

Citée par

Coauteurs