Zheng Wen

Cited by

	All	Since 2019
Citations	5155	4338
h-index	31	30
i10-index	54	50

Citations per year: 2014: 28, 2015: 68, 2016: 146, 2017: 184, 2018: 330, 2019: 502, 2020: 738, 2021: 847, 2022: 985, 2023: 924, 2024: 340

Public access

8 articles available, 0 articles not available

Based on funding mandates

Coauthors

Branislav Kveton, Amazon (verified email at amazon.com)
Benjamin Van Roy, Stanford University (verified email at stanford.edu)
Ian Osband, OpenAI (verified email at openai.com)
Csaba Szepesvari, DeepMind & University of Alberta (verified email at cs.ualberta.ca)
Azin Ashkan, Google (verified email at uwaterloo.ca)
Xiuyuan Lu, Google DeepMind (verified email at google.com)
Yasin Abbasi Yadkori, DeepMind (verified email at google.com)
Vikranth Dwaracherla, DeepMind (verified email at google.com)
Morteza Ibrahimi, Stanford University (verified email at stanford.edu)
Mohammad Ghavamzadeh, Amazon (verified email at amazon.com)
Sharan Vaswani, Simon Fraser University (verified email at sfu.ca)
Daniel Russo, Columbia University (verified email at gsb.columbia.edu)
Michal Valko, Llama @ Meta Paris & Inria & MVA - Ex: Gemini and BYOL @ Google DeepMind (verified email at meta.com)
Seyed Mohammad Asghari, Research Engineer, DeepMind (verified email at google.com)
Brian Eriksson, Adobe (verified email at adobe.com)
Botao Hao, DeepMind (verified email at google.com)
S Muthukrishnan, Rutgers Univ (verified email at cs.rutgers.edu)
Sumeet Katariya, Amazon (verified email at wisc.edu)
Shlomo Berkovsky, Macquarie University (verified email at mq.edu.au)
Claire Vernade, University of Tuebingen (verified email at uni-tuebingen.de)


Zheng Wen

Google DeepMind

Verified email at google.com

Artificial Intelligence, Reinforcement Learning, Operations Research, Large Language Models


Title. Authors. Venue	Cited by	Year
A Tutorial on Thompson Sampling. D Russo, B Van Roy, A Kazerouni, I Osband, Z Wen. arXiv, https://arxiv.org/pdf/1707.02038.pdf	1047*
Generalization and exploration via randomized value functions. I Osband, B Van Roy, Z Wen. International Conference on Machine Learning, 2377-2386, 2016	327	2016
Deep exploration via randomized value functions. I Osband, B Van Roy, DJ Russo, Z Wen. Journal of Machine Learning Research 20 (124), 1-62, 2019	320	2019
Cascading bandits: Learning to rank in the cascade model. B Kveton, C Szepesvári, Z Wen, A Ashkan. ICML, 2015	306	2015
Tight Regret Bounds for Stochastic Combinatorial Semi-Bandits. B Kveton, Z Wen, A Ashkan, C Szepesvari. International Conference on Artificial Intelligence and Statistics (AISTATS …, 2014	305	2014
Optimal demand response using device based reinforcement learning. Z Wen, D O'Neill, HR Maei. IEEE Transactions on Smart Grid, 2014	302	2014
Online influence maximization under independent cascade model with semi-bandit feedback. Z Wen, B Kveton, M Valko, S Vaswani. Advances in neural information processing systems 30, 2017	143*	2017
Nearly optimal adaptive procedure with change detection for piecewise-stationary bandit. Y Cao, Z Wen, B Kveton, Y Xie. The 22nd International Conference on Artificial Intelligence and Statistics …, 2019	128*	2019
Cascading bandits for large-scale recommendation problems. S Zong, H Ni, K Sung, NR Ke, Z Wen, B Kveton. arXiv preprint arXiv:1603.05359, 2016	125	2016
Combinatorial cascading bandits. B Kveton, Z Wen, A Ashkan, C Szepesvari. Advances in Neural Information Processing Systems 28, 2015	125	2015
Matroid bandits: Fast combinatorial optimization with learning. B Kveton, Z Wen, A Ashkan, H Eydgahi, B Eriksson. UAI 2014, 2014	125	2014
Efficient learning in large-scale combinatorial semi-bandits. Z Wen, B Kveton, A Ashkan. http://jmlr.org/proceedings/papers/v37/wen15.html, 2014	108	2014
Optimal Greedy Diversity for Recommendation. A Ashkan, B Kveton, S Berkovsky, Z Wen	107	2015
Online learning to rank in stochastic click models. M Zoghi, T Tunys, M Ghavamzadeh, B Kveton, C Szepesvari, Z Wen. International conference on machine learning, 4199-4208, 2017	105	2017
DCM Bandits: Learning to Rank with Multiple Clicks. S Katariya, B Kveton, C Szepesvári, Z Wen. arXiv, 2016	88	2016
Efficient Exploration and Value Function Generalization in Deterministic Systems. Z Wen, B Van Roy. Advances in Neural Information Processing Systems, 3021-3029, 2013	86	2013
Model-independent online learning for influence maximization. S Vaswani, B Kveton, Z Wen, M Ghavamzadeh, LVS Lakshmanan, ... International conference on machine learning, 3530-3539, 2017	81*	2017
Epistemic neural networks. I Osband, Z Wen, SM Asghari, V Dwaracherla, M Ibrahimi, X Lu, ... Advances in Neural Information Processing Systems 36, 2024	78	2024
Stochastic rank-1 bandits. S Katariya, B Kveton, C Szepesvari, C Vernade, Z Wen. Artificial Intelligence and Statistics, 392-401, 2017	74	2017
Garbage in, reward out: Bootstrapping exploration in multi-armed bandits. B Kveton, C Szepesvari, S Vaswani, Z Wen, T Lattimore, M Ghavamzadeh. International Conference on Machine Learning, 3601-3610, 2019	72	2019
