Lihong Li (李力鸿)

Citata da

	Tutte	Dal 2019
Citazioni	26113	18113
Indice H	66	57
i10-index	101	86

3700

1850

925

2775

2008200920102011201220132014201520162017201820192020202120222023202495 184 214 340 437 546 594 835 990 1228 1762 2349 3040 3561 3424 3694 2044

Accesso pubblico

Visualizza tutto

14 articoli

0 articoli

Disponibili

Non disponibili

In base ai mandati di finanziamento

Coautori

John LangfordMicrosoft Research New YorkEmail verificata su hunch.net
Michael LittmanBrown UniversityEmail verificata su brown.edu
Jianfeng GaoMicrosoft Research, RedmondEmail verificata su microsoft.com
Wei Chu（褚崴）InfEmail verificata su gatsby.ucl.ac.uk
Li DengChief AI Officer, Citadel (former)Email verificata su ieee.org
Robert SchapireMicrosoft ResearchEmail verificata su microsoft.com
Bo DaiGoogle Brain & Georgia TechEmail verificata su google.com
Denny ZhouResearch Scientist, Google DeepMindEmail verificata su google.com
Jianshu ChenPrincipal Scientist, AmazonEmail verificata su ucla.edu
Asli CelikyilmazResearcher @ FAIR at Meta AIEmail verificata su ieee.org
Dale SchuurmansUniversity of Alberta, Google DeepMindEmail verificata su cs.ualberta.ca
Zachary C. LiptonRaj Reddy Associate Professor of Machine Learning @ Carnegie Mellon University; CTO + CSO @ AbridgeEmail verificata su cmu.edu
Yun-Nung (Vivian) ChenNational Taiwan UniversityEmail verificata su ieee.org
Emma BrunskillAssociate Professor of Computer Science, Stanford UniversityEmail verificata su cs.stanford.edu
Faisal Ahmed, PhDMicrosoftEmail verificata su microsoft.com
Thomas J. WalshSony AIEmail verificata su sony.com
Miroslav DudikMicrosoft ResearchEmail verificata su microsoft.com
Xiujun LiUniversity of Washington / AppleEmail verificata su cs.washington.edu
Chong WangAppleEmail verificata su cs.princeton.edu
Csaba SzepesvariDeepMind & University of AlbertaEmail verificata su cs.ualberta.ca

Segui

Lihong Li (李力鸿)

Amazon

Email verificata su amazon.com - Home page

Reinforcement Learning Machine Learning Artificial Intelligence


Titolo Ordina per citazioni Ordina per anno Ordina per titolo	Citata da Citata da	Anno
A contextual-bandit approach to personalized news article recommendation L Li, W Chu, J Langford, RE Schapire Proceedings of the 19th international conference on World wide web, 661-670, 2010	3380	2010
An empirical evaluation of thompson sampling O Chapelle, L Li Advances in neural information processing systems 24, 2011	1806	2011
Parallelized stochastic gradient descent M Zinkevich, M Weimer, L Li, A Smola Advances in neural information processing systems 23, 2010	1750	2010
Contextual bandits with linear payoff functions W Chu, L Li, L Reyzin, R Schapire Proceedings of the Fourteenth International Conference on Artificial …, 2011	1224	2011
Neural approaches to conversational AI J Gao, M Galley, L Li The 41st international ACM SIGIR conference on research & development in …, 2018	908	2018
Doubly robust policy evaluation and learning M Dudík, J Langford, L Li arXiv preprint arXiv:1103.4601, 2011	872	2011
Doubly Robust Policy Evaluation and Learning M Dudık, J Langford, L Li	872*
Doubly robust off-policy value evaluation for reinforcement learning N Jiang, L Li International conference on machine learning, 652-661, 2016	815	2016
Unbiased offline evaluation of contextual-bandit-based news article recommendation algorithms L Li, W Chu, J Langford, X Wang Proceedings of the fourth ACM international conference on Web search and …, 2011	672	2011
PAC model-free reinforcement learning AL Strehl, L Li, E Wiewiora, J Langford, ML Littman Proceedings of the 23rd international conference on Machine learning, 881-888, 2006	634	2006
Towards a unified theory of state abstraction for MDPs. L Li, TJ Walsh, ML Littman AI&M 1 (2), 3, 2006	604	2006
Sparse Online Learning via Truncated Gradient. J Langford, L Li, T Zhang Journal of Machine Learning Research 10 (3), 2009	593	2009
Taming the monster: A fast and simple algorithm for contextual bandits A Agarwal, D Hsu, S Kale, J Langford, L Li, R Schapire International conference on machine learning, 1638-1646, 2014	568	2014
Towards end-to-end reinforcement learning of dialogue agents for information access B Dhingra, L Li, X Li, J Gao, YN Chen, F Ahmed, L Deng arXiv preprint arXiv:1609.00777, 2016	530*	2016
Doubly robust policy evaluation and optimization M Dudík, D Erhan, J Langford, L Li	475	2014
End-to-end task-completion neural dialogue systems X Li, YN Chen, L Li, J Gao, A Celikyilmaz arXiv preprint arXiv:1703.01008, 2017	457	2017
Neuro-symbolic program synthesis E Parisotto, A Mohamed, R Singh, L Li, D Zhou, P Kohli arXiv preprint arXiv:1611.01855, 2016	407	2016
Reinforcement Learning in Finite MDPs: PAC Analysis. AL Strehl, L Li, ML Littman Journal of Machine Learning Research 10 (11), 2009	377	2009
Breaking the curse of horizon: Infinite-horizon off-policy estimation Q Liu, L Li, Z Tang, D Zhou Advances in neural information processing systems 31, 2018	376	2018
Contextual bandit algorithms with supervised learning guarantees A Beygelzimer, J Langford, L Li, L Reyzin, RE Schapire Arxiv preprint arXiv:1002.4058, 2010	361	2010

Il sistema al momento non può eseguire l'operazione. Riprova più tardi.

Articoli 1–20

Citazioni per anno

Citazioni duplicate

Citazioni unite

Aggiungi coautoriCoautori

Segui

Citata da

Coautori