Doina Precup

Citée par

	Toutes	Depuis 2019
Citations	32613	23533
indice h	63	54
indice i10	234	184

6000

3000

1500

4500

20022003200420052006200720082009201020112012201320142015201620172018201920202021202220232024118 119 179 217 246 307 327 331 320 380 408 477 590 610 875 1085 1896 2611 3405 4321 5250 5918 2000

Accès public

Tout afficher

61 articles

5 articles

disponibles

non disponibles

Sur la base des exigences liées au financement

Coauteurs

Joelle PineauSchool of Computer Science, McGill University; FAIR, Meta AI; MilaAdresse e-mail validée de cs.mcgill.ca
Satinder SinghGoogle DeepMind / U. of MichiganAdresse e-mail validée de umich.edu
Prakash PanangadenProfessor of Computer Science, McGill UniversityAdresse e-mail validée de cs.mcgill.ca
Tal ArbelProfessor of Electrical & Computer Engineering, McGill UniversityAdresse e-mail validée de cim.mcgill.ca
Riashat IslamResearch ScientistAdresse e-mail validée de dreamfold.ai
Andre BarretoResearch Scientist, Google DeepMindAdresse e-mail validée de google.com
Emmanuel BengioMcGill UniversityAdresse e-mail validée de mail.mcgill.ca
Yoshua BengioProfessor of computer science, University of Montreal, Mila, IVADO, CIFARAdresse e-mail validée de umontreal.ca
Shie MannorProfessor of Electrical Engineering @ Technion & Researcher @ Nvidia ResearchAdresse e-mail validée de technion.ac.il
David SilverDeepMind, UCLAdresse e-mail validée de google.com
Jean HarbOpenAIAdresse e-mail validée de openai.com
Guilherme Sant AnnaProfessor (Full) of Pediatrics, McGill UniversityAdresse e-mail validée de mcgill.ca
Philip WarrickPerigen Inc.Adresse e-mail validée de perigen.com
Csaba SzepesvariDeepMind & University of AlbertaAdresse e-mail validée de cs.ualberta.ca
Norm FernsAdresse e-mail validée de normferns.com
Jordan FrankSoftware Engineer, FacebookAdresse e-mail validée de cs.mcgill.ca
Amir-massoud FarahmandUniversity of TorontoAdresse e-mail validée de cs.toronto.edu
Pablo Samuel CastroGoogleAdresse e-mail validée de google.com
Hamid MaeiNetflixAdresse e-mail validée de netflix.com
Borja BalleDeepMindAdresse e-mail validée de google.com

Suivre

Doina Precup

DeepMind and McGill University

Adresse e-mail validée de cs.mcgill.ca

Artificial Intelligence machine learning reinforcement learning


Titre Trier par citations Trier par année Trier par titre	Citée par Citée par	Année
The multimodal brain tumor image segmentation benchmark (BRATS) BH Menze, A Jakab, S Bauer, J Kalpathy-Cramer, K Farahani, J Kirby, ... IEEE transactions on medical imaging 34 (10), 1993-2024, 2014	5428	2014
Between MDPs and semi-MDPs: A framework for temporal abstraction in reinforcement learning RS Sutton, D Precup, S Singh Artificial intelligence 112 (1-2), 181-211, 1999	4305	1999
Deep reinforcement learning that matters P Henderson, R Islam, P Bachman, J Pineau, D Precup, D Meger Proceedings of the AAAI conference on artificial intelligence 32 (1), 2018	2243	2018
Off-policy deep reinforcement learning without exploration S Fujimoto, D Meger, D Precup International conference on machine learning, 2052-2062, 2019	1378	2019
The option-critic architecture PL Bacon, J Harb, D Precup Proceedings of the AAAI conference on artificial intelligence 31 (1), 2017	1186	2017
Eligibility traces for off-policy policy evaluation D Precup Computer Science Department Faculty Publication Series, 80, 2000	919	2000
Fast gradient-descent methods for temporal-difference learning with linear function approximation RS Sutton, HR Maei, D Precup, S Bhatnagar, D Silver, C Szepesvári, ... Proceedings of the 26th annual international conference on machine learning …, 2009	698	2009
Learning with pseudo-ensembles P Bachman, O Alsharif, D Precup Advances in neural information processing systems 27, 2014	632	2014
Horde: A scalable real-time architecture for learning knowledge from unsupervised sensorimotor interaction RS Sutton, J Modayil, M Delp, T Degris, PM Pilarski, A White, D Precup The 10th International Conference on Autonomous Agents and Multiagent …, 2011	578	2011
Algorithms for multi-armed bandit problems V Kuleshov, D Precup arXiv preprint arXiv:1402.6028, 2014	534	2014
Reward is enough D Silver, S Singh, D Precup, RS Sutton Artificial Intelligence 299, 103535, 2021	515	2021
Off-policy temporal-difference learning with function approximation D Precup, RS Sutton, S Dasgupta ICML, 417-424, 2001	457	2001
Learning options in reinforcement learning M Stolle, D Precup Abstraction, Reformulation, and Approximation: 5th International Symposium …, 2002	451	2002
Exploring uncertainty measures in deep networks for multiple sclerosis lesion detection and segmentation T Nair, D Precup, DL Arnold, T Arbel Medical image analysis 59, 101557, 2020	445	2020
Temporal abstraction in reinforcement learning D Precup University of Massachusetts Amherst, 2000	388	2000
Metrics for Finite Markov Decision Processes. N Ferns, P Panangaden, D Precup UAI 4, 162-169, 2004	336	2004
Convergent temporal-difference learning with arbitrary smooth function approximation H Maei, C Szepesvari, S Bhatnagar, D Precup, D Silver, RS Sutton Advances in neural information processing systems 22, 2009	329	2009
Conditional computation in neural networks for faster models E Bengio, PL Bacon, J Pineau, D Precup arXiv preprint arXiv:1511.06297, 2015	327	2015
Reproducibility of benchmarked deep reinforcement learning tasks for continuous control R Islam, P Henderson, M Gomrokchi, D Precup arXiv preprint arXiv:1708.04133, 2017	302	2017
Gradient starvation: A learning proclivity in neural networks M Pezeshki, O Kaba, Y Bengio, AC Courville, D Precup, G Lajoie Advances in Neural Information Processing Systems 34, 1256-1272, 2021	236	2021

Le système ne peut pas réaliser cette opération maintenant. Veuillez réessayer plus tard.

Articles 1–20

Nombre de citations par an

Citations en double

Citations fusionnées

Ajouter les coauteursCoauteurs

Suivre

Citée par

Coauteurs