Jean Harb

Citée par

	Toutes	Depuis 2019
Citations	6241	5724
indice h	9	9
indice i10	8	8

1700

850

425

1275

2017201820192020202120222023202479 264 535 799 997 1296 1667 429

Accès public

Tout afficher

2 articles

0 article

disponibles

non disponibles

Sur la base des exigences liées au financement

Coauteurs

Doina PrecupDeepMind and McGill UniversityAdresse e-mail validée de cs.mcgill.ca
Pierre-Luc BaconUniversity of MontrealAdresse e-mail validée de mila.quebec
Pieter AbbeelUC Berkeley | CovariantAdresse e-mail validée de cs.berkeley.edu
Yi WuInstitute for Interdisciplinary Information Sciences, Tsinghua UniversityAdresse e-mail validée de mail.tsinghua.edu.cn
Ryan LoweOpenAIAdresse e-mail validée de openai.com
Igor MordatchGoogle DeepMindAdresse e-mail validée de google.com
Aviv TamarTechnionAdresse e-mail validée de technion.ac.il

Suivre

Jean Harb

OpenAI

Adresse e-mail validée de openai.com

Machine Learning Reinforcement Learning Deep Learning


Titre Trier par citations Trier par année Trier par titre	Citée par Citée par	Année
Multi-agent actor-critic for mixed cooperative-competitive environments R Lowe, YI Wu, A Tamar, J Harb, OAI Pieter Abbeel, I Mordatch Advances in neural information processing systems 30, 2017	4543	2017
The option-critic architecture PL Bacon, J Harb, D Precup Proceedings of the AAAI conference on artificial intelligence 31 (1), 2017	1182	2017
Investigating recurrence and eligibility traces in deep Q-networks J Harb, D Precup arXiv preprint arXiv:1704.05495, 2017	231	2017
When waiting is not an option: Learning options with a deliberation cost J Harb, PL Bacon, M Klissarov, D Precup Proceedings of the AAAI Conference on Artificial Intelligence 32 (1), 2018	152	2018
Learnings options end-to-end for continuous action tasks M Klissarov, PL Bacon, J Harb, D Precup arXiv preprint arXiv:1712.00004, 2017	56	2017
Policy evaluation networks J Harb, T Schaul, D Precup, PL Bacon arXiv preprint arXiv:2002.11833, 2020	38	2020
Waymax: An accelerated, data-driven simulator for large-scale autonomous driving research C Gulino, J Fu, W Luo, G Tucker, E Bronstein, Y Lu, J Harb, X Pan, ... Advances in Neural Information Processing Systems 36, 2024	16	2024
The barbados 2018 list of open issues in continual learning T Schaul, H van Hasselt, J Modayil, M White, A White, PL Bacon, J Harb, ... arXiv preprint arXiv:1811.07004, 2018	13	2018
General policy evaluation and improvement by learning to identify few but crucial states F Faccio, A Ramesh, V Herrmann, J Harb, J Schmidhuber arXiv preprint arXiv:2207.01566, 2022	9	2022
Learning options in deep reinforcement learning J Merheb-Harb McGill University (Canada), 2016	1	2016
Asynchronous Advantage Option-Critic with Deliberation Cost J Harb, PL Bacon, D Precup RLDM, 2017		2017

Le système ne peut pas réaliser cette opération maintenant. Veuillez réessayer plus tard.

Articles 1–11

Nombre de citations par an

Citations en double

Citations fusionnées

Ajouter les coauteursCoauteurs

Suivre

Citée par

Coauteurs