Victoria Krakovna

Cited by

	All	Since 2019
Citations	1445	1356
h-index	12	12
i10-index	15	14

Citations per year

Year	Citations
2017	9
2018	61
2019	108
2020	157
2021	162
2022	186
2023	254
2024	486

Public access

1 article available
0 articles not available

Based on funding mandates

Co-authors

Tom Everitt - Staff Research Scientist at Google DeepMind - Verified email at google.com
Laurent Orseau - Research Scientist at Google DeepMind - Verified email at google.com
Ramana Kumar - DeepMind - Verified email at cl.cam.ac.uk
Miljan Martic - DeepMind - Verified email at google.com
Jonathan Uesato - Verified email at mit.edu
Marcus Hutter - Researcher@DeepMind & Professor at ANU - Verified email at anu.edu.au
Zachary Kenton - Google DeepMind - Verified email at google.com
Pedro A. Ortega - Artificial Intelligence & Machine Learning - Verified email at adaptiveagents.org
Jan Leike - OpenAI - Verified email at openai.com
Richard Ngo - OpenAI - Verified email at openai.com
Matthew Rahtz - Google DeepMind - Verified email at google.com
Finale Doshi-Velez - Professor, Harvard - Verified email at seas.harvard.edu
Vladimir Mikulik - DeepMind - Verified email at google.com
Rohin Shah - Research Scientist, Google DeepMind - Verified email at deepmind.com
Vikrant Varma - DeepMind - Verified email at deepmind.com
Mary Phuong - IST Austria - Verified email at ist.ac.at
Janos Kramar - DeepMind - Verified email at google.com
Gerald Penn - Professor of Computer Science, University of Toronto - Verified email at cs.toronto.edu
Jun S Liu - Professor of statistics, Harvard University - Verified email at stat.harvard.edu
Andis Draguns - IMCS UL, MATS - Verified email at lumii.lv


Victoria Krakovna

Other names: Viktoriya Krakovna

Senior Research Scientist at DeepMind

Verified email at google.com - Homepage

AI Alignment, Agent Incentives, Interpretability, Reinforcement Learning, Machine Learning


Title	Cited by	Year
Gemini: a family of highly capable multimodal models - G Team, R Anil, S Borgeaud, Y Wu, JB Alayrac, J Yu, R Soricut, ... - arXiv preprint arXiv:2312.11805, 2023	443	2023
AI safety gridworlds - J Leike, M Martic, V Krakovna, PA Ortega, T Everitt, A Lefrancq, L Orseau, ... - arXiv preprint arXiv:1711.09883, 2017	327	2017
Reinforcement Learning with a Corrupted Reward Channel - T Everitt, V Krakovna, L Orseau, M Hutter, S Legg - IJCAI AI & Autonomy, 2017	115	2017
Specification gaming: the flip side of AI ingenuity - V Krakovna, J Uesato, V Mikulik, M Rahtz, T Everitt, R Kumar, Z Kenton, ... - https://deepmind.com/blog/article/Specification-gaming-the-flip-side-of-AI …, 2020	98*	2020
Reward tampering problems and solutions in reinforcement learning: A causal influence diagram perspective - T Everitt, M Hutter, R Kumar, V Krakovna - Synthese 198 (Suppl 27), 6435-6467, 2021	83	2021
Increasing the Interpretability of Recurrent Neural Networks Using Hidden Markov Models - V Krakovna, F Doshi-Velez - ICML Workshop on Human Interpretability (WHI 2016), arXiv preprint arXiv …, 2016	82	2016
Penalizing side effects using stepwise relative reachability - V Krakovna, L Orseau, R Kumar, M Martic, S Legg - arXiv preprint arXiv:1806.01186, 2018	57	2018
Goal misgeneralization: why correct specifications aren't enough for correct goals - R Shah, V Varma, R Kumar, M Phuong, V Krakovna, J Uesato, Z Kenton - arXiv preprint arXiv:2210.01790, 2022	45	2022
Avoiding Side Effects By Considering Future Tasks - V Krakovna, L Orseau, R Ngo, M Martic, S Legg - NeurIPS 2020, arXiv preprint arXiv:2010.07877, 2020	43	2020
Specification gaming examples in AI - V Krakovna - tinyurl.com/specification-gaming, 2018	37*	2018
Modeling AGI safety frameworks with causal influence diagrams - T Everitt, R Kumar, V Krakovna, S Legg - arXiv preprint arXiv:1906.08663, 2019	23	2019
Measuring and avoiding side effects using relative reachability - V Krakovna, L Orseau, M Martic, S Legg - arXiv preprint arXiv:1806.01186, 2018	18	2018
REALab: An embedded perspective on tampering - R Kumar, J Uesato, R Ngo, T Everitt, V Krakovna, S Legg - arXiv preprint arXiv:2011.08820, 2020	12	2020
Power-seeking can be probable and predictive for trained agents - V Krakovna, J Kramar - arXiv preprint arXiv:2304.06528, 2023	10*	2023
Memory-bounded left-corner unsupervised grammar induction on child-directed input - C Shain, W Bryce, L Jin, V Krakovna, F Doshi-Velez, T Miller, W Schuler, ... - Proceedings of COLING 2016, the 26th International Conference on …, 2016	10*	2016
Avoiding tampering incentives in deep RL via decoupled approval - J Uesato, R Kumar, V Krakovna, T Everitt, R Ngo, S Legg - arXiv preprint arXiv:2011.08827, 2020	7	2020
Interpretable selection and visualization of features and interactions using bayesian forests - V Krakovna, J Du, JS Liu - Statistics and its Interface 2018 (Volume 11 Number 3), arXiv preprint arXiv …, 2015	6*	2015
A generalized-zero-preserving method for compact encoding of concept lattices - M Skala, V Krakovna, J Kramár, G Penn - Proceedings of the 48th annual meeting of the Association for Computational …, 2010	6	2010
A Minimalistic Approach to Sum-Product Network Learning for Real Applications - V Krakovna, M Looks - ICLR 2016 workshop, arXiv preprint arXiv:1602.04259, 2016	5	2016
Building interpretable models: From Bayesian networks to neural networks - V Krakovna	4	2016
