Victoria Krakovna

Cited by

	All	Since 2019
Citations	1487	1397
h-index	12	12
i10-index	15	14

540

270

135

405

201720182019202020212022202320249 61 108 157 162 186 254 527

Public access

View all

1 article

0 articles

available

not available

Based on funding mandates

Co-authors

Tom EverittStaff Research Scientist at Google DeepMindVerified email at google.com
Laurent OrseauResearch Scientist at Google DeepMindVerified email at google.com
Ramana KumarDeepMindVerified email at cl.cam.ac.uk
Miljan MarticDeepMindVerified email at google.com
Jonathan UesatoVerified email at mit.edu
Marcus HutterResearcher@DeepMind & Professor at ANUVerified email at anu.edu.au
Zachary KentonGoogle DeepMindVerified email at google.com
Pedro A. OrtegaArtificial Intelligence & Machine LearningVerified email at adaptiveagents.org
Jan LeikeOpenAIVerified email at openai.com
Richard NgoOpenAIVerified email at openai.com
Matthew RahtzGoogle DeepMindVerified email at google.com
Finale Doshi-VelezProfessor, HarvardVerified email at seas.harvard.edu
Vladimir MikulikDeepMindVerified email at google.com
Rohin ShahResearch Scientist, Google DeepMindVerified email at deepmind.com
Vikrant VarmaDeepMindVerified email at deepmind.com
Mary PhuongIST AustriaVerified email at ist.ac.at
Janos KramarDeepMindVerified email at google.com
Gerald PennProfessor of Computer Science, University of TorontoVerified email at cs.toronto.edu
Jun S LiuProfessor of statistics, Harvard UniversityVerified email at stat.harvard.edu
Andis DragunsIMCS UL, MATSVerified email at lumii.lv

Victoria Krakovna

Other namesViktoriya Krakovna

Senior Research Scientist at DeepMind

Verified email at google.com - Homepage

AI Alignment Agent Incentives Interpretability Reinforcement Learning Machine Learning


Title Sort by citations Sort by year Sort by title	Cited by Cited by	Year
Gemini: a family of highly capable multimodal models G Team, R Anil, S Borgeaud, Y Wu, JB Alayrac, J Yu, R Soricut, ... arXiv preprint arXiv:2312.11805, 2023	481	2023
AI safety gridworlds J Leike, M Martic, V Krakovna, PA Ortega, T Everitt, A Lefrancq, L Orseau, ... arXiv preprint arXiv:1711.09883, 2017	329	2017
Reinforcement Learning with a Corrupted Reward Channel T Everitt, V Krakovna, L Orseau, M Hutter, S Legg IJCAI AI & Autonomy, 2017	116	2017
Specification gaming: the flip side of AI ingenuity V Krakovna, J Uesato, V Mikulik, M Rahtz, T Everitt, R Kumar, Z Kenton, ... https://deepmind.com/blog/article/Specification-gaming-the-flip-side-of-AI …, 2020	99*	2020
Reward tampering problems and solutions in reinforcement learning: A causal influence diagram perspective T Everitt, M Hutter, R Kumar, V Krakovna Synthese 198 (Suppl 27), 6435-6467, 2021	83	2021
Increasing the Interpretability of Recurrent Neural Networks Using Hidden Markov Models V Krakovna, F Doshi-Velez ICML Workshop on Human Interpretability (WHI 2016), arXiv preprint arXiv …, 2016	82	2016
Penalizing side effects using stepwise relative reachability V Krakovna, L Orseau, R Kumar, M Martic, S Legg arXiv preprint arXiv:1806.01186, 2018	57	2018
Goal misgeneralization: why correct specifications aren't enough for correct goals R Shah, V Varma, R Kumar, M Phuong, V Krakovna, J Uesato, Z Kenton arXiv preprint arXiv:2210.01790, 2022	45	2022
Avoiding Side Effects By Considering Future Tasks V Krakovna, L Orseau, R Ngo, M Martic, S Legg NeurIPS 2020, arXiv preprint arXiv:2010.07877, 2020	43	2020
Specification gaming examples in AI V Krakovna tinyurl.com/specification-gaming, 2018	37*	2018
Modeling AGI safety frameworks with causal influence diagrams T Everitt, R Kumar, V Krakovna, S Legg arXiv preprint arXiv:1906.08663, 2019	23	2019
Measuring and avoiding side effects using relative reachability V Krakovna, L Orseau, M Martic, S Legg arXiv preprint arXiv:1806.01186, 2018	18	2018
REALab: An embedded perspective on tampering R Kumar, J Uesato, R Ngo, T Everitt, V Krakovna, S Legg arXiv preprint arXiv:2011.08820, 2020	12	2020
Power-seeking can be probable and predictive for trained agents V Krakovna, J Kramar arXiv preprint arXiv:2304.06528, 2023	10*	2023
Memory-bounded left-corner unsupervised grammar induction on child-directed input C Shain, W Bryce, L Jin, V Krakovna, F Doshi-Velez, T Miller, W Schuler, ... Proceedings of COLING 2016, the 26th International Conference on …, 2016	10*	2016
Avoiding tampering incentives in deep RL via decoupled approval J Uesato, R Kumar, V Krakovna, T Everitt, R Ngo, S Legg arXiv preprint arXiv:2011.08827, 2020	7	2020
Interpretable selection and visualization of features and interactions using bayesian forests V Krakovna, J Du, JS Liu Statistics and its Interface 2018 (Volume 11 Number 3), arXiv preprint arXiv …, 2015	6*	2015
A generalized-zero-preserving method for compact encoding of concept lattices M Skala, V Krakovna, J Kramár, G Penn Proceedings of the 48th annual meeting of the Association for Computational …, 2010	6	2010
A Minimalistic Approach to Sum-Product Network Learning for Real Applications V Krakovna, M Looks ICLR 2016 workshop, arXiv preprint arXiv:1602.04259, 2016	5	2016
Building interpretable models: From Bayesian networks to neural networks V Krakovna	4	2016

The system can't perform the operation now. Try again later.

Articles 1–20

Citations per year

Duplicate citations

Merged citations

Add co-authorsCo-authors

Follow

Cited by

Co-authors