Paul Christiano
Concrete problems in AI safety
D Amodei, C Olah, J Steinhardt, P Christiano, J Schulman, D Mané
arXiv preprint arXiv:1606.06565, 2016
Theano: A Python framework for fast computation of mathematical expressions
R Al-Rfou, G Alain, A Almahairi, C Angermueller, D Bahdanau, N Ballas, ...
arXiv preprint arXiv:1605.02688, 2016
Deep reinforcement learning from human preferences
PF Christiano, J Leike, T Brown, M Martic, S Legg, D Amodei
Advances in Neural Information Processing Systems, 4299-4307, 2017
Electrical flows, Laplacian systems, and faster approximation of maximum flow in undirected graphs
P Christiano, JA Kelner, A Madry, DA Spielman, SH Teng
Proceedings of the forty-third annual ACM symposium on Theory of computing …, 2011
A connection between generative adversarial networks, inverse reinforcement learning, and energy-based models
C Finn, P Christiano, P Abbeel, S Levine
arXiv preprint arXiv:1611.03852, 2016
Transfer from simulation to real world through learning deep inverse dynamics model
P Christiano, Z Shah, I Mordatch, J Schneider, T Blackwell, J Tobin, ...
arXiv preprint arXiv:1610.03518, 2016
Quantum money from hidden subspaces
S Aaronson, P Christiano
Proceedings of the forty-fourth annual ACM symposium on Theory of computing …, 2012
Unrestricted adversarial examples
TB Brown, N Carlini, C Zhang, C Olsson, P Christiano, I Goodfellow
arXiv preprint arXiv:1809.08352, 2018
Robust Cooperation in the Prisoner's Dilemma: Program Equilibrium via Provability Logic
M Barasz, P Christiano, B Fallenstein, M Herreshoff, P LaVictoire, ...
arXiv preprint arXiv:1401.5577, 2014
AI safety via debate
G Irving, P Christiano, D Amodei
arXiv preprint arXiv:1805.00899, 2018
A cryptographic test of quantumness and certifiable randomness from a single quantum device
Z Brakerski, P Christiano, U Mahadev, U Vazirani, T Vidick
2018 IEEE 59th Annual Symposium on Foundations of Computer Science (FOCS …, 2018
Supervising strong learners by amplifying weak experts
P Christiano, B Shlegeris, D Amodei
arXiv preprint arXiv:1810.08575, 2018
Fine-tuning language models from human preferences
DM Ziegler, N Stiennon, J Wu, TB Brown, A Radford, D Amodei, ...
arXiv preprint arXiv:1909.08593, 2019
Online local learning via semidefinite programming
P Christiano
Proceedings of the forty-sixth annual ACM symposium on Theory of computing …, 2014
Reflective oracles: A foundation for game theory in artificial intelligence
B Fallenstein, J Taylor, PF Christiano
International Workshop on Logic, Rationality and Interaction, 411-415, 2015
Non-omniscience, probabilistic inference, and metamathematics
P Christiano
Machine Intelligence Research Institute, Berkeley, CA, June, 2014
Lossless fault-tolerant data structures with additive overhead
P Christiano, ED Demaine, S Kishore
Workshop on Algorithms and Data Structures, 243-254, 2011
Provably manipulation-resistant reputation systems
P Christiano
Conference on Learning Theory, 670-697, 2016
Open problem: Online local learning
P Christiano
Conference on Learning Theory, 1290-1294, 2014
Manipulation-resistant online learning
PF Christiano
UC Berkeley, 2017
