Imitating interactive intelligence J Abramson, A Ahuja, I Barr, A Brussee, F Carnevale, M Cassin, ... arXiv preprint arXiv:2012.05672, 2020 | 73 | 2020 |
Goal misgeneralization: why correct specifications aren't enough for correct goals R Shah, V Varma, R Kumar, M Phuong, V Krakovna, J Uesato, Z Kenton arXiv preprint arXiv:2210.01790, 2022 | 46 | 2022 |
Explaining grokking through circuit efficiency V Varma, R Shah, Z Kenton, J Kramár, R Kumar arXiv preprint arXiv:2309.02390, 2023 | 17 | 2023 |
Safe deep rl in 3d environments using human feedback M Rahtz, V Varma, R Kumar, Z Kenton, S Legg, J Leike arXiv preprint arXiv:2201.08102, 2022 | 7 | 2022 |
Inter-device data transfer based on barcodes J Chien, RI Orton, G Weisz, V Varma US Patent 9,600,701, 2017 | 7 | 2017 |