Benjamin Van Roy
Title
Cited by
Cited by
Year
Analysis of temporal-diffference learning with function approximation
JN Tsitsiklis, B Van Roy
Advances in neural information processing systems, 1075-1081, 1997
14701997
The linear programming approach to approximate dynamic programming
DP De Farias, B Van Roy
Operations research 51 (6), 850-865, 2003
7672003
Regression methods for pricing complex American-style options
JN Tsitsiklis, B Van Roy
IEEE Transactions on Neural Networks 12 (4), 694-703, 2001
7222001
Deep exploration via bootstrapped DQN
I Osband, C Blundell, A Pritzel, B Van Roy
Advances in neural information processing systems 29, 4026-4034, 2016
5972016
Feature-based methods for large scale dynamic programming
JN Tsitsiklis, B Van Roy
Machine Learning 22 (1-3), 59-94, 1996
5901996
Markov perfect industry dynamics with many firms
G Weintraub, CL Benkard, B Van Roy
Econometrica 76 (6), 1375-1411, 2008
4442008
On constraint sampling in the linear programming approach to approximate dynamic programming
DP De Farias, B Van Roy
Mathematics of operations research 29 (3), 462-478, 2004
4162004
Optimal stopping of Markov processes: Hilbert space theory, approximation algorithms, and an application to pricing high-dimensional financial derivatives
JN Tsitsiklis, B Van Roy
IEEE Transactions on Automatic Control 44 (10), 1840-1851, 1999
3961999
Learning to optimize via posterior sampling
D Russo, B Van Roy
Mathematics of Operations Research 39 (4), 1221-1243, 2014
3332014
A tutorial on thompson sampling
D Russo, B Van Roy, A Kazerouni, I Osband, Z Wen
Foundations and Trends in Machine Learning 11 (1), pp. 1-96, 2018
2972018
Consensus propagation
CC Moallemi, B Van Roy
IEEE Transactions on Information Theory 52 (11), 4753-4766, 2006
2832006
Solving data mining problems through pattern recognition
RL Kennedy, Y Lee, B Van Roy, CD Reed, RP Lippman
Upper Saddle River, NJ: Prentice Hall PTR, 2011
238*2011
Dynamic pricing with a prior on market response
VF Farias, B Van Roy
Operations Research 58 (1), 16-29, 2010
2072010
An information-theoretic analysis of Thompson sampling
D Russo, B Van Roy
The Journal of Machine Learning Research 17 (1), 2442-2471, 2016
1922016
A neuro-dynamic programming approach to retailer inventory management
B Van Roy, DP Bertsekas, Y Lee, JN Tsitsiklis
Proceedings of the 36th IEEE Conference on Decision and Control 4, 4052-4057, 1997
1821997
Average cost temporal-difference learning
JN Tsitsiklis, B Van Roy
Automatica 35, 319-349, 1999
1751999
Generalization and exploration via randomized value functions
I Osband, B Van Roy, Z Wen
International Conference on Machine Learning, 2377-2386, 2016
1712016
Compensation for frequency adjustment in mobile communication-positioning device with shared oscillator
LS Bloebaum, P Bharti, S Chung, B Van Roy, W Mann
US Patent 6,724,342, 2004
1542004
A nonparametric approach to multiproduct pricing
P Rusmevichientong, B Van Roy, PW Glynn
Operations Research 54 (1), 82-98, 2006
1522006
Making eigenvector-based reputation systems robust to collusion
H Zhang, A Goel, R Govindan, K Mason, B Van Roy
International Workshop on Algorithms and Models for the Web-Graph, 92-104, 2004
1512004
The system can't perform the operation now. Try again later.
Articles 1–20