Learning Algorithms for Separable Approximations of Discrete Stochastic Optimization Problems
Mathematics of Operations Research
Huseyin Topaloglu
Andrzej Ruszczyński
A stochastic approximation method with max-norm projections and its applications to the Q-learning algorithm