Authors: Kevin Murphy
Abstract: This manuscript gives a big-picture, up-to-date overview of the field of
(deep) reinforcement learning and sequential decision making, covering
value-based RL, policy-gradient methods, model-based methods, and various other
topics (including a very brief discussion of RL+LLMs).
Source: http://arxiv.org/abs/2412.05265v1