There's got to be a better way!
A critique of Reformist RL's inefficiency and a proposal for more effective alternatives in reinforcement learning.
Ben Recht is a researcher and writer exploring the history, theory, and practice of decision-making by humans and machines. On arg min, he covers optimization, machine learning, cybernetics, and occasional reflections on music and culture.
23 articles from this blog
A simplified, non-technical definition of reinforcement learning as an iterative optimization process based on external feedback.
A technical lecture on applying policy gradient methods to derive optimization algorithms, focusing on the unbiased gradient estimator and its applications.