CS 185/285 at UC Berkeley Deep Reinforcement Learning

What You'll Learn

Official Source

  • Fundamentals of Reinforcement Learning (RL) and how agents learn through trial and error.

  • Imitation Learning and Behavioral Cloning to learn from expert demonstrations.

  • Policy Gradient methods for directly optimizing decision-making policies.

  • Actor-Critic algorithms that combine value estimation and policy learning.

  • Value-Based RL techniques, including Q-Learning and Deep Q-Networks (DQN).

  • Advanced RL methods such as SAC, advanced policy gradients, and exploration strategies.

  • Variational Inference and Control as Inference for probabilistic decision-making.

  • Model-Based RL for planning using learned environment models.

  • Offline RL, where agents learn from previously collected datasets.

  • LLM Reinforcement Learning, including training and aligning large language models.

  • Multi-task RL, RL theory, and current challenges and open research problems.

  • Practical implementation using PyTorch, along with hands-on homework assignments and a final project.


This course teaches how intelligent agents learn to make decisions through reinforcement learning. You will study core RL algorithms such as Policy Gradients, Actor-Critic, Q-Learning, Model-Based RL, Offline RL, and LLM RL, while gaining practical experience implementing state-of-the-art deep learning and reinforcement learning systems for real-world AI applications.