- Fundamentals of Reinforcement Learning
- K-armed_bandit_assignment
- optimal Policies with Dynamic Programming Assignment
- Sample-based Learning Methods
- Policy_evaluation_TD(0)_assignment
- Q-Learning_and_Expected_SARSA_assignment
- Dyna_Q_DynaQ+assignment
- Prediction and Control with Function Approximation
- Semi-Gradient TD(0) with State Aggregation Assignment
- Semi-Gradient TD with a Neural Network Assignment
- Function Approximation and Control Assignment
- Average Reward SoftMax Actor Critic Assignment
- Capstone Project
- Moonshoot Technologies
- Implement your agent Lunar Landing Assignment
- Completing the parameter study