This project focuses on comparing different Reinforcement Learning Algorithms, including monte-carlo, q-learning, lambda q-learning epsilon-greedy variations, etc.
python reinforcement-learning monte-carlo openai-gym q-learning policy rl-agents epsilon-greedy dynamic-programming markov-chains approximation-algorithms ucb1 q-lambda exploration-exploitation thomson-sampling frozen-lake multi-bandit-army
-
Updated
Feb 15, 2022 - Python