GitHub - ngmq/adversarial-online-multi-task-reinforcement-learning: Code for the paper Adversarial Online Multi-Task Reinforcement Learning (ALT 2023).

Six steps to replicate the results:

Step 1. Install rlberry. Please follow the guideline at https://rlberry.readthedocs.io/en/latest/installation.html
Step 2. Open a terminal and change directory to the directory ReplicateExperiments.
Step 3. Activate rlberry environment with command.

$ conda activate rlberry

Step 4. Train and test four agents: the optimal non-stationary agent, the AOMultiRL agent with a given distinguishing set, the one-episode UCBVI agent and the random agent.

$ python AOMultiRL1.py

At the end of this command, results for these four agents are saved in the directory Data/AOMultiRL1.

Step 5. Train and test the AOMultiRL2 agent that discovers a distinguishing set on its own.

$ python AOMultiRL2.py

At the end of this command, results for these four agents are saved in the directory Data/AOMultiRL2.

$ utils.py

Name		Name	Last commit message	Last commit date
Latest commit History 3 Commits
AOMTAgent.py		AOMTAgent.py
AOMultiRL1.py		AOMultiRL1.py
AOMultiRL2.py		AOMultiRL2.py
ExploreIDAgent.py		ExploreIDAgent.py
RandomAgent.py		RandomAgent.py
Readme.md		Readme.md
UCBVICHAgent.py		UCBVICHAgent.py
constants.py		constants.py
utils.py		utils.py

Provide feedback