Skip to content

Implementation of Iterative Bounding Markov Decision Processes (Topin et. al. 2021)

Notifications You must be signed in to change notification settings

KohlerHECTOR/ibmdp-py

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

5 Commits
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Iterative Bounding Markov Decision Processes

Implementation of IBMDPs. Just copy the gymnasium environment from here ibmdp.

from stable_baselines3 import PPO
from gymnasium import make
from gymnasium.wrappers.time_limit import TimeLimit
from ibmdp import IBMDP

env = make("CartPole-v1")
env = IBMDP(env, zeta=0, info_gathering_actions=[(0,0)])
env = TimeLimit(env, 1000)

model = PPO("MlpPolicy", env, verbose=1)
model.learn(1e5)

About

Implementation of Iterative Bounding Markov Decision Processes (Topin et. al. 2021)

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages