gym-pomdp-wrappers

gym-pomdp-wrappers is a set of wrappers that can be used to turn some specific OpenAI Gym environments into Partially Observable Markov Decision Processes (POMDPs).

Dependencies

This pacakge requires python3 (>=3.5). Additionally, you need the following packages:

Installation

After installaling the dependencies, you can perform a install of gym-pomdp-wrappers with:

git clone https://github.com/stweigand/gym-pomdp-wrappers.git
cd gym-pomdp-wrappers
pip install -e .

Implemented Wrappers

MuJoCoHistoryEnv: Stacks observations from a given MuJoCo environment to a history of given length. There are multiple history types implemented. For the POMDP versions of some tasks are dimensions of the observation vecotor specified which will be overwritten with zeros.
RockSampleHistoryEnv: Stacks observations (and additional information) from a RockSample instance to a history of given length. The history types specifiy which information are added to the history.
NoisyEnv: Takes observations from an environment, adds guassian noise and stacks them to a history of given length. There are multiple history types implemented.

Detailed information on the different history types and usage will be added later! See the comments in the code for further information.

Example

Environments could be initialized as follows:

import gym
from gym_pomdp_wrappers import MuJoCoHistoryEnv
env = MuJoCoHistoryEnv("Walker2d-v2", hist_len=4, history_type="history_ac_pomdp")

import gym
import gym_pomdp
from gym_pomdp_wrappers import RockSampleHistoryEnv
env = RockSampleHistoryEnv("Rock-v0", hist_len=4, history_type="history_full", kwargs={'board_size':5, 'num_rocks':5})

MuJoCo

Some of the wrappers use MuJoCo (multi-joint dynamics in contact) physics simulator, which is proprietary and requires binaries and a license (temporary 30-day license can be obtained from www.mujoco.org). Instructions on setting up MuJoCo can be found here

Citing the Project

This repository is part of my Master's thesis which is about "Guided Reinforcement Learning Under Partial Observability" (Link will be added later). Here is a BibTeX entry that you can use to cite it in a publication::

@mastersthesis{guidedreinforcementlearning,
    Author = {Stephan Weigand},
    School = {Technische Universit{\"a}t Darmstadt},
    Title = {Guided Reinforcement Learning Under Partial Observability},
    Year = {2019}
}

Name		Name	Last commit message	Last commit date
Latest commit History 5 Commits
gym_pomdp_wrappers		gym_pomdp_wrappers
.gitignore		.gitignore
LICENSE.txt		LICENSE.txt
README.md		README.md
setup.py		setup.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

gym-pomdp-wrappers

Dependencies

Installation

Implemented Wrappers

Example

MuJoCo

Citing the Project

About

Releases

Packages

Languages

License

stweigand/gym-pomdp-wrappers

Folders and files

Latest commit

History

Repository files navigation

gym-pomdp-wrappers

Dependencies

Installation

Implemented Wrappers

Example

MuJoCo

Citing the Project

About

Topics

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages