Skip to content
View tdsimao's full-sized avatar

Highlights

  • Pro

Block or report tdsimao

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Pinned Loading

  1. AlgTUDelft/AlwaysSafe AlgTUDelft/AlwaysSafe Public

    Code for the paper "AlwaysSafe: Reinforcement Learning Without Safety Constraint Violations During Training"

    Python 18 3

  2. LAVA-LAB/spi_pomdp LAVA-LAB/spi_pomdp Public

    Code for the paper "Safe Policy Improvement for POMDPs via Finite-State Controllers"

    Python 1

  3. SPIBB SPIBB Public

    Forked from RomainLaroche/SPIBB

    Safe Policy Improvement with Baseline Bootstrapping

    Python

  4. SPIBB-DQN SPIBB-DQN Public

    Forked from rems75/SPIBB-DQN

    Code for SPIBB-DQN and Soft-SPIBB-DQN

    Python

  5. gym-factored gym-factored Public

    Python 4