Skip to content
View initial-h's full-sized avatar
🎯
Focusing
🎯
Focusing

Highlights

  • Pro

Block or report initial-h

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Pinned Loading

  1. AlphaZero_Gomoku_MPI AlphaZero_Gomoku_MPI Public

    An asynchronous/parallel method of AlphaGo Zero algorithm with Gomoku

    Python 186 43

  2. CEER CEER Public

    Replay Memory as An Empirical MDP: Combining Conservative Estimation with Experience Replay. ICLR 2023

    Python 4

  3. FlappyBird_DQN_with_target_network FlappyBird_DQN_with_target_network Public

    DQN with freezing target network in tensorflow on pygame FlappyBird

    Python 11 1

  4. haotiansun14/rl-rep haotiansun14/rl-rep Public

    Representation Learning (RepL) Methods in Reinforcement Learning and Causal Inference

    Python 20 6

  5. tensorlayer/TensorLayer tensorlayer/TensorLayer Public

    Deep Learning and Reinforcement Learning Library for Scientists and Engineers

    Python 7.3k 1.6k