Pull requests: pytorch/rl
[Feature] multiagent data standardization: PPO advantages
CLA Signed
This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed.
#2677 opened Dec 26, 2024 by matteobettini
[CI] Fix conda on windows
CI
Has to do with CI setup (e.g. wheels & builds, tests...)
CLA Signed
#2676 opened Dec 20, 2024 by vmoens
10 tasks
[Tutorial] MCTS
CLA Signed
#2673 opened Dec 19, 2024 by vmoens
First draft for modular Hindsight Experience Replay Transform
CLA Signed
[Feature] Make PPO compatible with composite actions and log-probs
CLA Signed
enhancement
New feature or request
#2665 opened Dec 18, 2024 by vmoens
[Feature] Add Hash transform
CLA Signed
enhancement
#2648 opened Dec 13, 2024 by kurtamohler
[Tutorial] Beam search with GPT models
CLA Signed
tutorials
#2623 opened Dec 2, 2024 by vmoens
[Feature] PPOTrainer
CLA Signed
#2550 opened Nov 11, 2024 by vmoens
[Feature] habitat env from config
bug
Something isn't working
CLA Signed
enhancement
#2539 opened Nov 6, 2024 by vmoens
10 tasks
[CI] Fix windows upload wheels
CI
CLA Signed
#2507 opened Oct 21, 2024 by vmoens
[Feature] Gymnasium 1.0 compatibility
CLA Signed
Environments
Adds or modifies an environment wrapper
#2473 opened Oct 9, 2024 by vmoens
[Examples] boiler plate code for multi-turn reward for RLHF
CLA Signed
enhancement
#2467 opened Oct 5, 2024 by rghosh08
3 of 10 tasks
[Algorithm] Update scripts with compile
CLA Signed
#2449 opened Sep 23, 2024 by vmoens
[Feature] RB compability with compile
CLA Signed
enhancement
#2426 opened Sep 9, 2024 by vmoens
[CI] Add benchmarks to test runs
CI
CLA Signed
#2410 opened Sep 2, 2024 by vmoens
[Feature] non-functional SAC loss
CLA Signed
#2393 opened Aug 13, 2024 by vmoens
[Feature] use_vmap=False for SAC
CLA Signed
enhancement
#2392 opened Aug 13, 2024 by vmoens
[Algorithm] TD3 fast
CLA Signed
#2389 opened Aug 10, 2024 by vmoens
[Doc] Better doc for distributed RBs
CLA Signed
#2378 opened Aug 7, 2024 by vmoens
[Feature] MCTS Scoring functions
CLA Signed
enhancement
#2358 opened Aug 4, 2024 by vmoens
[Feature] AbsorbingStateTransform
CLA Signed
enhancement
[WIP] Correct typos
CLA Signed
#2263 opened Jul 2, 2024 by vmoens
[WIP] AlphaZero
CLA Signed
enhancement
new algo
New algorithm request or PR
#2246 opened Jun 24, 2024 by vmoens
[WIP] Remove functional calls if possible
CLA Signed
#2153 opened May 2, 2024 by vmoens
[Example] Comprehensive dataset rendering examples
CLA Signed
#2141 opened Apr 30, 2024 by vmoens