polish(mark): add hybrid action space support to ActionNoiseWrapper #829

MarkHolmstrom · 2024-09-11T03:05:08Z

Description

Previously, target policy smoothing could not be used in the TD3 algorithm with hybrid action spaces due to the action object being a dictionary composed of an action_type tensor, an action_args tensor, and a logit tensor instead of the expected action object being the action_type tensor. The PR adds checks for the nested dictionary and adds noise only to the action_args tensor corresponding to the continuous action.

Related Issue

#789

TODO

Check List

merge the latest version source branch/repo, and resolve all the conflicts
pass style check
pass all the tests

Add hybrid action space support to action noise wrapper.

ding/model/wrapper/model_wrappers.py

Update model_wrappers.py

efda82e

Add hybrid action space support to action noise wrapper.

puyuan1996 added the enhancement New feature or request label Sep 13, 2024

puyuan1996 reviewed Sep 13, 2024

View reviewed changes

ding/model/wrapper/model_wrappers.py Outdated Show resolved Hide resolved

Updated syntax and added comment

70839ba

puyuan1996 changed the title ~~Add Hybrid Action Space Support to Action Noise Wrapper~~ polish(mark): add hybrid action space support to ActionNoiseWrapper Sep 18, 2024

Merge branch 'main' into main

2046f44

PaParaZz1 mentioned this pull request Sep 20, 2024

Roadmap for DI-engine #548

Open

PaParaZz1 approved these changes Sep 20, 2024

View reviewed changes

PaParaZz1 merged commit 6ae1396 into opendilab:main Sep 20, 2024
12 of 15 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

polish(mark): add hybrid action space support to ActionNoiseWrapper #829

polish(mark): add hybrid action space support to ActionNoiseWrapper #829

MarkHolmstrom commented Sep 11, 2024

polish(mark): add hybrid action space support to ActionNoiseWrapper #829

polish(mark): add hybrid action space support to ActionNoiseWrapper #829

Conversation

MarkHolmstrom commented Sep 11, 2024

Description

Related Issue

TODO

Check List