v0.4.9
API Change
- refactor the implementation of Decision Transformer, DI-engine supports both discrete and continuous DT outputs with the multi-modal observation now (example:
ding/example/dt.py
) - Update the multi-GPU Distributed Data Parallel (DDP) example (link)
- Change the return value of
InteractionSerialEvaluator
, simplifying redundant results
Env
- add cliffwalking env (#677)
- add lunarlander ppo config and example
Algorithm
- add BCQ offline RL algorithm (#640)
- add Dreamerv3 model-based RL algorithm (#652)
- add tensor stream merge network tools (#673)
- add scatter connection model (#680)
- refactor Decision Transformer in new pipeline and support img input and discrete output (#693)
- add three variants of Bilinear classes and a FiLM class (#703)
Enhancement
- polish offpolicy RL multi-gpu DDP training (#679)
- add middleware for Ape-X distributed pipeline (#696)
- add example for evaluating trained DQN (#706)
Fix
- fix to_ndarray fails to assign dtype for scalars (#708)
- fix evaluator return episode_info compatibility bug
- fix cql example entry wrong config bug
- fix enable_save_figure env interface
- fix redundant env info bug in evaluator
- fix to_item unittest bug
Style
- polish and simplify requirements (#672)
- add Hugging Face Model Zoo badge (#674)
- add openxlab Model Zoo badge (#675)
- fix py37 macos ci bug and update default pytorch from 1.7.1 to 1.12.1 (#678)
- fix mujoco-py compatibility issue for cython<3 (#711)
- fix type spell error (#704)
- fix pypi release actions ubuntu 18.04 bug
- update contact information (e.g. wechat)
- polish algorithm doc tables
New Repo
- DOS: [CVPR 2023] ReasonNet: End-to-End Driving with Temporal and Global Reasoning
Full Changelog: v0.4.8...v0.4.9
Contributors: @PaParaZz1 @zjowowen @zhangpaipai @AltmanD @puyuan1996 @Cloud-Pku @Super1ce @kxzxvbk @jayyoung0802 @Mossforest @lxl2gf @Privilger