v0.4.9

PaParaZz1 released this 23 Aug 09:49

· 109 commits to main since this release

API Change

refactor the implementation of Decision Transformer, DI-engine supports both discrete and continuous DT outputs with the multi-modal observation now (example: ding/example/dt.py)
Update the multi-GPU Distributed Data Parallel (DDP) example (link)
Change the return value of InteractionSerialEvaluator, simplifying redundant results

Env

add cliffwalking env (#677)
add lunarlander ppo config and example

Algorithm

add BCQ offline RL algorithm (#640)
add Dreamerv3 model-based RL algorithm (#652)
add tensor stream merge network tools (#673)
add scatter connection model (#680)
refactor Decision Transformer in new pipeline and support img input and discrete output (#693)
add three variants of Bilinear classes and a FiLM class (#703)

Enhancement

polish offpolicy RL multi-gpu DDP training (#679)
add middleware for Ape-X distributed pipeline (#696)
add example for evaluating trained DQN (#706)

Fix

fix to_ndarray fails to assign dtype for scalars (#708)
fix evaluator return episode_info compatibility bug
fix cql example entry wrong config bug
fix enable_save_figure env interface
fix redundant env info bug in evaluator
fix to_item unittest bug

Style

polish and simplify requirements (#672)
add Hugging Face Model Zoo badge (#674)
add openxlab Model Zoo badge (#675)
fix py37 macos ci bug and update default pytorch from 1.7.1 to 1.12.1 (#678)
fix mujoco-py compatibility issue for cython<3 (#711)
fix type spell error (#704)
fix pypi release actions ubuntu 18.04 bug
update contact information (e.g. wechat)
polish algorithm doc tables

New Repo

DOS: [CVPR 2023] ReasonNet: End-to-End Driving with Temporal and Global Reasoning

Full Changelog: v0.4.8...v0.4.9

Contributors: @PaParaZz1 @zjowowen @zhangpaipai @AltmanD @puyuan1996 @Cloud-Pku @Super1ce @kxzxvbk @jayyoung0802 @Mossforest @lxl2gf @Privilger

Contributors

Privilger, AltmanD, and 10 other contributors

Assets 2