v0.5.1

opendilab · Feb 4, 2024 · fb24992 · fb24992
1 parent abdf68a
commit fb24992
Show file tree

Hide file tree

Showing 4 changed files with 24 additions and 4 deletions.
diff --git a/CHANGELOG b/CHANGELOG
@@ -1,3 +1,23 @@
+2024.02.04(v0.5.1)
+- env: add MADDPG pettingzoo example (#774)
+- env: polish NGU Atari configs (#767)
+- env: fix bug in cliffwalking env (#759)
+- env: add PettingZoo replay video demo
+- env: change default max retry in env manager from 5 to 1
+- algo: add QGPO diffusion-model related algorithm (#757)
+- algo: add HAPPO multi-agent algorithm (#717)
+- algo: add DreamerV3 + MiniGrid adaption (#725)
+- algo: fix hppo entropy_weight to avoid nan error in log_prob (#761)
+- algo: fix structured action bug (#760)
+- algo: polish Decision Transformer entry (#754)
+- algo: fix EDAC policy/model bug
+- fix: env typos
+- fix: pynng requirements bug
+- fix: communication module unittest bug
+- style: polish policy API doc (#762) (#764) (#768)
+- style: add agent API doc (#758)
+- style: polish torch_utils/utils API doc (#745) (#747) (#752) (#755) (#763)
+
 2023.11.06(v0.5.0)
 - env: add tabmwp env (#667)
 - env: polish anytrading env issues (#731)

diff --git a/README.md b/README.md
@@ -35,7 +35,7 @@
 [![Hugging Face](https://img.shields.io/badge/%F0%9F%A4%97%20Hugging%20Face-Models-yellow)](https://huggingface.co/OpenDILabCommunity)
 [![Open in OpenXLab](https://cdn-static.openxlab.org.cn/header/openxlab_models.svg)](https://openxlab.org.cn/models?search=opendilab)
 
-Updated on 2023.12.05 DI-engine-v0.5.0
+Updated on 2024.02.04 DI-engine-v0.5.1
 
 
 ## Introduction to DI-engine
@@ -51,7 +51,7 @@ It provides **python-first** and **asynchronous-native** task and middleware abs
 - Offline RL algorithms: BCQ, CQL, TD3BC, Decision Transformer, EDAC, Diffuser, Decision Diffuser, SO2
 - Model-based RL algorithms: SVG, STEVE, MBPO, DDPPO, DreamerV3, MuZero
 - Exploration algorithms: HER, RND, ICM, NGU
-- LLM + RL Algorithms: PPO-max, DPO, MPDPO
+- LLM + RL Algorithms: PPO-max, DPO, MODPO，PromptPG
 - Other algorithms: such as PER, PLR, PCGrad
 
 **DI-engine** aims to **standardize different Decision Intelligence environments and applications**, supporting both academic research and prototype applications. Various training pipelines and customized decision AI applications are also supported:

diff --git a/conda/meta.yaml b/conda/meta.yaml
@@ -1,7 +1,7 @@
 {% set data = load_setup_py_data() %}
 package:
   name: di-engine
-  version: v0.5.0
+  version: v0.5.1
 
 source:
   path: ..

diff --git a/ding/__init__.py b/ding/__init__.py
@@ -1,7 +1,7 @@
 import os
 
 __TITLE__ = 'DI-engine'
-__VERSION__ = 'v0.5.0'
+__VERSION__ = 'v0.5.1'
 __DESCRIPTION__ = 'Decision AI Engine'
 __AUTHOR__ = "OpenDILab Contributors"
 __AUTHOR_EMAIL__ = "[email protected]"