v0.5.1
PaParaZz1 committed Feb 4, 2024
1 parent abdf68a commit fb24992
Showing 4 changed files with 24 additions and 4 deletions.
20 changes: 20 additions & 0 deletions CHANGELOG
@@ -1,3 +1,23 @@
+2024.02.04(v0.5.1)
+- env: add MADDPG pettingzoo example (#774)
+- env: polish NGU Atari configs (#767)
+- env: fix bug in cliffwalking env (#759)
+- env: add PettingZoo replay video demo
+- env: change default max retry in env manager from 5 to 1
+- algo: add QGPO diffusion-model related algorithm (#757)
+- algo: add HAPPO multi-agent algorithm (#717)
+- algo: add DreamerV3 + MiniGrid adaption (#725)
+- algo: fix hppo entropy_weight to avoid nan error in log_prob (#761)
+- algo: fix structured action bug (#760)
+- algo: polish Decision Transformer entry (#754)
+- algo: fix EDAC policy/model bug
+- fix: env typos
+- fix: pynng requirements bug
+- fix: communication module unittest bug
+- style: polish policy API doc (#762) (#764) (#768)
+- style: add agent API doc (#758)
+- style: polish torch_utils/utils API doc (#745) (#747) (#752) (#755) (#763)
+
2023.11.06(v0.5.0)
- env: add tabmwp env (#667)
- env: polish anytrading env issues (#731)
4 changes: 2 additions & 2 deletions README.md
@@ -35,7 +35,7 @@
[![Hugging Face](https://img.shields.io/badge/%F0%9F%A4%97%20Hugging%20Face-Models-yellow)](https://huggingface.co/OpenDILabCommunity)
[![Open in OpenXLab](https://cdn-static.openxlab.org.cn/header/openxlab_models.svg)](https://openxlab.org.cn/models?search=opendilab)

-Updated on 2023.12.05 DI-engine-v0.5.0
+Updated on 2024.02.04 DI-engine-v0.5.1


## Introduction to DI-engine
@@ -51,7 +51,7 @@ It provides **python-first** and **asynchronous-native** task and middleware abs
- Offline RL algorithms: BCQ, CQL, TD3BC, Decision Transformer, EDAC, Diffuser, Decision Diffuser, SO2
- Model-based RL algorithms: SVG, STEVE, MBPO, DDPPO, DreamerV3, MuZero
- Exploration algorithms: HER, RND, ICM, NGU
-- LLM + RL Algorithms: PPO-max, DPO, MPDPO
+- LLM + RL Algorithms: PPO-max, DPO, MODPO,PromptPG
- Other algorithms: such as PER, PLR, PCGrad

**DI-engine** aims to **standardize different Decision Intelligence environments and applications**, supporting both academic research and prototype applications. Various training pipelines and customized decision AI applications are also supported:
2 changes: 1 addition & 1 deletion conda/meta.yaml
@@ -1,7 +1,7 @@
{% set data = load_setup_py_data() %}
package:
name: di-engine
-version: v0.5.0
+version: v0.5.1

source:
path: ..
2 changes: 1 addition & 1 deletion ding/__init__.py
@@ -1,7 +1,7 @@
import os

__TITLE__ = 'DI-engine'
-__VERSION__ = 'v0.5.0'
+__VERSION__ = 'v0.5.1'
__DESCRIPTION__ = 'Decision AI Engine'
__AUTHOR__ = "OpenDILab Contributors"
__AUTHOR_EMAIL__ = "[email protected]"
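
This commit updates the same version string by hand in README.md, conda/meta.yaml, and ding/__init__.py. A check along the following lines could catch a missed file in a future bump. It is only a minimal sketch: the helper names `extract_version` and `versions_in_sync` are hypothetical and not part of DI-engine, and the input strings are copied from the changed lines above rather than read from the repository.

```python
import re
from typing import Dict

# Hypothetical helpers, not part of DI-engine.
# Matches version strings shaped like the ones in this commit, e.g. "v0.5.1".
VERSION_RE = re.compile(r"v\d+\.\d+\.\d+")


def extract_version(text: str) -> str:
    """Return the first vX.Y.Z string found in text."""
    match = VERSION_RE.search(text)
    if match is None:
        raise ValueError(f"no version string found in {text!r}")
    return match.group(0)


def versions_in_sync(snippets: Dict[str, str]) -> bool:
    """True if every snippet carries the same version string."""
    return len({extract_version(s) for s in snippets.values()}) == 1


# The three version-bearing lines as they read after this commit.
snippets = {
    "README.md": "Updated on 2024.02.04 DI-engine-v0.5.1",
    "conda/meta.yaml": "version: v0.5.1",
    "ding/__init__.py": "__VERSION__ = 'v0.5.1'",
}
print(versions_in_sync(snippets))
```

In a real setup the snippets would be read from the files themselves; hard-coding them here keeps the sketch self-contained.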