A naive question about updating parameters in DDPG. #46

HiddenBeginner · 2021-03-04T07:35:34Z

Hi, first of all, thanks for your awesome codes. This is not about any technical issue, but about the algorithm of the DDPG code.

As far as I know, the DDPG method can exploit online parameter update due to the TD learning. But, in your code, the parameters are updated after an episode is over.

I would like to ask you if there are some theoretical background behind this parameter update interval?

Thank you in advance.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

A naive question about updating parameters in DDPG. #46

A naive question about updating parameters in DDPG. #46

HiddenBeginner commented Mar 4, 2021 •

edited

Loading

A naive question about updating parameters in DDPG. #46

A naive question about updating parameters in DDPG. #46

Comments

HiddenBeginner commented Mar 4, 2021 • edited Loading

HiddenBeginner commented Mar 4, 2021 •

edited

Loading