Hi, thanks a lot for your great work!
I have a question about the Double DQN code: does it perhaps need a `stop_gradient`? `double_q` comes from the target DQN, so when the main DQN is updated, the error will be backpropagated into the target DQN unless we stop the flow, right? If so, do we need to stop the gradient as follows?
target_q = tf.stop_gradient(target_q)
Could you please give some advice? Thanks.
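To make the question concrete, here is a minimal sketch in plain Python (no TensorFlow) of what I mean by treating the Double DQN target as a constant. Everything in it is hypothetical and not taken from the repository: the linear Q-functions, the names `q_values`, `double_dqn_target` and `main_gradient`, and the numbers. The target is computed as an ordinary float, which is the role I would expect `tf.stop_gradient` to play in the graph, and the gradient of the squared TD error is taken with respect to the main weights only.

```python
# Hypothetical linear Q-functions: Q(s, a) = weights[a] * s, one weight per action.
GAMMA = 0.9  # discount factor (assumed value, for illustration only)


def q_values(weights, state):
    """Return Q(state, a) for every action a."""
    return [w * state for w in weights]


def double_dqn_target(w_main, w_target, reward, next_state, done):
    """Double DQN target: the main net picks the action, the target net scores it."""
    next_q_main = q_values(w_main, next_state)
    best_action = max(range(len(next_q_main)), key=next_q_main.__getitem__)
    double_q = q_values(w_target, next_state)[best_action]
    # Returned as a plain float, i.e. a constant with respect to any gradient.
    return reward + (0.0 if done else GAMMA * double_q)


def main_gradient(w_main, state, action, target_q):
    """Gradient of 0.5 * (target_q - Q_main(state, action))**2 w.r.t. w_main.

    target_q is held constant, so nothing is differentiated through it.
    """
    td_error = target_q - w_main[action] * state
    grad = [0.0] * len(w_main)
    grad[action] = -td_error * state
    return grad


w_main = [1.0, 2.0]    # main DQN weights (made-up numbers)
w_target = [0.5, 1.5]  # target DQN weights (made-up numbers)

target_q = double_dqn_target(w_main, w_target, reward=1.0, next_state=2.0, done=False)
grad_main = main_gradient(w_main, state=1.0, action=0, target_q=target_q)
```

In this sketch the target weights never receive a gradient, because `target_q` is just a number by the time the loss is formed. Whether the TensorFlow graph in the repository already behaves this way, or needs the explicit `tf.stop_gradient`, is exactly what I am unsure about.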