extrinsic and intrinsic combination #44

murtazabasu · 2019-12-23T11:44:30Z

Hello, I am trying to implement ICM in PPO using a combination of extrinsic and intrinsic rewards. I have seen a few repos that weight the extrinsic reward more heavily than the intrinsic one, i.e. `combine_reward = (1 - int_coef) * rewards + int_coef * intrinsic_reward`, where `int_coef = 0.01`, which reduces the effect of the intrinsic reward significantly. As far as I can see, your paper does not mention an equation of this kind for combining the two rewards. Could you tell me whether the equation above can be used in a dual-reward setting?
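For concreteness, here is a minimal sketch of the weighting described above, written in plain Python. The function name `combine_rewards`, the sample values, and the range check on `int_coef` are assumptions made for illustration. This is the convex combination seen in some third-party repos, not necessarily what the paper or this repository does.

```python
from typing import List, Sequence


def combine_rewards(
    rewards: Sequence[float],
    intrinsic_reward: Sequence[float],
    int_coef: float = 0.01,
) -> List[float]:
    """Per-step convex combination of extrinsic and intrinsic rewards.

    Hypothetical helper: with int_coef = 0.01 the extrinsic reward carries
    99% of the weight and the intrinsic (curiosity) reward carries 1%.
    """
    # Assumption: int_coef is a mixing weight, so it must lie in [0, 1].
    if not 0.0 <= int_coef <= 1.0:
        raise ValueError("int_coef must lie in [0, 1]")
    if len(rewards) != len(intrinsic_reward):
        raise ValueError("reward sequences must have the same length")
    return [
        (1.0 - int_coef) * r_ext + int_coef * r_int
        for r_ext, r_int in zip(rewards, intrinsic_reward)
    ]


# Made-up rollout of three steps, for illustration only.
rewards = [1.0, 0.0, 0.0]
intrinsic_reward = [0.5, 2.0, 0.0]
combine_reward = combine_rewards(rewards, intrinsic_reward, int_coef=0.01)
```

With `int_coef = 0.01`, the intrinsic term only matters on steps where the extrinsic reward is zero or very small, so the two reward streams should be on comparable scales before they are mixed.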

Joll123 · 2020-05-24T13:13:04Z

Hello, do you now understand the relationship between the extrinsic and intrinsic rewards, and how to adjust the `int_coef` parameter?
