It's a general advanted estimator where you weight different n-step estimates of the advantage function ![[gae.png]]