GitHub - KBoumghar/IFT4055-RL

#IFT4055 - Journal

Questions I need to answer :

Auxiliary objective, what is this exactly?
Minimizing the R.H.S to get maximum reward
Estimate of state marginal (cannot seem to find reference for that)
How / how fast can we find the distribution that fits our p_{\theta_t}(s)
Maximum likelihood estimation : OK. Maximum likelihood state density estimation process???
We can't assume independence of states like what I've seen. What is used for Maximum likelihood?

What I (think) I need to do next :

Name		Name	Last commit message	Last commit date
Latest commit History 9 Commits
Minecraft		Minecraft
Notes		Notes
SMiRL_Code		SMiRL_Code
Snake		Snake
rlkit		rlkit
README.md		README.md

Provide feedback