Copy of Model-based Lifelong reinforcement learning with Bayesian Exploration Part of final exam for my RL course in Sapienza, prof. R. Capobianco. version 2.0 : include parallel propagation during inference