An implementation of Reinforce Algorithm with a parameterized baseline, with a detailed comparison against whitening. ##Performance of Reinforce trained on CartPole
##Average Performance of Reinforce for multiple runs
##Comparison of subtracting a learned baseline from the return vs. using return whitening