Skip to content

josephjohncox/test

 
 

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

5 Commits
 
 
 
 

Repository files navigation

Misc experiments

$$V(s) , = \max_a \left{ \sum_{s'} P_a(s,s') \left( R_a(s,s') + \gamma V(s') \right) \right}$$

Without \left and \right:

$$V(s) , = \max_a { \sum_{s'} P_a(s,s') ( R_a(s,s') + \gamma V(s') ) }$$

The monad-parametrized representation is isomorphic to a state transition function together with an initial state $s$, i.e., $(a \times s \to b \times s) \ \times \ s$ (where $a$ and $b$ are the input and output value types, respectively).

About

Misc experiments

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published