-
Notifications
You must be signed in to change notification settings - Fork 28
Transient execution model. How to get things done with the transient libraries
Haskell is not a conventional language. However, the basic elements of all languages are the same. Transient changes how the programmer does things in Haskell and other languages to make them, hopefully, simpler. To make things simpler and powerful, paradoxically, some of these things are done less conventionally than in Haskell and other languages.
The way to program in transient is deeply influenced by the execution model, which is different from the implicit single-threaded, with explicit multithreading, explicit loops and explicit communications of any conventional way of programming, including the current one in Haskell. Transient use techniques to perform implicit loops, implicitly multithreading and implicit communications to make modular and composable what is usually not so much.
Rather than a long technical explanation, the best way to grasp it is through examples of practical usage cases. This why I mix here two apparent different topics: the execution model and how to do things in transient.
Here follow some practical guidelines about how to do some common tasks using transient.
- Do NOT use loops. Loops destroy composition. Use streaming/non-determinism (multithreaded or not):
forM [1..10] $ \i -> ... use: do i <- threads 0 $ choose [1..10]
...
-- run in the current thread
-- like for i = 1 to 10...
loop = do do
name <- getLine use instead: name <- waitEvents getLine
liftIO $ print name liftIO $ print name
loop
Although getLine
does not compose with multithreading since two threads can not getLine
nicely and transient is intended to compose everything. For console IO, it is convenient to use specific transient primitives which permits the redirection of console IO to different threads and allows the creation of menus, like option
or input
- Do NOT use callbacks. De-invert callbacks with
react
:
onCallback wathever myCallback use: event <- react (onCallback wathever) (return())
myCallback event= continue event
continue event
So, if you use a framework with callbacks, which are inherently non-composable, you can make composable code by de-inverting the callbacks, so you will create modular and readable code. this expression stop the computation and execute `continue event` whenever the callback is called.
- You can Avoid IFs and be more modular (just like Haskell parsers do)
if processableByThis
then this
else that use: this' <|> that
where
this' = if not processableByThis then empty else this
- Don't use a global session state. As any global variable, that destroys composition and transient is focused on making modular software components that can be composed.
globalState= create mutable var
do do
v <- readGlobalState use: setRState initiaState
writeGlobalState v' (v :: MyType) <- getRState <|> error "no initialized state for this type??"
setRState v'
or, more Haskelly:
main= runReader initialState $ do
...
Since the state is created inside the monadic function, it is modular and composable without further ado. The runReader alternative, very common in Haskell at the time I write this, is equivalent to a mutable global state.
- Don't create big initial states, since you can create as many of them as you need at the moment you need them as long as they have a different type.
data Big= Big{
this :: This
that :: That do
... setRState (this .: This)
} use: use (this :: This)
more lines
do setRState (that :: That)
runReader bigstate $ do use that
use this use this
more lines ...
use that
more lines
use this
...
This also allows quite simple access and modification of fields without the need for lenses.
Use newtype
keyword to assign different types to simple variables.
- A pure (non-mutable) state can be defined and used the same way:
runState state $ do... use do
v <- get setState state
put v' v <- getState <|> error "MyState not initialized??"
... setState v'
...
When modifying a pure state, the changes are only accessible to the statements that are after the modification. Since in transient there are no loops and iterations are done using non-determinism, this may be confusing:
data Sum= Sum Int
do
setSate $ Sum 0
n <- threads 0 $ choose [1,2 :: Int]
Sum sum <- getState <|> error "no Sum??? I just initialized it two lines ago!!!"
setState . Sum $ sum+n
Sum sum <- getState
liftIO $ print sum
would print 1 and 2 since choose
will send 1 and 2 to the rest of the monadic sequence. But the state is pure and each of the two executions get Sum 0
as initial state for Sum
. If you repeat it with setRState
and getRState
, it will print 1 and 3.
- Do NOT fork threads with explicit concurrency. Use asynchronous primitives and applicatives:
forkIO job1;
forkIO job2; use: (,) <$> async job1 <*> job2
wait for job1 and job2
return (result1,result2)
- Do not fork threads for parallelism. Use alternative:
main= do
forkIO $ doThis
forkIO $ doThat use: main= async doThis <|> async doThat <|> more...
more...
Take care, since async creates a new thread, and every thread continues the execution until the end of the monad or empty
:
do do
forkIO $ doThis >> doOther async doThis <|> async doThat
forkIO $ doThat >> doOther use: doOther
If you don't want them to continue:
func=do do
forkIO $ doThis use: func= (async doThis >> empty) <|> (async doThat >> empty) <|> more
forkIO $ doThat
more
Otherwise, without empty
, func
on the right would return three different results: the one of doThis
, the one of doThat
and the one of more
If you want to reduce parallelism, use thread pooling by adding the threads
modifier, without changing the program code:
processing= do
results <- poolLibrary(numberOfThreads,[dothis,dothat,doOther])
mapM process results -- <- single threaded
use:
processing'= do
eachResult <- threads numberOfThreads $ async doThis <|> async doThat <|> async doOther
process eachResult -- <- still multithreaded
The first returns a list of results while the second return different results in different threads to continue the parallelism.
Use collect
if you want to collect the results and generate the list of result in a single thread:
processResults <- collect 0 processing'
0 as parameter forces collect
to wait until there are no more threads active in processing'
- Do not communicate threads (coroutines) with mutable variables using loops (loops are the end of composability) Use streaming/event vars:
someVar = new...
forkIO $ loop1 someVar
forkIO $ loop2 someVar evar <- newEVar
where use: noloop1 evar <|> noloop2 evar
loop1 var= do where
r <- wait var noloop1 evar= do
process r r <- readEVar evar
loop1 process r
... ...
Use transient exceptions to fix things and continue tasks without mixing exception code and application code
do do
myTask `onException`$ \e -> do onException $ \e -> when (tryToFix e) continue
if tryTofix e then myTask use: -- or: if tryToFix r then continue else return ()
else throw e myTask
(do
openThis
openThat `onException` $ \(e :: MyException) -> do closeThat; throwIO e)
`onException` $ \(e :: MyException) -> do closeThis; throwIO e
use:
do
openThis `onException` $ \(e :: MyException) -> closeThis e
openThat `onException` $ \(e :: MyException) -> closeThat e
Exceptions propagate back (bubble up like JavaScript events) by default trough the monad until continue
, which resume execution forward or empty
which stop executing; it is not necessary to re-throw them. The code preserves monadic composition. This makes exceptions more useful.
-
You can compose freely expressions that contain
react
async
and other asynchronous primitives (which stop the computation and return something in another thread) using alternatives and applicatives. -
Do NOT use the OOP patterns like MVC or the Actor Model. Use functions:
Client:
do do
send node message use: result <- runAt node $ local process
result <- receive node -- and run the program in both client and server
Server:
do
message <- receive
result <- process
send result
Node to node communications uses tcp.
Browser:
ajaxRequest url params \result -> process
use: result <- atServer $ local $ process params
-- compile with ghc and ghcjs
-- and run it both in browser and server
Server:
onRequest
[(url, process )
,(url2,process2)
...
Browser-server communication uses WebSockets.
Axiom widgets are coherent with the transient model:
do
text <- local $ render $ inputString (Just "rewrite this text")
`fire` OnChange
! atr "size" "80"
response <- atServer $ accessDatabase text
local $ render $ rawHtml $ p $ show response
where
accessDatabase text= local $ do
#ifndef #ghcjs_HOST_OS
databaseCode text -- will be compiled only in the server node
#else
empty -- no server-side code in the browser
#endif
The conditional compilation, as seen above, can be used to compile different local
code in the browser and server.
inputString
for example, runs empty
in the server. Its line renders a text box for 80 chars in the browser and stream responses to the server whenever the "OnChange" event is fired.
| Intro
| How-to
| Backtracking to undo IO actions and more
| Finalization: better than exceptions
| Event variables: Publish Suscribe
| Checkpoints(New), suspend and restore
| Remote execution: The Cloud monad
| Clustering: programming the cloud
| Mailboxes for cloud communications
| Distributed computing: map-reduce