Parallelization with Haskell - Easy as can be

June 07, 2009

The functional programming language Haskell provides a very easy way of parallelization.

Consider the following naive implementation of the Fibonacci function.

fib 0 = 0
fib 1 = 1
fib n = fib (n-1) + fib (n-2)

This implementation has a bad expontential time complexity, so it should be improved, for example with caching. But this is beyond the scope of this article. We just need a function that takes a while to finish.

In Haskell there are two operators that have to be used for parallelization: par and pseq. par a b is some kind of a “fork” operation: a is started in parallel and b is returned. Keep in mind that Haskell is has a lazy evaluation strategy. a is only evaluated if it is needed The function pseq a b evaluates first a then b.

Equipped with this two operations it is very easy to parallelize fib.

parfib n
  | n < 11    = fib n                        -- For small values of n we use the sequential version

  | otherwise = f1 `par` (f2 `pseq` (f1+f2)) -- calculate in parallel, return the sum

where
  f1 = parfib (n-1)
  f2 = parfib (n-2)

The code has to be compiled with the -threaded option.

$ ghc -O3 -threaded --make -o parfib ParFib.hs

The number of threads is specified at runtime with the -N command line option.

$ ./parfib +RTS -N7 -RTS

On an Intel Core i7 920 this resulted in a speedup of 4.13 for n=38. This processor has four physical cores. So this is efficient. Haskell is still one of the best programming languages.