[ANN] Parallel and distributed execution of command lines, pardi!

Dear OCaml community,

I am pleased to announce the first release of pardi (which the community recently helped to debug):

Pardi is a command line tool to parallelize programs which are not parallel;
provided that you can cut an input file into independent chunks.

For example, to compress a file in parallel using 1MB chunks:

pardi -d b:1048576 -m s -i <YOUR_BIG_FILE> -o <YOUR_BIG_FILE>.gz \
        -w 'xz -c -9 %IN > %OUT'

Using the right option, you can cut an input file by lines (e.g. SMI files),
by number of bytes (for binary files),
by separating lines verifying a regexp (quite generic)
or by a block separating line (e.g. MOL2/SDF/PDB file formats).

If processing a single record of your input file is too fine grained,
you can play with the -c option to reach better parallelization
(try 10,20,50,100,200,500,etc).

pardi ...
  {-i|--input} <file>: where to read from (default=stdin)
  {-o|--output} <file>: where to write to (default=stdout)
  {-n|--nprocs} <int>: max jobs in parallel (default=all cores)
  {-c|--chunks} <int>: how many chunks per job (default=1)
  {-d|--demux} {l|b:<int>|r:<regexp>|s:<string>}: how to cut input 
  file into chunks (line/bytes/regexp/sep_line; default=line)
  {-w|--work} <string>: command to execute on each chunk
  {-m|--mux} {c|s|n}: how to mux job results in output file
(cat/sorted_cat/null; default=cat)

Pardi should be available soon in the opam repository.