[ann] ocaml-xmlerr 0.08.2 available

This is an announcement for ocaml-xmlerr.

ocaml-xmlerr version 0.08.2 is available.

The main module of this package is a small module for reading XML that contains errors.

The main purpose was not really to read XML with errors, but to read HTML from the web.

At first I wrote in the read-me file that I wrote it in one afternoon, but that does not take into account that it was in fact my third attempt at making something like this.

One of the first attempts probably even took me almost a day, so we cannot say that I succeeded easily.

In the .zip archive of ocaml-xmlerr version 0.08.2, you will find the different modules reorganized into directories.

There are also two additional commands built on top of the first module.

htmlxtr is a simple extractor for HTML from a simple template.
Please read the man page for a fuller description of how to use it:

$ man ./htmlxtr.1

There is also htmluxtr, a simple extractor for .html using a simple un-template / re-template method.
Please read the man page for more information:

$ man ./htmluxtr.1

There is a new example of use provided in the ‘using’ directory.

There is a script to help you write your pattern matching of XML fragments.
Because the base module provides a flat list of tags and contents, the pattern matching is not based on a tree structure.
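
To give an idea of what pattern matching on a flat list can look like, here is a minimal sketch. The `item` type and its constructors (`Tag`, `ETag`, `Data`) are hypothetical names chosen for this illustration; the actual type in xmlerr may well differ, so check the module before reusing any of this.

```ocaml
(* Hypothetical item type, for illustration only:
   the real xmlerr type may differ. *)
type item =
  | Tag of string * (string * string) list  (* opening tag and its attributes *)
  | ETag of string                          (* closing tag *)
  | Data of string                          (* text content *)

(* Collect (href, label) pairs for every <a href="...">label</a>
   sequence found in the flat list. No tree is needed: we only
   match three consecutive items. *)
let rec links = function
  | Tag ("a", attrs) :: Data label :: ETag "a" :: rest ->
      (match List.assoc_opt "href" attrs with
       | Some href -> (href, label) :: links rest
       | None -> links rest)
  | _ :: rest -> links rest
  | [] -> []
```

With this approach a missing or extra closing tag elsewhere in the page does not matter, since only the local sequence of items has to match.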

There is now also an additional module, inside the addon directory, to convert this flat-list structure into a tree structure. The report module has not been widely tested yet.
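
The general idea of such a conversion can be sketched with a stack of open tags. This is not the code of the addon module: the types `item` and `tree`, and the function `to_forest`, are hypothetical names used only for this example.

```ocaml
(* Hypothetical types, for illustration only:
   NOT the definitions used by xmlerr or its addon module. *)
type attrs = (string * string) list

type item =
  | Tag of string * attrs   (* opening tag *)
  | ETag of string          (* closing tag *)
  | Data of string          (* text content *)

type tree =
  | Elem of string * attrs * tree list
  | Txt of string

(* Convert a flat list into a list of trees.
   [acc] holds the children of the current element, in reverse order.
   [stack] holds the open elements, each with the children its parent
   had collected before it was opened. *)
let to_forest (items : item list) : tree list =
  (* At end of input, close every tag that is still open. *)
  let rec close_all acc = function
    | [] -> List.rev acc
    | (name, attrs, parent_acc) :: rest ->
        close_all (Elem (name, attrs, List.rev acc) :: parent_acc) rest
  in
  let rec loop acc stack = function
    | [] -> close_all acc stack
    | Data s :: tl -> loop (Txt s :: acc) stack tl
    | Tag (name, attrs) :: tl -> loop [] ((name, attrs, acc) :: stack) tl
    | ETag name :: tl ->
        (match stack with
         | (open_name, attrs, parent_acc) :: rest when open_name = name ->
             loop (Elem (name, attrs, List.rev acc) :: parent_acc) rest tl
         | _ -> loop acc stack tl)  (* stray or mismatched closing tag: skipped *)
  in
  loop [] [] items
```

This sketch tolerates two kinds of errors: a closing tag that does not match the innermost open tag is skipped, and tags left open are closed at the end of the input. It does not know about HTML elements that are never closed (like `<br>`), which would swallow the items that follow them.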

This is not professional quality.

If I’m not mistaken, “amateur” is pejorative in both French and English, except when it’s associated with “astronomer”.

http://decapode314.free.fr/ocaml/xmlerr/

PS: if you edit my posts, I would prefer that you write the links as normal links (like above), please.
PS-2: I haven’t been able to pattern-match all the opam packages with uxtr; the total doesn’t match, I only find 38_000.

With xtr, if you replace the tab character ‘\t’ with a comma ‘,’, you almost get a .csv file.
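
The "almost" comes from fields that themselves contain a comma or a double quote: those need quoting to give a valid .csv. Here is a minimal sketch of the conversion for one line, assuming the output is one tab-separated record per line. The function names `quote_field` and `tsv_line_to_csv` are made up for this example, and the quoting follows the usual CSV convention (RFC 4180 style).

```ocaml
(* Quote a field when it contains a comma, a double quote or a line
   break; inner double quotes are doubled. *)
let quote_field (s : string) : string =
  if List.exists (String.contains s) [','; '"'; '\n'; '\r'] then begin
    let b = Buffer.create (String.length s + 2) in
    Buffer.add_char b '"';
    String.iter
      (fun c ->
         if c = '"' then Buffer.add_string b "\"\""
         else Buffer.add_char b c)
      s;
    Buffer.add_char b '"';
    Buffer.contents b
  end else s

(* Turn one tab-separated line into one comma-separated line. *)
let tsv_line_to_csv (line : string) : string =
  String.split_on_char '\t' line
  |> List.map quote_field
  |> String.concat ","
```

For lines where no field contains a comma or a quote, this gives the same result as simply replacing each tab with a comma.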