What are the preferred tools for doing web scraping in OCaml? I’m interested both in what’s available for automating interaction (like the Mechanize packages for Ruby and Python) as well as the HTML parsing side.
The url for lambdasoup is https://github.com/aantron/lambda-soup
I can give a big thumbs up to Lambdasoup, which is an absolute pleasure to work with. It’s worked on all the random HTML I’ve thrown at it so far… thanks @antron for releasing it!
perl4caml is a possibility here. You can use Perl libraries for scraping from OCaml code.