Soupault is the first (to my knowledge) website generator that exploits the fact that well-formed HTML is machine readable and transformable (and thanks to @aantron’s lambdasoup it’s quite easy to do).
It can do things like “use the first
<h1> for the page title” or “insert output of
date -R into the
<time> element no matter where it’s in the page”.
- No templates, no themes, no front matter. You tell it where to insert stuff or what to extract using CSS selectors.
- Built-in ToC, footnotes, and breadcrumbs.
- Directories are site sections and can be nested.
- Extracted metadata can be exported to JSON and fed to external scripts for creating section indices or custom taxonomies.
- Configurable preprocessors for pages in formats other than HTML.
Soupault can be a drop-in automation tool for existing websites: the directory structure is fully configurable, clean URLs are optional, and it can preserve paths down to file extensions.