[ANN] Parseff: parser combinator library for OCaml 5

davesnx · March 24, 2026, 4:37pm

oh… that’s one combinator that I wouldn’t discard adding!

EDIT: Nevermind, the usage is parsera \#or parserb

davesnx · March 24, 2026, 4:38pm

Yes, don’t disagree, but If there’s combinator style worth adding it can be always added as a separate package or even in user’s code

davesnx · March 24, 2026, 4:39pm

Thanks, would you mind opening a PR? Happy to have a Angstron (optimised) version on the repo!

yawaramin · March 24, 2026, 6:26pm

No, it’s parsera or parserb

davesnx · March 25, 2026, 9:03am

maybe ocamlformat issue? dunno.

That’s a cute suggestion, but I don’t want to add operators in parseff

davesnx · March 25, 2026, 9:52am

Share it on https://x.com/davesnx/status/2036739000271007997
Let me expand here:

I’m exploring the idea of making parseff faster, autoresearch suggested to remove effects entirely from parseff and it makes fused operations equal, but primitive operations are much faster.

Kind of loses the purpose of the name tbh, but frankly, the comparison is:

46-75% faster without effects because each combinator call is now a direct function call + DLS.get (~5ns) instead of Effect.perform + handler dispatch + continuation resume (~50-100ns)

This PR Bench angstrom: make fair and angstrom more comparable by reynir · Pull Request #6 · davesnx/parseff · GitHub changed a bit the parseff (fair) version and angstrom to be actually slower
This commit Performance optimizations: +34% json_fused, +48% arith_generic throug… · davesnx/parseff@8d83a14 · GitHub creats an optimised version of anstrom and does micro optimisations on parseff (0.3.0)

So, parseff 0.4.0 will probably ship without effects and be faster than angstron in any case, but for fused operations it will be almost the same.

Kakadu · March 25, 2026, 10:48am

It looks like the alternatives parser will be much longer without specialized effect-based backtracking combinator. Is there any chance to make effects and reimplementation of recursive decent live together?

Kakadu · March 25, 2026, 11:00am

Also, if we want seriously talk about JSON parsing performance, we need to compare with GitHub - simdjson/simdjson: Parsing gigabytes of JSON per second : used by Facebook/Meta Velox, the Node.js runtime, ClickHouse, WatermelonDB, Apache Doris, Milvus, StarRocks · GitHub too

davesnx · March 25, 2026, 12:04pm

The public interface hasn’t changed. What do you mean, exactly?

davesnx · March 25, 2026, 4:25pm

We aren’t seriously comparing json parsing perf, we are comparing similar libraries.

lambda_foo · March 31, 2026, 3:28am

Not really surprising but we should be able to do better than ~50-100ns for the effects version. Could you say which OCaml version you used and what hardware it was on?

davesnx · March 31, 2026, 9:25am

Feel free to go deeper with this branch No effects version by davesnx · Pull Request #7 · davesnx/parseff · GitHub and running those benchmarks parseff/bench at 41cc5852b3cbc78e00710b4c6727211d476d7cee · davesnx/parseff · GitHub

OCaml 5.4.0

Hardware

CPU: 2x AMD EPYC 9755 128-Core Processor
Cores/threads: 256 physical cores, 512 logical CPUs total
RAM: 2.2 TiB total (771 GiB in use)

giltho · April 17, 2026, 5:26pm

Dumb question: how does it compare to menhir in terms of performance?

davesnx · April 17, 2026, 6:38pm

Hard to say, because they aren’t comparable or at least I have no idea how to do it.

Mostly because menhir is a parser generator and it will be used inside a pipeline with a Lexer, while parseff (or any parser combinator) often is used to parse strings; you could do Source.of_function and have some lexer producing tokens but still they are solving different problems.

Any parser combinator has backtracking, and it builds up from small functions, conflicts don’t exist while menhir is more like a monolithic "table-driven shift/reduce on a stream of tokens) which doesn’t compose with the rest of your program and has an implicit error mechanism.

If you compare them somehow it will depend on the syntax as well, and in the menhir’s case you might need to ignore the lexing time? Anyway, no idea how to make justice here, hope it helps

Topic		Replies	Views
OCaml Parser combinator library as powerful as fastparse(Scala)? Ecosystem	27	4509	November 25, 2022
Is it feasible to write parsers without using polymorphic variants for AST representation? Learning	16	2211	April 25, 2018
High-performance lexing in OCaml Ecosystem performance , ocamllex	20	3286	April 1, 2021
Unicode-aware parser combinators Learning parsing	25	1329	September 13, 2023
Idiomatic Parcoom (educational parser combinators library) Community	1	188	October 14, 2025

[ANN] Parseff: parser combinator library for OCaml 5

Related topics