Stdlib.List loop unrolling

K_N · January 28, 2023, 12:30pm

The benefit of TRMC is that your function is tail recursive, so won’t blow-up the stack for large lists. Doing a tail-recursive map which retains the performances of the non-tail-recursive one is quite the challenge (the following thread has all the info). A very crude summary:

Usual List.map : fast calls (which can further be unrolled) but not tail recursive, you may blow up the stack for long lists.
Simple tail-recursive List.map by doing reverse at the end: the cost of reversing hurts quite a bit in practice and makes this non competitive for medium sized lists.
Unsafe tail-recursive List.map by using Obj.magic and casting a mutable list that is allocated and modified in place. This relies on assumptions on unsafe features, which may be invalidated if you change backend or runtime (flambda, multicore,…).
Clever safe tail-recursive List.map (as explained in the linked discussion). Very clever, needs to be hand-tuned (to recover the performances of non tail-rec) and the code is quite complex which becomes a maintenance problem, especially if you want several versions hand tuned for several architectures.

TRMC, to simplify, translates the simple natural List.map (and other similar functions) to the version that creates a mutable structure. But the translation is done by the compiler. There is still a maintenance burden (the TRMC transform itself within the compiler) but it’s more general so the cost is deemed worth it (since now you don’t have to maintain a gazillion of unsafe tail recursive functions doing Obj.magic all over the place for map, concat, etc…).

Topic		Replies	Views
Fastest implementation of stack-safe List.map, featuring Containers, Batteries, and Base Community	4	1395	June 22, 2017
Stack overflow during evaluation (looping recursion?) Learning	16	1795	April 17, 2023
A new List.map that is both stack-safe and fast Community article , performance	34	12668	February 24, 2018
Array.map vs List.map performance Learning performance	6	1301	May 14, 2024
Switch to inline record gain 25% speed up of querying in avl tree based map implementation Community	3	482	August 25, 2023

Stdlib.List loop unrolling

Related topics