A line-by-line translation of the OCaml runtime from C to Rust

ETCs aside, there is some apparent (but small) performance improvement in the Rust version along the GC slow path that I haven’t pinned down. You kind of need benchmarks that specifically grind the GC to see it though.

It would be interesting to tease out what it is, exactly, and see if the C version can do it too as -O3 and -flto does not close the gap.