OCaml 64bits on 32bits platforms

EduardoRFS · October 13, 2021, 7:32pm

Proposal

For a long time OCaml 32bits is not exactly well maintained from a community point of view and that happens mostly because the memory model is different.

Couldn’t we have the 64bits memory model running on 32bits platforms? By that I mean, on 32bits platforms the word size is now 64bit. I understand that it will imply some performance loss and double memory usage, especially with multicore. But that would mean zero support required by the community.

Why not an RFC?

I just want to discuss the idea, and writing a proper RFC takes a LOT of time.

Performance

And I believe there is some ways to mitigate the performance problems. And even improve it for 64bits platforms.

Multicore Locking

As 32bits platforms may not have atomic writes to 64bits values, locking may be required to mutable fields under multicore, this could probably be mitigated by having a flag in the compiler --no-multicore which would enable a GIL.

Int31.t

Because 32bits platforms generally have fewer registers(looking at you i386) we could provide an Int31 module which on 32bits platforms will provide information to keep basic integer operations fast and use a single register if more than 32bits is not required.

This is also the case for pointers in 32bits platforms by extracting the data from the typer we could use a single register.

Int31array.t

Similar to Floatarray, a new kind of runtime tag, where every field uses only 32bits, it can then be used to describe Int31array.t.

On 64bits platforms a block with this tag will still always endup having a true size being a multiple of 64bits as that is required to keep memory alignment. So size 1 = 64bits, size 2 = 64bits, size 3 = 128bits. This is not required in 32bits platforms, but may be desirable for FFI reasons and avoiding a couple #ifdef in the codebase.

On 32bits platforms we can have another float array like optimization, where int31 array will automatically be packed, I would say if floatarray optimization is enabled then this can also be enabled for 64bits platforms without performance loss.

This tag can also be used for records { x: int31, y: int31 } will use just 64bits. Can also be done for 64bits platforms, where this record will only use a single word.

On 32bits platforms any record containing pointers can be treated as 32bits so that ptr array or { a: ptr, b: ptr } will be packed, this may not be desirable as it will break the FFI and the goal here is full compatibility. But it can probably be provided as a compiler flag to recover even more of the performance lost.

float32 and Float32array.t

If we provided a float32 type this could be used for more efficient memory usage on data structures on both 64 and 32bits platforms.

It behaves identically to float(but 32bits), but when it’s possible to be packed it will be packed as a Float32array.t instead of Floatarray.t, also the same for records, { x: float32, y: float32 } uses only 64bits of memory.

Another possible optimization would be to contain a tag for 32bits box, so that it uses half memory on 32bits platforms, on 64bits platforms it needs to use 64bits to be compatible with the memory model but it opens rooms for further optimizations. It may not be desirable for FFI reasons but may be interesting to have under a compiler flag.

Implementation

Yes, I understand that it is a lot of work. But most of it seems like it can be done without disrupting the OCaml core development. And in small incremental steps.

vlaviron · October 14, 2021, 9:32am

Could you provide some evidence of this ? In my experience OCaml software works reasonably well on 32-bit architectures, and the main issue where maintenance is concerned is the lack of easily available 32-bit hardware for testing, which your proposal does not address.

If your aim is to let all OCaml programmers assume that the word size is 64 bits, this is not going to help much as the most commonly used 32-bit platform, js_of_ocaml, doesn’t seem to be handled by your proposal.

I’ll also point out that the layout optimisation proposals are similar to the existing RFC about unboxed types (Unboxed types proposal by stedolan · Pull Request #10 · ocaml/RFCs · GitHub)

dinosaure · October 14, 2021, 10:25am

Could you provide some evidence of this ? In my experience OCaml software works reasonably well on 32-bit architectures, and the main issue where maintenance is concerned is the lack of easily available 32-bit hardware for testing, which your proposal does not address.

I would like to go further and say that ci.ocamllabs.io used by MirageOS projects proposes a CI with 32-bits architecture which help us to improve the support of this platform along on our projects (and, as far as I can say, the majority of our projects work on 32-bits platform).

It’s even more true if we want to support IoT for MirageOS.

EduardoRFS · October 14, 2021, 11:42am

Jane Street packages: dropping support for 32-bit
Tests don't work on `4.11.0+32bit` · Issue #150 · anmonteiro/ocaml-h2 · GitHub
Tezos is not expected to work on 32bits at all.
Cross compiling from to 32bits can only be made from a 32bits platform

It seems clear to me that your average package is not tested on 32bits platforms. Also as someone who did work on testing a bunch of stuff for armv7, even if stuff builds it doesn’t actually is expected to work.

i386 is available on any CI, so I fail to see how 32-bit hardware is lacking for testing. And most arm64 platforms you could use aarch32 mode, but that’s not gonna work forever as new arm64 platforms don’t even contain a 32bits mode including the latest ARM and Apple chips.

As you mentioned, JSOO is commonly used, but an important detail is that it is not a 32bits platforms in the same way as native, actually the ints are 32bits, so it’s not 2 platforms, is closer to 3, and here I’m proposing to reduce to 2.

I personally believe that 64bits and 32bits OCaml should have 32bits unboxed ints, by default but that is probably not happening as it would be a big breaking change.

Yes I understand that, a small difference is how it fits in the ecosystem, which is closer to how floats and float array work then to how unboxed types are used. But I believe the runtime change to be the same.

Maelan · October 14, 2021, 2:51pm

Minor remark:

I would strongly oppose that. 2³² is easily overflowed in real life. For most purposes I would be happy with e.g. 48-bit integers. but 32 bits are just not enough (which is actually an argument in favor of your original proposal, although I’m not convinced by its chances of adoption).

Topic		Replies	Views
Should 32-bit OCaml's header format allow arbitrary sizes? Ecosystem compiler , low-level , header	1	836	August 5, 2017
How hard do people work to make their C-based (FFI) OCaml libraries work on 32-bit OCaml? Ecosystem	1	220	February 6, 2025
32-bit native code support for OCaml 5+ Community ocaml , machine-learning	21	1353	July 12, 2023
OCaml 32bits memory limit Learning	7	992	June 26, 2020
Multicore OCaml: March 2022 Community multicore , multicore-monthly	8	3386	July 23, 2022