I have an app design question about using multiple domains in, say, Eio or some other multicore library. We can do e.g.
let num_domains = Domain.recommended_domain_count () - 1
let additional_domains = (Eio.Stdenv.domain_mgr env, num_domains)
...
Cohttp_eio.Server.run ~additional_domains ...
…to make the server run on multiple domains (I assume that’s how it works).
But suppose another part of my app uses Eio.Executor_pool to run background jobs on multiple cores, and I also need to pass it similar arguments, roughly like this:
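(A minimal sketch of what I mean, assuming the Eio 1.x Executor_pool API; num_domains is the same count computed above, and the job and its weight are just placeholders:)

let () = Eio_main.run @@ fun env ->
  Eio.Switch.run @@ fun sw ->
  (* Worker domains for background jobs, drawn from the same count as above *)
  let pool =
    Eio.Executor_pool.create ~sw ~domain_count:num_domains
      (Eio.Stdenv.domain_mgr env)
  in
  (* Submit a CPU-bound job; [weight] estimates how much of one domain it uses *)
  let answer = Eio.Executor_pool.submit_exn pool ~weight:1.0 (fun () -> 40 + 2) in
  Eio.traceln "background job result: %d" answer
  (* ... the rest of the app (e.g. the HTTP server) would run here ... *)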
So do these different functions coordinate to ensure that they don't create num_domains + num_domains domains? I.e., do they check the existing number of domains before creating new ones? Or is that check guaranteed by Eio at a lower level?
Unfortunately, the answer is no. The implementation of Cohttp_eio.Server predates Eio.Executor_pool, which is why it uses the raw domain-spawning API rather than a pool. In fact, it is Eio.Net.run_server that uses the Domain_manager.run API; I think there's a good argument for changing this to use a pool (perhaps an issue for this would be good).
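To make the non-coordination concrete, this is roughly the pattern run_server-style code follows for its extra domains (only a sketch, not the actual implementation; spawn_extra_domains and handle_connections are made-up names). Each call to Eio.Domain_manager.run spawns a fresh domain with no global bookkeeping, so two subsystems doing this independently can exceed the recommended count:

let spawn_extra_domains ~sw dm n handle_connections =
  for _ = 1 to n do
    (* Each iteration forks a fiber that runs [handle_connections] in its own
       freshly spawned domain; nothing checks how many domains already exist. *)
    Eio.Fiber.fork ~sw (fun () ->
      Eio.Domain_manager.run dm handle_connections)
  done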
As a workaround, you can wrap the existing domain manager implementation in something cleverer (e.g. one backed by a shared pool).
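Since wrapping means implementing Eio's domain-manager provider interface, an even simpler stop-gap (not the wrapping approach itself, just splitting the domain budget manually between the two consumers) could look something like this; the 70/30 split and the num_* names are only for illustration:

let () = Eio_main.run @@ fun env ->
  Eio.Switch.run @@ fun sw ->
  let dm = Eio.Stdenv.domain_mgr env in
  (* One shared budget of extra domains for the whole app *)
  let budget = max 2 (Domain.recommended_domain_count () - 1) in
  let num_server_domains = budget * 7 / 10 in
  let num_pool_domains = budget - num_server_domains in
  (* The background-job pool gets its share of the budget... *)
  let _pool = Eio.Executor_pool.create ~sw ~domain_count:num_pool_domains dm in
  (* ...and the HTTP server gets the rest, e.g.:
     Cohttp_eio.Server.run ~additional_domains:(dm, num_server_domains) ... *)
  ()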
Got it. I think two tracking issues will be needed, since two APIs will change:
EDIT: I also want to say that this analysis was fairly trivial thanks to Eio’s design exposing functions which take the domain manager. I think this is a win for the capabilities-based design.
That said, Eio.Executor_pool.submit seems designed for a single subsystem that owns the pool and submits coarse-grained jobs to it, not for multiple decoupled subsystems. I can't see a way for both an HTTP server and, say, an async message-queue subsystem to share the same pool.
OK, I’ve suggested a multi-core runtime strategy, basically:
let () = Par.run @@ fun env ->
  (* ... run the app on multiple cores ... *)
  (* Distribute slices of the array across all worker domains *)
  let result = Par.sum env large_float_array in
  ...
So this takes care of starting up the recommended number of domains and running the app across all of them, while also providing a way to submit parallelized (i.e. CPU-intensive) tasks and get a promise of the result.
This is a POC right now (linked above), but I believe it's a good direction: users don't need to worry about setting up domains, they don't need to hand all of the domains over to a specific subsystem like the HTTP server, and they don't need to figure out how many domains to allocate to what.
Of course, this is not thoroughly tested or benchmarked right now; more to come. But happy to discuss more.
Sorry to barge in on this question, but I'd just like to mention that this is exactly what Miou offers with Miou.parallel:
let () = Miou.run @@ fun () ->
  let result = Miou.parallel sum large_float_array in
  ...
You can see the documentation here (with a little example): Miou.parallel. As for an HTTP server, httpcats has been released, and it actually follows the pattern you suggest.
There’s an interesting discussion going on here regarding domains vs threads as building blocks and the merits of user-defined concurrency. Admins, can we split this discussion thread up?