Ignoring Lwt values

sapristi · August 29, 2019, 9:59pm

Basically my question is: what happens when one does

ignore (x : 'a Lwt.t)

I am writing a logging module, and I want it to be able to send messages through a Lwt_pipe and to stdout at the same time, e.g.

let pipe =  Lwt_pipe.create ~max_size:1000 () in
let log s = 
Lwt_io.write Lwt_io.stdout s
<&> Lwt_pipe.write_exn pipe s

but most of the time I call the log function, I am not inside of a Lwt context, so that the returned unit Lwt.t is ignored.

This seems to works (surprisingly) quite well, but I don’t understand why. Is there anything I am missing here ? (in the context where I have tested is, the program’s main loop is Lwt).

Leonidas · August 30, 2019, 2:22pm

It might work since the Lwt thread is possibly registered in the scheduler upon creation and since Lwt is eager it will try to execute it at the time the computer reaches the line, but I don’t think it is considered good style and has a likelihood of breaking.

sapristi · August 30, 2019, 6:34pm

I can understand that, but it still seems much better to me than (re)writing a somewhat big code base to be Lwt compatible.

Yes, but how so exactly ?

Leonidas · August 30, 2019, 7:42pm

I would disagree to some point, since you write something fragile that only accidentally works. I think rewriting a codebase so it actually works for sure is probably a better way to not be surprised by odd behaviour from your code in the long run.

I understand that it is painful that’s why I would advise against introducing Lwt or Async late into a programs life. We have some blocking code at work and we know that adding async to it be a major effort that is not worth it. You can of course just try to carefully write parts using Lwt with proper blocking and then encapsulate that by running the Lwt main thread in a blocking way.

You’re sort of depend on the implementation of the scheduler. It could change how things are implemented and also change in which order your promises are executed if at all. If you never wait on them they might get executed whenever so you don’t really have ordering guarantees that thread 1 executes before thread 2, thus you might run into odd out-of-order behaviour that is difficult to debug because it might also just work correctly in some or most of the time.

To really understand what exact issues you might run into you probably have to ask @antron or @jeremiedimino.

antron · August 30, 2019, 8:02pm

It is indeed ok to ignore an 'a Lwt.t. However, we have a function Lwt.async for this purpose.

The difference between ignore p and Lwt.async (fun () -> p) has to do with rejection handling: p can be rejected with an exception, in which case the exception is stored in p. ignore p silently forgets that stored exception. Lwt.async (fun () -> p) will pass the exception to !Lwt.async_exception_handler (which kills your program by default, but prints a stack trace).

In any case, it is desirable to either (1) make sure that the computation that resolves p doesn’t raise any exceptions, or (2) you handle rejection of p with Lwt.catch.

antron · August 30, 2019, 8:13pm

As for why the code works: computation in Lwt eager, as @Leonidas mentioned. So, just calling the Lwt I/O functions is enough to start I/O operations. As long as you call Lwt_main.run somewhere, those computations can eventually complete. Note that Lwt_main.run does not start operations — it’s only necessary to complete them.

When each operation completes, Lwt_main.run resolves its promise ('a Lwt.t), which calls more callbacks, which eagerly start more I/O operations, and so on.

It’s not necessary to depend on any of the promises returned. Each I/O operation is internally registered with Lwt_main.run by the function that starts the operation. Somewhat more precisely, each function registers the promise that should be resolved when the operation completes.

So, you can ignore these promises completely in your own code, as long as you call Lwt_main.run somewhere.

There is, of course, a lot more detail to this in the implementation, but this is a fairly accurate mental model of the API

andyman · August 30, 2019, 10:59pm

Awesome explanation. I found futures(/threads) most confusing until I got this into my head.

sapristi · August 31, 2019, 4:20pm

Ok thank you both for your input, I’m getting a much better picture now.
So, as i understand it, the promise dependency is here mainly to manage the scheduling of the I/O operations, and a promise can be ignored (or rather “executed asynchronously”) if nothing depends on it.

I didn’t notice at first, but in my usecase there is one (big) downside at ignoring promises, which is that the ignored promise will be ran only after the “main” promise completes, so that

logs display would be delayed
and, more problematic, that the log promises would fill up the memory.

So I think I’ll just give up with that for now (the initial aim was to send log messages through a websocket, but I can live without that).

antron · August 31, 2019, 5:55pm

Yes, the promises are for you to sequence operations one are after another, so scheduling in the sense of how you can compose your code, rather than scheduling in terms of the internals of Lwt.

That shouldn’t, in general, be the case. The reasons I can immediately think of why this is happening would be:

Your “main” promise is resolved immediately (synchronously), so it is “fast.”
Your logging operations are buffering or queuing, so they are “slow.”

But, yes, with the way Lwt_main.run works, if there are I/O operations whose promises you are not asking Lwt_main.run to wait on (by calling Lwt_main.run on a promise that depends on them), Lwt_main.run may return before those operations complete, because it returns when just the promised passed to it resolves.

sapristi · September 1, 2019, 2:50pm

An approximation of my program would be this (run by opium) :

for i = 0 to 100 do 
  Unix.sleepf 0.01;
  Lwt_io.write Lwt_io.stdout (Printf.sprintf "This is 1;  message %i \n" i);
  Unix.sleepf 0.01;
  Lwt_io.write Lwt_io.stdout (Printf.sprintf "This is 2;  message %i \n" i);
  Unix.sleepf 0.01;
done;
|> Lwt.return

So a solution might be pass promises returned by log events to Lwt.main_run ? Like writing unit for each log to a Lwt_pipe, that Lwt.main_run would read ?

antron · September 1, 2019, 3:08pm

I’m assuming that at the end of the program, it’s

|> Lwt.return
|> Lwt_main.run

since the code block doesn’t show where Lwt_main.run is called If that’s so, then your main promise is indeed "synchronous with regard to Lwt I/O (i.e. it won’t ever try to fully do Lwt I/O), because Lwt_main.run is handed an already-resolved promise (since you create it with Lwt.return). That means Lwt_main.run won’t do anything, i.e. it won’t try to resolve any of the other promises, that are associated with I/O, that may have completed. If you either change Unix.sleepf to Lwt_unix.sleep and depend on those promises in Lwt_main.run, or introduce any other Lwt I/O into the flow of your program (even a Lwt_unix.sleep right before Lwt_main.run may do), some or all of your Lwt_io.writepromises may actually have a chance to be resolved byLwt_main.run, thus Lwt_main.run` will drop its references to them.

For that to work, you would need to know when the last write is in the pipe. In that case, you may as well use a single promise to signal the last write, and have Lwt_main.run wait on that.

sapristi · September 1, 2019, 5:36pm

Ok thanks a lot, so e.g. this seems to achieve what I wanted !

for%lwt i = 0 to 100 do
    (
      Unix.sleepf 0.05;
      Lwt.async (fun () ->Lwt_io.write Lwt_io.stdout (Printf.sprintf "This is 1;  message %i  \n" i ) );
      Unix.sleepf 0.05;
      Lwt_io.write Lwt_io.stdout (Printf.sprintf "This is 2;  message %i  \n" i) ;
      Lwt_io.flush_all ()
    )
  done
|> Lwt_main.run

Ok, so this is why the following, which I had previously tried, did not work ?

for%lwt i = 0 to 100 do
    (
      Unix.sleepf 0.05;
      Lwt_io.write Lwt_io.stdout (Printf.sprintf "This is 1;  message %i  \n" i );
      Unix.sleepf 0.05;
      Lwt_io.write Lwt_io.stdout (Printf.sprintf "This is 2;  message %i  \n" i);
      Lwt.return ()
    )
  done
|> Lwt_main.run

So to resolve asynchrounous pending I/O promises, Lwt_main.run needs to depend on a I/O promise (EDIT: or Lwt_unix.sleep ?), but how does this generalize to other kind of operations ? E.g if I declare asynchronous write promises to a Lwt_pipe or Lwt_stream instead of I/O, how would I force these promises to be resolved ?

antron · September 1, 2019, 6:17pm

if you look at that code, every iteration, it synchronously does some blocking Unix sleeps, synchronously spawns some I/O computation, synchronously (immediately) returns a resolved promise. So, you end up with 101 iterations of executing synchronous code, and then pass an already-resolved promise (roughly, the last Lwt.return) to Lwt_main.run, so Lwt_main.run doesn’t try to finish any of the I/O, and returns right away. for%lwt doesn’t drive any Lwt I/O, only Lwt_main.run does that.

In the first code block in your comment, you have the for%lwt waiting on a non-resolved promise in each iteration. This means that on the first iteration, you construct a non-resolved promise, and ~~that first promise is what~~ for%lwt passes an non-resolved promise into Lwt_main.run. That promise from the first iteration doesn’t resolve until the last iteration completes (that’s what for%lwt does), so Lwt_main.run ends up waiting for the flush_all from iteration 101.

To completely guarantee that a promise is resolved before Lwt_main.run returns, you have to pass the promise, or a promise that depends on it, as the argument to Lwt_main.run. Note that you aren’t forcing the promises to become resolved. You are just forcing your program to wait until they become resolved.

antron · September 1, 2019, 6:20pm

Another way of thinking about for%lwt is that it tries to do as much work as it can up front, and then evaluates to either the final resolved promise (if it did all the work), or to the first pending promise it hits along the way. In that second case, it evaluates to a pending promise that represents the rest of the computation inside the for%lwt.

Topic		Replies	Views
Code smell? Lwt.async @@ fun () -> Learning	4	517	February 6, 2023
Lwt alternative to async.pipe Ecosystem lwt , cohttp , async	3	970	February 13, 2021
Best practice on mixing Async and Lwt? Ecosystem lwt , async	15	2525	November 23, 2019
Lwt: Why does this piece of code work !? Learning lwt	10	1326	November 23, 2022
Define callback function (int -> int -> unit) that can pause lwt thread it is called from? Learning lwt	0	557	May 19, 2022

Ignoring Lwt values

Related topics