Value guaranteed to be set after initialization

incomplete · July 25, 2017, 2:24am

I keep seeing this problem, chances are you might also have seen it, it often happens when initializing something, and is not specific to OCaml. The problem can be summarized as this:

When the application is starting, you often need some values to proceed to the next step, (these values can come from a config file, or from a HTTP request, etc.), if these values are not there, then there is no strictly better alternatives than terminating the application. (an example: you will need the password to the database if your application is database-focused and non-interactive).

Solution 1, is to use a ref to an option, it’s doable, but as far as I can see, it’s a bit of stretch, because the value will always be Some _ after the first time you turn it from None to Some _, and all codes that use this value has to match on None, you ends with lots of impossible branches if this value gets used a lot.

Solution 2, is to wrap solution 1 with a function like this:

  let get_value r =
    match !r with
    | Some x -> x
    | None   -> failwith "yay! null reference"

but this solution kinds of introducing the null back into runtime.

I guess what I’m looking for is some kind of “inference over time”: after a certain point in the execution of a program, some nullable values are guaranteed to be non-null.

Does something for this kind of purpose exist? It’s painful when you don’t know what to type into the Google search box.

Sometime I can rearrange the code flow to avoid this, (ending with more code), sometime I have no better alternatives than the solutions above.

antron · July 25, 2017, 2:49am

lazy may be the right answer for some situations: https://caml.inria.fr/pub/docs/manual-ocaml/libref/Lazy.html.

Otherwise, I commonly write a function for requiring optional values with type require : 'a option ref -> 'a, that raises an exception if the optional value is None. Sometimes I give it a string first parameter, that I partially apply, to be able to trace more easily what happened.

yawaramin · July 25, 2017, 5:04am

This suggests that you need a function. Specifically, ‘next step’ needs to be a function that takes the value as an input. This is the most surefire way of statically guaranteeing that the value will be available.

incomplete · July 25, 2017, 7:30am

Just tried refactoring with lazy, some other issue prevents me from adopting this method.

(Guess it’s not quite the same with what I imagined, which is a placeholder, holding zero or one value, and the compiler or the library guarantees that by the time the value gets used, it has already been set, (actively by the programmer, not set in the way lazy sets it, which is, oh, that value is a thunk, I, the lazy, should evaluate and cache it).

It’s more like a nullable value with a proof says that, ok, I see sometimes the value can be null, but when this thing is null, it’s never “read from”, so it’s ok, I, the compiler, gonna allow this.
)

I think require is not that elegant but perfection is not what this world made of, and it’s practical and gets the job done.

incomplete · July 25, 2017, 7:32am

You mean, if I need a config value in f config, I should write f (config ())?

Yaron_Minsky · July 25, 2017, 11:20am

This idiom comes up often enough that Core_kernel has a module for it, called Set_once.

github.com

janestreet/core_kernel/blob/master/src/set_once.mli

(** A ['a Set_once.t] is like an ['a option ref] that can only be set once.  A
    [Set_once.t] starts out as [None], the first [set] transitions it to [Some], and
    subsequent [set]s fail. *)

open! Import

type 'a t [@@deriving sexp_of]

include Invariant.S1 with type 'a t := 'a t (** Passes when unset. *)

val create : unit -> _ t

val set     : 'a t -> Source_code_position.t -> 'a -> unit Or_error.t
val set_exn : 'a t -> Source_code_position.t -> 'a -> unit

val get     : 'a t -> 'a option
val get_exn : 'a t -> Source_code_position.t -> 'a

val is_none : _ t -> bool
val is_some : _ t -> bool

This file has been truncated. show original

y

incomplete · July 26, 2017, 12:35am

Nice try, the genuine Yaron Minsky would have chosen the user name “Yaron_minsky”, details man, details.

yawaramin · July 26, 2017, 10:31pm

f config should be enough.

E.g.,

let main config =
  let db =
    Db.connect
      ~url:(Config.url config)
      ~user:(Config.user config)
      ~password:(Config.password config) in

  (* ... your program ... *)

let () =
  let config = Config.get () in
  main config

The point is, your requirement that you have the config value before you can proceed to the next step, your main program, can be encoded as a parameter to the main program (which can then be a function). Any time you have this kind of requirement, you can use this technique.

Yaron_Minsky · July 28, 2017, 5:37pm

This is a good point. Things like Set_once are definitely useful, but they come up rarely: sometimes when there are complex server states with mutual references that are hard to initialize simultaneously. But most of the time, you can and should write your code to avoid this kind of thing.

Topic		Replies	Views
C pointers on OCaml side and checks for null Learning ffi	1	197	February 2, 2025
Default of Option.value? Learning	1	252	May 8, 2024
Basic questions in ocaml Learning	11	10027	November 7, 2019
Checking value of a variable Learning	1	1104	March 25, 2021
Thoughts about adding operations to opaque types Learning	7	211	January 21, 2025

Value guaranteed to be set after initialization

Related topics