Where to find complete documentation on syntax overloading in OCaml?

chshersh · October 25, 2023, 8:28am

Hi everyone,

I’m reading source code of OCaml projects and sometimes I see some interesting syntax overloading features.

For example, from the ocaml-non-empty-list package, I learned that you can overload the list literal syntax by naming your constructor as (::):

(** The non-empty list type. The use of the [( :: )] infix constructor
    allows the usage of the built-in list syntactic sugar provided by OCaml. 
    
    For example, a singleton is given by [ [1] ]. A list containing 2 elements is given by
    [ [1; 2] ]. *)
type 'a t = ( :: ) of 'a * 'a list [@@deriving eq, ord, show]

But for the love of my life, I can’t find any documentation about this syntax. My Google skills fail me on this one

Could you help me with some links?

Other syntax overloading features I found:

Overloading let+, and+, let*, and* (and more general let<op>)
- Documentation: 23 Binding operators
- Example: https://github.com/imandra-ai/ocaml-opentelemetry/blob/6362bc55eff6198fe0b4e5bdc13cbc439666e0c3/src/opentelemetry.ml#L1134-L1142
Overloading array indexing operators .() and .()<-
- Documentation: 19 Extended indexing operators
- Example: https://github.com/ocaml/ocaml/blob/f9371a2ea294f75da793451ef7be5dc69aad6b53/testsuite/tests/lib-dynarray/heap_sort.ml#L23-L25

Would love to gather more links!

otini · October 25, 2023, 9:04am

I also often use this table from the manual to know the relative precedence of operators. It also serves me when I want to (re)define an operator: the first signs of the operator have an effect on the precedence.

As @polytypic said, this is not overloading, but a mere (re)binding of identifiers. Operator-like identifiers just happen to be treated differently by the parser.

~~You can also give operator-like names to type constructors,~~ (edit: not really, see below) in which case there will some form of type-based disambiguation, true, but I wouldn’t call it overloading. I find it confusing as it brings to mind C++ where you can overload any function, which is definitely not the case in OCaml.

chshersh · October 25, 2023, 10:42am

I see. It makes sense.

I called it “syntax overloading” because in my understanding, the meaning of a list literal like [1; 2; 3] depends on the (::) in scope and type, so in some sense you get an overloaded list literal (which is part of the syntax).

Still, I wonder, if there’s only (::) for lists or something similar for tuples or something else for something else? And where this particular thing is documented?

It’s also fine if the documentation is not complete but I wanted to ask first if anyone knows any available resources

otini · October 25, 2023, 11:45am

I have to backtrack on what I said:

Actually, you can only redefine a few existing constructors. I have not found an official list, but the type constructors that you can shadow seem to be: ::, [], (), true and false. Yes, you can redefine all those, so out of some arguably legitimate uses for :: and [], there is potential for obfuscation.

# type void = | (* Empty type *);;
# type _ tuple = () : void tuple | (::) : 'a * 'b -> ('a * 'b) tuple;;
type _ tuple = () : void tuple | (::) : 'a * 'b -> ('a * 'b) tuple

# let a :: b :: () = if true then 1 :: 2 :: () else assert false;;
val a : int = 1
val b : int = 2

All other operator-like identifiers cannot be used as constructors, so although you can use them to construct values, you can’t use them in patterns:

# let ( $-$ ) x y = x :: y;;
val ( $-$ ) : 'a -> 'b -> ('a * 'b) tuple = <fun>

# 42 $-$ 12 $-$ ();;
- : ((int * int) tuple * void tuple) tuple = (::) ((::) (42, 12), ())

lukstafi · October 25, 2023, 12:05pm

The one not yet mentioned in the discussion is that you can redefine .[] and .[]<- as long as you do it inside a module named String, AFAIR. There’s a thread here on Discuss on how such valuable lexical estate is wasted on strings, but I can’t find it.

P.S. .[]<- is not available anymore: Syntaxic sugar: String.set → Bytes.set? - Learning - OCaml

yawaramin · October 25, 2023, 5:03pm

The list, unit, and other built-in types are documented in the OCaml Manual chapter on the core library (not Jane Street Core, OCaml core): OCaml - The core library

cuihtlauac · October 26, 2023, 6:57am

Hi @chshersh

The documentation update we’ve started releasing includes a new tutorial on operators:

I’ve tried summarizing and grouping what’s in the reference manual and a couple of blog posts, in a readable form. Alas, it doesn’t cover the case of reusing the list syntactic sugar, yet. I’d be super happy if somebody could contribute some text. Up to my findings, documentation is available on unary operators, binary operators, indexing operators and custom binders. But I couldn’t find anything about what you mentioned. Interestingly, if you ask ChatGPT about that, it falls into a bad trip, which suggests it wasn’t fed with anything on that matter, probably because there isn’t anything available.

Also, I searched https://sherlocode.com/ with something like this:

\[^a-z_\]type \.\*=\.\*( \[^A-Z\]

It seems to indicate the list syntax is the only one we can play with (although this regexp needs to be improved)

P.S. The tutorial is brand new; any feedback would be appreciated

cuihtlauac · October 26, 2023, 7:37am

This is not overloading it is shadowing.

With overloading it would be possible to use both versions of ( :: ) in the same scope, and the “right” one would be picked, based on type. But once you have defined your own operator-constructor, it masks the previously available one, unless you use a type annotation

# type ('a, 'b) foo = ( :: ) of 'a * 'b;; 
type ('a, 'b) foo = (::) of 'a * 'b

# let bar = function x :: y -> Some (x, y);;
val bar : ('a, 'b) foo -> ('a * 'b) option = <fun>

# let tail = function ((_ :: u) : 'a list) -> Some u | _ -> None;;
val tail : 'a list -> 'a list option = <fun>

chshersh · October 26, 2023, 8:10am

@cuihtlauac Thanks a lot for this follow-up!

Indeed, writing a blog post with some info on operators is nice. But improving the official documentation would be even more awesome!

cuihtlauac · October 26, 2023, 8:17am

A clarification: what we’ve published is not a blog post; it is a tutorial that is part of the official documentation.

The manual provides authoritative reference information. Tutorials are for learners. That’s two different use cases. I agree something seems to be missing in the manual. But something else was also missing for learners.

Topic		Replies	Views
About Ocaml list type Learning	5	577	April 5, 2020
Idea: OCaml Symbol Glossary Community	7	524	December 14, 2023
Conventions for let+, let*, etc Learning	4	2823	December 15, 2020
[ANN] Operator lookup tool for OCaml Community	12	3285	December 5, 2020
OCaml's constructor and record field disambiguation feels like a bit of ad-hoc polymorphism Learning	1	495	May 30, 2023

Where to find complete documentation on syntax overloading in OCaml?

Related topics