Why is a hand-written equal function faster than ( = )?

giltho · April 22, 2022, 10:04pm

Hi!

The documentation of ppx_deriving explains that the equal and compare functions derived are faster than the usage of =. Is it really the case? And if so, why? I haven’t been able to find out.

Thanks!

OCamlUser · April 23, 2022, 2:10am

I believe this is because the default implementations are calling the polymorphic variants of those functions and those are very heavy. A specialized implementation can be much smaller and potentially inlined or have other optimisations.

For example here’s the polymorphic versions implementations:

github.com

ocaml/ocaml/blob/001997e81342fd0d321fd877b73608150601e7d9/runtime/compare.c

/**************************************************************************/
/*                                                                        */
/*                                 OCaml                                  */
/*                                                                        */
/*             Xavier Leroy, projet Cristal, INRIA Rocquencourt           */
/*                                                                        */
/*   Copyright 1996 Institut National de Recherche en Informatique et     */
/*     en Automatique.                                                    */
/*                                                                        */
/*   All rights reserved.  This file is distributed under the terms of    */
/*   the GNU Lesser General Public License version 2.1, with the          */
/*   special exception on linking described in the file LICENSE.          */
/*                                                                        */
/**************************************************************************/

#define CAML_INTERNALS

#include <string.h>
#include <stdlib.h>
#include "caml/custom.h"

This file has been truncated. show original

Here’s a small blog post by Jane Street on the topic:

giltho · April 23, 2022, 1:51pm

Hi!

Thank you for your answer.
I had read that article, but it is still not exactly clear why this would be slower.

I understand that equal functions can be manually optimised of course, but I am talking about the functions derived by ppx.
Polymorphic compare will compare tags, and if tags are equal, recursively compare the fields. But so will the equal functions derived by ppx won’t it?

But I think I get my answer from the code you linked however: polymorphic compare also has to decide on the type of what it’s comparing before being able to perform comparison, which obviously is slower. I could have guessed it probably hahaha

Thank you!

vlaviron · April 23, 2022, 4:49pm

There are two main reasons why the polymorphic comparison is slower. The first one is that it’s implemented in C, not in OCaml. In bytecode it’s not really worse (in fact, it’s likely better in many cases) but for native code switching from OCaml code to C code is relatively slow, so if your comparison function isn’t too complicated it’s usually faster to use a pure OCaml version. Plus, a pure OCaml function can be considered for inlining, which can again improve performance

The second reason is specialisation. The polymorphic version has to consider all possible cases that are valid for any given type, while a hand-written one (or ppx-generated one) only has to deal with the cases allowed for the particular type it’s written for.
It’s not as much a problem as the other point though, in particular since for a number of base types the specialisation is performed automatically by the compiler (i.e. Stdlib.compare (x : int) y is reasonably fast). Even when that doesn’t work, it’s usually only a few comparisons that you can skip by knowing the type.

cdaringe · April 23, 2022, 5:41pm

Thx for the ref. It’s just what I had always imagined it to be in my head… and more!

Topic		Replies	Views
Is this optimized by the OCaml compiler? Learning	9	790	January 12, 2023
Ocaml Bytecode performance? Learning	21	2550	October 11, 2023
Help: simplify this code Learning	4	356	March 24, 2023
Curiosity: boolean being represented as 1 or 3? Learning compiler	5	236	June 27, 2025
Why is int comparison not just (-)? Learning	6	3408	June 26, 2019

Why is a hand-written equal function faster than ( = )?

Related topics