`int_of_string` behaviour

lindig · February 25, 2025, 10:09am

utop # int_of_string "0b001";;
- : int = 1
utop # int_of_string "0xff";;
- : int = 255

int_of_string parses hex, binary, and octal notations in addition to the usual decimal notation. This is often surprising and can lead to subtle errors when the function is used to validate input syntax. Should the Int module contain conversion functions that are more explicit about what they accept and that have an efficient implementation?
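To make the concern concrete, here is a small sketch of a validator that relies on the conversion alone. The name `is_valid_quantity` is hypothetical and only serves as an illustration; the point is that strings a user would not expect to pass as decimal numbers are accepted.

```ocaml
(* Hypothetical validator: it accepts whatever int_of_string accepts,
   as long as the result is non-negative. *)
let is_valid_quantity (s : string) : bool =
  match int_of_string_opt s with
  | Some n -> n >= 0
  | None -> false

let () =
  assert (is_valid_quantity "42");
  (* Prefixed and underscore-separated literals pass as well. *)
  assert (is_valid_quantity "0xff");
  assert (is_valid_quantity "0o17");
  assert (is_valid_quantity "1_000");
  assert (not (is_valid_quantity "forty-two"))
```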

chshersh · February 26, 2025, 5:46pm

Looks like this behaviour is mentioned in the documentation, so as long as you read the docs, it shouldn’t be too surprising.

Generally, different languages provide different defaults. If you want something stricter or more lenient, you can easily roll your own function.
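As a rough sketch of what rolling your own might look like, the following function accepts only an optional leading minus sign followed by decimal digits. The name `strict_int_of_string` is an assumption made for this example and is not part of the standard library; the only stdlib call used is `int_of_string_opt`.

```ocaml
(* Hypothetical helper, not part of the standard library.
   Accepts only: an optional '-' followed by one or more digits 0-9. *)
let strict_int_of_string (s : string) : int option =
  let is_digit c = c >= '0' && c <= '9' in
  let len = String.length s in
  (* Skip a single optional leading minus sign. *)
  let start = if len > 0 && s.[0] = '-' then 1 else 0 in
  if len = start then None (* empty string, or a lone "-" *)
  else
    let rec all_digits i =
      i >= len || (is_digit s.[i] && all_digits (i + 1))
    in
    (* Once the syntax is known to be plain decimal, the conversion
       can only fail on overflow, which yields None. *)
    if all_digits start then int_of_string_opt s else None
```

With this check in front, inputs such as "0xff", "0b001" and "1_000" are rejected, while "42" and "-7" convert as usual.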

xavierleroy · February 26, 2025, 5:52pm

Agreed. I think validation of input syntax should be done / is often done before calling int_of_string. For example, a lexer naturally checks the syntax of integer literals before converting them to int using int_of_string. Likewise for input field validation in HTML forms.
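A minimal sketch of the lexer-style approach, assuming a hand-written scanner rather than any particular lexer generator: the scanner decides which characters belong to the literal, and the conversion only ever sees a run of decimal digits. The function name `scan_decimal` and its return shape are hypothetical.

```ocaml
(* Hypothetical scanner: reads a run of decimal digits starting at [pos]
   and returns the value together with the position just past the literal. *)
let scan_decimal (input : string) (pos : int) : (int * int) option =
  let len = String.length input in
  let rec stop i =
    if i < len && input.[i] >= '0' && input.[i] <= '9' then stop (i + 1)
    else i
  in
  if pos < 0 then None
  else
    let stop_pos = stop pos in
    if stop_pos = pos then None (* no digit at [pos] *)
    else
      (* The substring holds digits only, so the conversion can fail
         only on overflow. *)
      match int_of_string_opt (String.sub input pos (stop_pos - pos)) with
      | Some n -> Some (n, stop_pos)
      | None -> None
```

On the input "0xff" this scanner reads the single digit "0" and stops at the "x", so the prefix handling in int_of_string never comes into play.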

lindig · February 26, 2025, 6:13pm

I am aware that this behaviour is documented, but I think it would have been better to relegate such flexible behaviour to a function inside a module that is not open by default. It is too late for that now, hence my suggestion to add a stricter function to the Int module. We can probably agree that most code does not do syntactic checks before calling int_of_string and so opens itself up to surprises.
