OCaml for Data Science

akabe · April 25, 2018, 1:44am

Hi. Sorry for the late reply. I’m using OCaml for data science at work.

What makes OCaml good/bad for data science (in short, based on your experience with OCaml in this area)?

(Good) OCaml is fast.
(Good) Static typing prevents many small bugs. For example, Python often fails with a missing-key error on a dict only after a long computation, whereas OCaml catches the same kind of mistake at compile time (when we use records).
(Bad) OCaml has fewer machine learning libraries than Python.
(Bad) OCaml does not support multicore parallelism.
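
To illustrate the static typing point, here is a minimal sketch using only the standard library. The `sample` record, its fields, and the `mean_score` and `lookup` helpers are hypothetical names made up for this example. It contrasts a record, where a misspelled field is rejected by the compiler, with a string-keyed `Hashtbl`, where a misspelled key only shows up at run time, much like a Python dict.

```ocaml
(* Hypothetical record type for one row of a dataset. *)
type sample = { id : int; label : string; score : float }

(* Mean of the score field; returns 0.0 for an empty list. *)
let mean_score (rows : sample list) : float =
  match rows with
  | [] -> 0.0
  | _ ->
    let total = List.fold_left (fun acc r -> acc +. r.score) 0.0 rows in
    total /. float_of_int (List.length rows)

(* Writing r.scroe instead of r.score above would be a compile-time error
   (unbound record field), so the typo is caught before anything runs. *)

let rows =
  [ { id = 1; label = "a"; score = 0.5 };
    { id = 2; label = "b"; score = 1.5 } ]

(* A string-keyed table behaves like a Python dict: a misspelled key is
   only detected when the lookup is executed. *)
let tbl : (string, float) Hashtbl.t = Hashtbl.create 4
let () = Hashtbl.replace tbl "score" 0.5

(* find_opt returns None for a missing key instead of raising Not_found. *)
let lookup (key : string) : float option = Hashtbl.find_opt tbl key

let () = Printf.printf "mean = %.2f\n" (mean_score rows)
```

With the record version the mistake never reaches the long-running part of the program, whereas `lookup "scroe"` compiles fine and simply returns `None` when it is executed.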

What are the OCaml alternatives for Python’s Pandas, NumPy, SciPy, etc.?

As some people mentioned, Owl is similar to NumPy.

Do you know any frontier companies/products/projects that use OCaml for data science?

I don’t know. I use OCaml for data science personally. However, my colleagues use their favorite languages, e.g., Java, Python, etc.

Are there any problems related to data science that were solved by other platforms, but not by OCaml as a platform?

Lack of libraries, lack of multicore support, and lack of scalable distributed-memory processing environments (I know of some opam packages such as rpc_parallel, but I cannot find enough examples).

Maybe you can give me a good piece of advice related to both OCaml and data science.

Jupyter (http://jupyter.org/) is very useful, and it can execute OCaml code through OCaml Jupyter, an OCaml kernel for Jupyter notebooks. A Docker image containing many packages for data science is available in the GitHub repository akabe/docker-ocaml-jupyter-datascience (Dockerfiles for data science in OCaml on Jupyter), and some example notebooks are in the notebooks directory of that repository.
Please try them if you are interested.
