I guess I could use a regex to split the string (though that seems like overkill), but I’m not really sure how to determine the first complete Unicode character in a UTF-8 string.
Example string from the data:
"ƒT01ƒULatnƒx13ƒaTsionut datitƒdhisṭoriah, raʿyon, ḥevrahƒhʿorekh: Dov Shṿartsƒlkerekh 3"
I need to determine that the first character actually is ƒ, and then split on every occurrence of ƒ.
Sorry if this is obvious. I haven’t been using OCaml very long. I’m using Base, if that makes a difference.
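To make the goal concrete, here is a minimal sketch of one way this might be done without a regex. It assumes OCaml 4.14 or later, where the standard library provides `String.get_utf_8_uchar` and `Uchar.utf_decode_length`. The function names (`first_char_len`, `split_on_substring`, `split_on_first_char`) are made up for illustration. Everything is qualified with `Stdlib.` so that it should also compile with Base opened, though I have only reasoned about that, not confirmed it against a particular Base version.

```ocaml
(* Byte length of the first UTF-8 encoded character of [s], or None if
   [s] is empty or does not start with a valid UTF-8 sequence.
   Assumes OCaml >= 4.14 (String.get_utf_8_uchar). *)
let first_char_len (s : string) : int option =
  if Stdlib.String.length s = 0 then None
  else
    let d = Stdlib.String.get_utf_8_uchar s 0 in
    if Stdlib.Uchar.utf_decode_is_valid d
    then Some (Stdlib.Uchar.utf_decode_length d)
    else None

(* Split [s] on every occurrence of the substring [sep], byte by byte.
   An empty [sep] is treated as "no delimiter" and returns [s] unsplit. *)
let split_on_substring ~(sep : string) (s : string) : string list =
  let n = Stdlib.String.length s and m = Stdlib.String.length sep in
  if m = 0 then [ s ]
  else
    let rec go start i acc =
      if i + m > n then
        (* No room left for another delimiter: emit the final field. *)
        Stdlib.List.rev (Stdlib.String.sub s start (n - start) :: acc)
      else if Stdlib.String.equal (Stdlib.String.sub s i m) sep then
        go (i + m) (i + m) (Stdlib.String.sub s start (i - start) :: acc)
      else go start (i + 1) acc
    in
    go 0 0 []

(* Use the first UTF-8 character of [s] as the delimiter and split on it.
   Returns [s] unsplit if no valid first character can be decoded. *)
let split_on_first_char (s : string) : string list =
  match first_char_len s with
  | None -> [ s ]
  | Some len -> split_on_substring ~sep:(Stdlib.String.sub s 0 len) s
```

Because the string starts with the delimiter, the first element of the result is an empty string, so `split_on_first_char "ƒT01ƒULatn"` would give `[""; "T01"; "ULatn"]`. Searching byte by byte is safe here because the encoding of one valid UTF-8 character cannot appear inside the encoding of a different one.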