UnicodeUnicode utilities
val classify : int -> statusClassify a unicode char into 3 classes or unknown.
Return None if a given string can be used as a (Rocq) identifier. Return Some (b,s) otherwise, where s is an explanation and b is severity.
val is_valid_ident_initial : status -> boolTells if a valid initial character for an identifier
val is_valid_ident_trailing : status -> boolTells if a valid non-initial character for an identifier
val is_letter : status -> boolTells if a letter
val is_unknown : status -> boolTells if a character is unclassified
Split a string supposed to be an ident at the first letter; as an optimization, return None if the first character is a letter
Return true if all UTF-8 characters in the input string are just plain ASCII characters. Returns false otherwise.
ascii_of_ident s maps UTF-8 string to a string composed solely from ASCII characters. The non-ASCII characters are translated to "_UUxxxx_" where xxxx is the Unicode index of the character in hexadecimal (from four to six hex digits). To avoid potential name clashes, any preexisting substring "_UU" is turned into "_UUU".