Unicode text segmentation for OCaml
Uuseg is an OCaml library for segmenting Unicode text. It implements the locale-independent Unicode text segmentation algorithms (http://www.unicode.org/reports/tr29/) to detect grapheme cluster, word and sentence boundaries, and the Unicode line breaking algorithm (http://www.unicode.org/reports/tr14/) to detect line break opportunities. The library is independent of any IO mechanism or Unicode text data structure, and it can process text without a complete in-memory representation. Uuseg depends on Uucp and, optionally, on Uutf to support OCaml UTF-X encoded strings. It is distributed under the ISC license.
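As a rough illustration of the kind of segmentation described above, the following sketch folds over the segments of a UTF-8 string using `Uuseg_string.fold_utf_8`. It assumes the `uuseg.string` sub-library is installed and linked; the helper names `segments` and `count_graphemes` are hypothetical and are not part of the library.

```ocaml
(* Minimal sketch, assuming the uuseg.string sub-library is available.
   [segments] and [count_graphemes] are illustrative helpers, not
   functions provided by Uuseg. *)

(* Collect the segments of a UTF-8 string for a given boundary kind,
   in the order they appear in the text. *)
let segments boundary s =
  List.rev (Uuseg_string.fold_utf_8 boundary (fun acc seg -> seg :: acc) [] s)

(* Count user-perceived characters (grapheme clusters). *)
let count_graphemes s =
  Uuseg_string.fold_utf_8 `Grapheme_cluster (fun n _ -> n + 1) 0 s

let () =
  (* Print each word-level segment in brackets. *)
  List.iter (Printf.printf "[%s]") (segments `Word "hello world");
  print_newline ();
  (* "e" followed by U+0301 COMBINING ACUTE ACCENT, encoded as UTF-8. *)
  Printf.printf "%d\n" (count_graphemes "e\xCC\x81")
```

The same fold accepts `` `Sentence `` and `` `Line_break `` as the boundary kind, which correspond to the sentence boundaries and line break opportunities mentioned in the description.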
Release | Stable | Testing |
---|---|---|
Fedora Rawhide | 16.0.0-1.fc42 | - |
Fedora 41 | 16.0.0-1.fc41 | - |
Fedora 40 | 15.1.0-6.fc40 | - |
Fedora 39 | 15.0.0-4.fc39 | - |
You can contact the maintainers of this package via email at ocaml-uuseg dash maintainers at fedoraproject dot org.