Written language identification
TextCat is an implementation of the text categorization algorithm presented in Cavnar, W. B. and J. M. Trenkle, "N-Gram-Based Text Categorization". TextCat uses this technique to identify the language of written text. At the moment, it knows about 69 natural languages (counting Esperanto as a natural language).
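To give a feel for the n-gram technique named above, here is a minimal Python sketch of rank-based n-gram profile matching. It is not TextCat's own code: the function names (`ngram_profile`, `out_of_place`, `identify`), the underscore padding, and the defaults (n-grams of length 1 to 5, top 300 entries) are assumptions chosen for illustration, and the two training samples are far too small for real use.

```python
from collections import Counter


def ngram_profile(text, max_n=5, top_k=300):
    """Return the top_k most frequent character n-grams, most frequent first.

    Each word is padded with '_' so that word boundaries show up in the
    n-grams. max_n and top_k are illustrative defaults, not TextCat settings.
    """
    counts = Counter()
    for word in text.lower().split():
        padded = f"_{word}_"
        for n in range(1, max_n + 1):
            for i in range(len(padded) - n + 1):
                counts[padded[i:i + n]] += 1
    # Sort by descending count, then alphabetically so ties are deterministic.
    ranked = sorted(counts.items(), key=lambda kv: (-kv[1], kv[0]))
    return [gram for gram, _ in ranked[:top_k]]


def out_of_place(doc_profile, lang_profile):
    """Sum of rank differences between two profiles.

    An n-gram missing from lang_profile costs a fixed maximum penalty
    (here, the length of lang_profile).
    """
    rank = {gram: i for i, gram in enumerate(lang_profile)}
    max_penalty = len(lang_profile)
    return sum(
        abs(i - rank[gram]) if gram in rank else max_penalty
        for i, gram in enumerate(doc_profile)
    )


def identify(text, lang_profiles):
    """Return the language whose profile is closest to the text's profile."""
    doc_profile = ngram_profile(text)
    return min(
        lang_profiles,
        key=lambda lang: out_of_place(doc_profile, lang_profiles[lang]),
    )


# Hypothetical, tiny training samples; a real profile needs far more text.
SAMPLES = {
    "english": "the quick brown fox jumps over the lazy dog and runs away",
    "german": "der schnelle braune fuchs springt über den faulen hund",
}
PROFILES = {lang: ngram_profile(sample) for lang, sample in SAMPLES.items()}

if __name__ == "__main__":
    print(identify("the dog runs over the fox", PROFILES))
```

The design choice worth noting is that only the rank of each n-gram is compared, not its raw frequency, so a short input and a long training text can be compared directly.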
Release | Stable | Testing |
---|---|---|
Fedora Rawhide | 1.10-22.fc42 | - |
Fedora 42 | 1.10-22.fc42 | - |
Fedora 41 | 1.10-20.fc41 | - |
Fedora 40 | 1.10-19.fc40 | - |
You can contact the maintainers of this package via email at textcat dash maintainers at fedoraproject dot org.