Written language identification
TextCat is an implementation of the text categorization algorithm presented in Cavnar, W. B. and J. M. Trenkle, "N-Gram-Based Text Categorization". TextCat uses this the technique to implement a written language identification. At the moment, it knows about 69 natural languages (counting Esperanto as a natural language).
Release | Stable | Testing |
---|---|---|
Fedora Rawhide | 1.10-19.fc40 | - |
Fedora 40 | 1.10-19.fc40 | - |
Fedora 39 | 1.10-18.fc39 | - |
Fedora 38 | 1.10-17.fc38 | - |
EPEL 7 | 1.10-1.el7 | - |
You can contact the maintainers of this package via email at
textcat dash maintainers at fedoraproject dot org
.