Created by: proycon
Ucto is an advanced regular-expression based tokeniser and sentence splitter, with configurations for various languages, support for FoLiA XML. Ucto itself is written in C++, this is the Python binding to ucto.
Created by: proycon
Ucto is an advanced regular-expression based tokeniser and sentence splitter, with configurations for various languages, support for FoLiA XML. Ucto itself is written in C++, this is the Python binding to ucto.