Created by: proycon
FoLiA-tools are CLI tools and Python libraries to work with documents in the FoLiA (Format for Linguistic Annotation) format. FoLiA is an XML-based file format capable of storing a wide variety of linguistic annotation types, see https://proycon.github.io/folia . FoLiA is used in the academic Natural Language Processing community by various large corpora, mainly in The Netherlands and Flanders.
The FoLiA-tools are part of the folia github repo, the actual FoLiA library is part of PyNLPl (https://github.com/proycon/pynlpl), which is a mandatory dependency. I therefore combined the two in this single pull request, I hope you don't mind. However, PyNLPl contains more than just a FoLiA library. It contains various specialised modules, such as for parsing other specialized formats (Moses, GIZA++), clients for interfacing with certain NLP servers, a module for handling Corpus Query Language, and generic functionality for text processing and basic corpus statistics.
My work on FoLiA & CLAM was awarded the CLARIN Young Scientist Award 2015 (last week).