Computational Linguistics Toolset |
|
"The Computational Linguistics Toolset is a set of tools
for computational linguistics. It contains re-usable code for cleaning,
splitting, refining, and taking samples from corpora (ICE, Penn, and a
native one), for tagging them using the TnT-tagger, for doing permutation
statistics on N-grams (useful for finding statistically significant
syntactical differences between any two sets of tagged texts), and various
examination-tools. The tools themselves are well documented." |
|
open-source |