There are no reviews yet. Be the first to send feedback to the community and the maintainers!
Repository Details
A Java library for splitting text into constituent words. This can be tricky for non-trivial examples, therefore the jTokenizer package was designed to combine a set of tokenizers that range from basic whitespace tokenizers to more complex ones that deal intuitively with natural language.