• Stars
    star
    8
  • Rank 2,099,232 (Top 42 %)
  • Language
    Java
  • License
    Other
  • Created over 13 years ago
  • Updated almost 13 years ago

Reviews

There are no reviews yet. Be the first to send feedback to the community and the maintainers!

Repository Details

A Java library for splitting text into constituent words. This can be tricky for non-trivial examples, therefore the jTokenizer package was designed to combine a set of tokenizers that range from basic whitespace tokenizers to more complex ones that deal intuitively with natural language.