Awesome Java Document and Text Processing Libraries

  • updated 1 day ago GNU Lesser Genera...

    Style and Grammar Checker for 25+ Languages

  • tika tika 1,860
    star
    updated 8 months ago Apache License 2.0

    The Apache Tika toolkit detects and extracts metadata and text from over a thousand different file types (such as PPT, XLS, and PDF).