Awesome Java Web Scraping Libraries

  • Updated about 2 months ago · Apache License 2.0

    A set of reusable Java components that implement functionality common to any web crawler

  • Updated over 1 year ago · Apache License 2.0

    HtmlUnit is a "GUI-Less browser for Java programs".