Anemone web-spider framework
a simple, fast web-crawler written in Ruby using Watir or Typhoeus
Pure ruby implementation of the Boilerpipe content extraction algorithm tuned for online articles
Web crawler with very flexible crawling options. Can either use standalone or can be used with resque to perform clustered crawls.
A ruby web/screen scraping tool / gem.
Ruby gem that fetches images and metadata from a given URL. Much like popular social website with link preview.
Ruby gem for web scraping purposes. It scrapes a given URL, and returns you its title, meta description, meta keywords, links, images...
A Ruby DSL for structured web crawling, with a robust caching system.
Lightweight Ruby web crawler/scraper with an elegant DSL which extracts structured data from pages.