Slybot is now part of Portia
Slybot is now part of the Portia scraper.
The commit history of this repository has been preserved only for reference purposes.
There are no reviews yet. Be the first to send feedback to the community and the maintainers!
Slybot is now part of the Portia scraper.
The commit history of this repository has been preserved only for reference purposes.
scrapy
Scrapy, a fast high-level web crawling & scraping framework for Python.scrapyd
A service daemon to run Scrapy spidersscrapely
A pure-python HTML screen-scraping librarydirbot
Scrapy project to scrape public web directories (educational) [DEPRECATED]quotesbot
This is a sample Scrapy project for educational purposesparsel
Parsel lets you extract data from XML/HTML documents using XPath or CSS selectorsscrapyd-client
Command line client for Scrapyd serverw3lib
Python library of web-related functionscssselect
CSS Selectors for Pythonloginform
Fill HTML login forms automaticallyqueuelib
Collection of persistent (disk-based) and non-persistent (memory-based) queues for Pythonscrapy.org
The scrapy.org websiteitemadapter
Common interface for data container classesprotego
A pure-Python robots.txt parser with support for modern conventions.itemloaders
Library to populate items using XPath and CSS with a convenient APIscrapy-bench
A CLI for benchmarking Scrapy.scurl
Performance-focused replacement for Python urllibpypydispatcher
A fork of http://pydispatcher.sourceforge.net/ with PyPy supportxtractmime
https://mimesniff.spec.whatwg.org/ implementation for Pythonbase-chromium
base component forked from Chromium source https://chromium.googlesource.com/chromium/src/base/scrapy-itemloader
[Archived] Library to populate Scrapy items using XPath and CSS with a convenient APIgsoc2014-integration-tests
GSoC2014 - Scrapy Integration tests projecturl-chromium
url component from Chromium source code, forked from https://chromium.googlesource.com/chromium/src/urlLove Open Source and this site? Check out how you can help us