SearchScraperAPI
An aiohttp web server API that scrapes Google and returns the scrape results in the response. Supports proxies, multiple geos, and setting the number of results.
Scrapio
Asyncio web crawling framework. Work in progress.
SplashCrawler
A multi-threaded, Python-based crawler that uses Splash to render JavaScript.
puppy_crawler
Example basic web crawler built on top of Pyppeteer
nltk_classify
Scripts for text classification with NLTK
selenium_crawler_blog_example
Example of a simple Selenium crawl. The blog post can be found here: http://edmundmartin.com/selenium-based-crawler-in-python/
webster
Minimal crawling framework built on top of pyppeteer, allowing multiple pages to be rendered asynchronously
image_classifier
An image classifier built using TFLearn.
pyppeteer_hub
Wraps an asyncio server around pyppeteer and renders pages using multiple browsers and tabs, allowing for high-throughput page rendering.
search_it
Asyncio package for scraping major search engines. Supports pagination and other features.
beanstalkg
Pure Golang Beanstalkd client
baiduBot
baidubot is a small package that allows users to scrape search results from the Chinese search engine Baidu
LogIt
Fast and simple SEO log file parser written in Python
sitemapcrawl
Highly concurrent sitemap crawling library written in Golang.
gosearcher
Golang library for scraping search results
google_keyword_suggest
Scrapes the unofficial Google keyword suggestion API and gathers suggestions based on the keywords you enter. Users can enter comma-separated keywords into the tkinter app.
democrawl
Code for my blog post: http://edmundmartin.com/writing-a-web-crawler-in-golang/
GoogleScraper
Improved version of my previous Google scraper
Sooty
tflearn_document_classification
Classifying documents with TFLearn.
XMLSitemapChecker
A GUI program written in Python that starts at the core sitemap index file and parses all of the files and URLs it contains. The tool then checks the header status of the URLs found in the sitemap files and reports whether these URLs are blocked by
SimpleDB
Code following along with the book 'Database Design and Implementation'
leetcode_solutions
Solutions to LeetCode problems in Python
BingwebmasterAPI
Bing webmaster API
gstalk
GolangInterviewPrep
py2pyx
yantran
Yantran is a Python wrapper for Yandex's translate API.
boselecta
Simple feature flagging service - WORK IN PROGRESS
GoCrawler
A simple web crawler written in Golang
500LinesOrLessCrawler
The asyncio crawler from the 500 Lines or Less book, updated to make use of new Python syntax and improvements
depatureboard
Command-line departure board, written in Golang, using data from the National Rail site.
yansearch
Python wrapper over the Yandex Search API that makes it easier to make a Yandex Search API request. Additional features to be added.
trackpix
Simple tracking pixel piping results into Yandex ClickHouse