wuchangfeng/vino-crawlers

Stars: 208
Rank: 188,393 (Top 4%)
Language: Python
Created almost 9 years ago
Updated almost 8 years ago


Some crawlers for getting data from the net.

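About

Simple crawlers I wrote while learning Python to collect the data I needed.
Some of these programs were written a while ago, and the verification mechanisms on some sites have probably changed since then, so treat them as reference only.
Updated irregularly. PRs are welcome.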

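Crawler examples

Readme_Luowang: a small program for crawling music from Luowang and downloading it locally.
Readme_Baidu: a small program, based on Python 2.7, for downloading images from Baidu by keyword.
Readme_Zhihu: a program for scraping some information from Zhihu.
Readme_One: how to crawl the daily picture and the One Q&A from the One website and store them in the LeanCloud cloud backend.
Readme_Sujin: how to crawl good articles from the Sujin website and store them in the LeanCloud cloud backend.
Readme_Douban: how to crawl the Douban Books Top 250.
Readme_Lagou: how to crawl a fairly large number of job postings from Lagou and store them in a NoSQL database.
Readme_XiciDaili: copied from a Zhihu answer, changed to use MongoDB for storage, with a validation mechanism added. The usable rate is not very high, though, at roughly 30%.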
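The Readme_XiciDaili entry mentions a validation mechanism for scraped proxies. The snippet below is only a minimal sketch of what such a step could look like, not the repository's actual code. It assumes Python 3 and the standard library only. The names `check_proxy` and `filter_working_proxies`, the test URL, and the timeout are all hypothetical, and the MongoDB storage step is left out.

```python
import http.client
import urllib.request
from typing import Callable, Iterable, List


def check_proxy(proxy: str, test_url: str = "http://example.com",
                timeout: float = 5.0) -> bool:
    """Return True if test_url can be fetched through the given proxy.

    `proxy` is expected to look like "http://1.2.3.4:8080" (an assumption).
    """
    handler = urllib.request.ProxyHandler({"http": proxy, "https": proxy})
    opener = urllib.request.build_opener(handler)
    try:
        with opener.open(test_url, timeout=timeout) as response:
            return response.status == 200
    except (OSError, http.client.HTTPException):
        # URLError and socket timeouts are OSError subclasses;
        # malformed replies raise HTTPException.
        return False


def filter_working_proxies(proxies: Iterable[str],
                           checker: Callable[[str], bool] = check_proxy
                           ) -> List[str]:
    """Keep each distinct proxy that passes `checker`, preserving order."""
    seen = set()
    working = []
    for proxy in proxies:
        if proxy in seen:
            continue  # skip duplicates so each proxy is checked once
        seen.add(proxy)
        if checker(proxy):
            working.append(proxy)
    return working
```

Passing the checker as a parameter keeps the filtering logic testable without network access. A real run would call `filter_working_proxies(candidates)` and store the result in whatever backend is in use.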

Crawler basics

Advanced crawling

Data analysis

Python related

Encoding issues in Python 2

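Recommended books

Python for Data Analysis (《用 Python 进行数据分析》)
Learning Data Mining with Python (《Python 数据挖掘入门与实战》)
Clean Data (《干净的数据-数据清洗与入门实践》)
Web Scraping with Python (《Python 网络数据采集》)
Programming Collective Intelligence (《集体智慧编程》)
Introduction to Data Mining (《数据挖掘导论》)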

Thanks

suzumiyang, for helping improve the Luowang crawler.

one

Uses MVP + Dagger2 + Realm as the project's main infrastructure, with data fetched from One and Sujin by crawlers.

markdown-helper

Drop images onto a Python script and get their Markdown URLs in a text file.

zhuanlan

An app for learning RxJava and Retrofit, using data from Zhihu and Gank.io.

uninstall-app

A Python script for uninstalling the app.

meizitu-scrapy

Uses Scrapy to download meizi images from the web.

vino-workflows

Some workflows for macOS.

blog-backup

Articles for Learning and also a Backup for My Blog.

zhihu-hook

A Chrome app for viewing Meizi images in Zhihu questions.

vino-django-blog

A blog based on Django and Python.

wuchangfeng.github.io

interview

Interview tips for my summer internship.

vino-easy-django

A personal blog based on Django and deployed on PythonAnywhere.

RxImageLoader

An image loader built with RxJava 2 and Kotlin, similar to Picasso. Reference: https://blog.csdn.net/github_27372715/article/details/80899243

vino-android-demos

Some demos for Android development.

kotlin-notes

idea-plugins

Plugins for IntelliJ IDEA and Android Studio.