  • Stars: 125
  • Rank 286,335 (Top 6 %)
  • Language: Python
  • Created about 8 years ago
  • Updated over 1 year ago

Repository Details

A simple tool for fetching usable proxies from several websites.

Proxy

A tiny tool for crawling, assessing, and storing useful proxies. (Chinese version)

Construct your ip pool

Install the Python dependencies (pymysql is only a client, so a running MySQL server is also needed):

pip install pymysql requests

Edit the database connection settings in config.py.
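The README does not show what config.py contains, so the following is only a guess at a workable layout. TABLE_NAME is the one name that appears in the demo code; every other constant, the placeholder values, and the connection_kwargs helper are hypothetical.

```python
# config.py (hypothetical layout; only TABLE_NAME is taken from the demo code)
HOST = "localhost"
PORT = 3306              # default MySQL port
USER = "root"            # placeholder credentials, replace with your own
PASSWORD = "change-me"
DB_NAME = "proxy"        # assumed database name
TABLE_NAME = "valid_ip"  # assumed table name


def connection_kwargs():
    """Return keyword arguments that can be passed to pymysql.connect(**kwargs)."""
    return {
        "host": HOST,
        "port": PORT,
        "user": USER,
        "password": PASSWORD,
        "database": DB_NAME,
        "charset": "utf8mb4",
    }
```

With a layout like this, a script would open its connection with pymysql.connect(**connection_kwargs()).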

# crawl, assess and store proxies
python ip_pool.py

# assess the quality of the proxies in the db periodically
python assess_quality.py
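The README does not describe how a proxy is assessed. One plausible approach is to time a single request through the proxy and treat any failure as "unusable". The sketch below assumes that approach; the names build_proxies and assess_proxy, the test URL, and the 4-second timeout are illustrative and are not taken from ip_pool.py or assess_quality.py. It uses only the standard library so it runs without extra installs.

```python
import time
import urllib.request


def build_proxies(address):
    """Route both http and https traffic through the HTTP proxy at "host:port"."""
    return {"http": "http://" + address, "https": "http://" + address}


def _default_fetch(url, proxies, timeout):
    """Fetch url through the given proxies using urllib; raises on failure."""
    opener = urllib.request.build_opener(urllib.request.ProxyHandler(proxies))
    with opener.open(url, timeout=timeout) as resp:
        resp.read()


def assess_proxy(address, test_url="https://www.github.com/", timeout=4,
                 fetch=_default_fetch):
    """Return the response time in seconds, or None if the request failed."""
    start = time.monotonic()
    try:
        fetch(test_url, build_proxies(address), timeout)
    except Exception:
        # Any connection error, timeout, or HTTP error marks the proxy as unusable.
        return None
    return time.monotonic() - start
```

A caller could keep only the proxies for which assess_proxy returns a number and sort them by that number. The fetch parameter exists so the function can be exercised without network access.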

Demo on how to use these proxies.

Please first construct your ip pool.

Crawl the GitHub homepage:

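import requests
import config as cfg  # config.py from this repository

# visit the database to get all proxies
# (conn and cursor are assumed to be an open pymysql connection and cursor
# created from the settings in config.py)
ip_list = []
try:
    cursor.execute('SELECT content FROM %s' % cfg.TABLE_NAME)
    result = cursor.fetchall()
    for i in result:
        ip_list.append(i[0])
except Exception as e:
    print(e)
finally:
    cursor.close()
    conn.close()

# use these proxies to crawl a website
for i in ip_list:
    # requests selects a proxy by URL scheme, so an https URL needs an 'https' entry
    proxy = {'http': 'http://' + i, 'https': 'http://' + i}
    url = "https://www.github.com/"
    try:
        r = requests.get(url, proxies=proxy, timeout=4)
        print(r.text)
    except requests.RequestException as e:
        # dead or slow proxies are common; move on to the next one
        print(e)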

See crawl_demo.py for more detail.
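Free proxies fail often, so in practice it can help to try several in random order and stop at the first one that works. The sketch below is one way to do that and is not taken from crawl_demo.py; the function name fetch_with_rotation, the max_attempts parameter, and the fetch callback are all assumptions made for illustration.

```python
import random


def fetch_with_rotation(url, ip_list, fetch, max_attempts=5):
    """Try up to max_attempts proxies in random order.

    fetch(url, proxies) must perform the request and raise on failure.
    Returns (proxy_address, result) for the first proxy that succeeds.
    """
    candidates = random.sample(ip_list, k=min(max_attempts, len(ip_list)))
    last_error = None
    for address in candidates:
        # Same HTTP proxy for both schemes, as in the demo above.
        proxies = {"http": "http://" + address, "https": "http://" + address}
        try:
            return address, fetch(url, proxies)
        except Exception as exc:
            last_error = exc  # remember the failure and try the next proxy
    raise RuntimeError(
        "no working proxy among %d tried" % len(candidates)
    ) from last_error
```

With requests installed, the callback could be lambda url, proxies: requests.get(url, proxies=proxies, timeout=4).text, and ip_list would be the list read from the database.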

Contact

[email protected]

More Repositories

1

Queryable

Run OpenAI's CLIP model on iOS to search photos.
Swift
2,430
star
2

tinymind

Tinymind - Write and sync your blog & thoughts with GitHub
TypeScript
617
star
3

disco-diffusion-wrapper

An implementation of a disco-diffusion wrapper that can run on your own GPU with batch text input.
Jupyter Notebook
571
star
4

randomCNN-voice-transfer

Audio style transfer with shallow random parameters CNN.
Python
375
star
5

PodFind

Find what podcasters think of new things: GPT-4, SVB, etc.
JavaScript
149
star
6

BaiduCrawler

Sample of using proxies to crawl baidu search results.
Python
118
star
7

api-usage

Track your OpenAI API token usage & cost.
HTML
58
star
8

WaveGAN-pytorch

PyTorch implementation of "Synthesizing Audio with Generative Adversarial Networks"
Python
57
star
9

teach-show-consult

Teach ChatGPT the Alda music programming language, show it some superb code, and consult with it to compose a melody.
Python
47
star
10

QLearningMouse

Cat-and-Mouse game with Reinforcement Learning (Q-Learning).
Python
24
star
11

make-CelebA-HQ

Suppose you've downloaded the CelebA & CelebA-HQ datasets and want to get HQ images from them.
Python
15
star
12

Manzarek

A tiny bot that reposts blind-date information from the website fanfou.
Python
11
star
13

Disentangled-Sequential-Autoencoder

PyTorch Implementation of Disentangled Sequential Autoencoder
Jupyter Notebook
8
star
14

Focus

Chrome Extension: One-click to batch open websites, double-click to close them.
JavaScript
8
star
15

N-Grams-novel

An English & Chinese novel generator based on N-Grams.
Python
4
star
16

DrQAChinese

Python
3
star
17

mazz.github.io

HTML
1
star
18

mazzzystar.github.io

HTML
1
star
19

MusicGAN

Generate raw piano audio with long-term "structure" dependency. Result: https://soundcloud.com/mazzzystar/sets/only-1-discriminator-to-control-both-local-long-term
Python
1
star