• Stars
    star
    2
  • Language
    Python
  • Created almost 13 years ago
  • Updated almost 13 years ago

Reviews

There are no reviews yet. Be the first to send feedback to the community and the maintainers!

Repository Details

How to scrape websites

More Repositories

1

urchin

Shell tests
Shell
212
star
2

openprism

Type your search in one search bar, and get results from all of the Socrata and CKAN portals.
JavaScript
39
star
3

nyc-crime-map

Python
22
star
4

data-wranglers-dc-pdfs

19
star
5

www.thomaslevine.com

Thomas Levine's old homepage
JavaScript
13
star
6

socrata-download

Shell
10
star
7

socrata-pricing

10
star
8

aaron-swartz

Python
8
star
9

socrata-analysis

JavaScript
7
star
10

scott

JavaScript
6
star
11

scott-documents

Wetland permit application documents
6
star
12

vlermv

Python
6
star
13

pickle-warehouse

Python
5
star
14

craigsgenerator

Python
5
star
15

special_snowflake

OpenEdge ABL
5
star
16

openlawsf

5
star
17

krounq

R
5
star
18

feedformatter

feedformatter is a Python library for generating news feeds in RSS and Atom formats.
Python
4
star
19

friendly_brief

Python
4
star
20

open-data-500

R
4
star
21

lazydriver

Scrape with a Chrome extension and push to a couch
JavaScript
4
star
22

ckan-datasets

3
star
23

nyc-crime-map-data

3
star
24

dadaportal

Python
3
star
25

picklecache

Python
3
star
26

pluplusch

Python
3
star
27

horetu

Python
2
star
28

highwall-fixtures

Fixtures for Highwall tests
2
star
29

plan-things

Shell
2
star
30

nps_weather

HTML
2
star
31

dicti

Case-insensitive dictionary
Python
2
star
32

data-guacamole

Python
2
star
33

barelywebgit

2
star
34

whom-to-email

R
2
star
35

cured-foods

Python
2
star
36

risley-floor-plans

2
star
37

wsync

2
star
38

ggplot-not-r

R
2
star
39

fizzbuzz-latex

2
star
40

arabic-tweets

R
2
star
41

hssas

Haskell
2
star
42

parsing-pdfs-workshop

Python
2
star
43

nyc-data-downloads

Downloads of the NYC data bank
2
star
44

socrata-defederate

2
star
45

n.an

Noncomprehensive Dotfile Archive Network
Shell
2
star
46

datakind-scy

2
star
47

open-data

R
2
star
48

delaware

Python
2
star
49

treegit

Off-the-self cgit server with some helpers
Shell
2
star
50

couch.thomaslevine.com

Tom's CouchDB server
1
star
51

status.thomaslevine.com

1
star
52

whitbygroup

JavaScript
1
star
53

openlawoakland

Python
1
star
54

geom_doner

R
1
star
55

dumptruck-website

Website for dumptruck
JavaScript
1
star
56

hipstogram

Pretty sure the hipstogram is a better visualization tool than the histogram.
R
1
star
57

socrata-nominate

JavaScript
1
star
58

intherooms-meetings-sf

1
star
59

nypd-xy

R
1
star
60

nfsn-helpers

PHP
1
star
61

bashful-fs

1
star
62

audio-cable-power-controller

1
star
63

meetup-users-meetup

http://www.meetup.com/MeetupUsers/
1
star
64

dep

1
star
65

course-map

1
star
66

formaldehyde

1
star
67

spreadsheet-corpus

1
star
68

closeddada

Python
1
star
69

united-states-middlenames

Python
1
star
70

gkebd

Python
1
star
71

csv-soundsystem

1
star
72

unwatermark

Python
1
star
73

code-against-america

1
star
74

Deleware-Corporations-Scraper

Python
1
star
75

smart_csv_dictreader

Python
1
star
76

bucket-wheel

Bucket-Wheel Excavator
Python
1
star
77

SocialValue

Python
1
star
78

python-daemon-example

Python
1
star
79

htmltable2matrix

Convert an html table to a list of lists in Python
Python
1
star
80

immaterialdigitallabor.net

HTML
1
star
81

github-user-growth

1
star
82

intherooms-meetings-ny

1
star
83

geom_taco

R
1
star
84

chainsaw.thomaslevine.com

1
star
85

scraperwiki-kpi

ScraperWiki key performance indicators
R
1
star
86

chainsaw

Web scraping tools
1
star
87

conveyor-belt

Distributed file-based email archival system
1
star
88

scraperwiki-exp14

1
star
89

ouekque

Haskell
1
star
90

postsecret-downloads

1
star
91

rifidec

Scraping http://www.rifidec.org/membres/infomembres.htm
1
star
92

gastronomification-big-data-talk

HTML
1
star
93

git.thomaslevine.com-fail1

PHP
1
star
94

hyde-h5bp

Hyde starter structure with HTML5Boilerplate post-processing
JavaScript
1
star
95

dieselfuel

Helpers for web scrapers
1
star
96

dwb-neef

1
star
97

euler

Haskell
1
star
98

script_kiddie_bot_confuser

Confuse bots that are trying to get into your site
Python
1
star
99

acpr-banque-de-france

Python
1
star
100

201204-homepage-experiment

R
1
star