• Stars
    star
    1
  • Language
  • Created almost 13 years ago
  • Updated almost 13 years ago

Reviews

There are no reviews yet. Be the first to send feedback to the community and the maintainers!

Repository Details

Helpers for web scrapers

More Repositories

1

urchin

Shell tests
Shell
212
star
2

openprism

Type your search in one search bar, and get results from all of the Socrata and CKAN portals.
JavaScript
39
star
3

nyc-crime-map

Python
22
star
4

data-wranglers-dc-pdfs

19
star
5

www.thomaslevine.com

Thomas Levine's old homepage
JavaScript
13
star
6

socrata-download

Shell
10
star
7

socrata-pricing

10
star
8

aaron-swartz

Python
8
star
9

socrata-analysis

JavaScript
7
star
10

scott

JavaScript
6
star
11

scott-documents

Wetland permit application documents
6
star
12

vlermv

Python
6
star
13

pickle-warehouse

Python
5
star
14

craigsgenerator

Python
5
star
15

special_snowflake

OpenEdge ABL
5
star
16

openlawsf

5
star
17

krounq

R
5
star
18

feedformatter

feedformatter is a Python library for generating news feeds in RSS and Atom formats.
Python
4
star
19

friendly_brief

Python
4
star
20

open-data-500

R
4
star
21

lazydriver

Scrape with a Chrome extension and push to a couch
JavaScript
4
star
22

ckan-datasets

3
star
23

nyc-crime-map-data

3
star
24

dadaportal

Python
3
star
25

picklecache

Python
3
star
26

pluplusch

Python
3
star
27

horetu

Python
2
star
28

highwall-fixtures

Fixtures for Highwall tests
2
star
29

plan-things

Shell
2
star
30

nps_weather

HTML
2
star
31

dicti

Case-insensitive dictionary
Python
2
star
32

data-guacamole

Python
2
star
33

barelywebgit

2
star
34

whom-to-email

R
2
star
35

cured-foods

Python
2
star
36

risley-floor-plans

2
star
37

wsync

2
star
38

ggplot-not-r

R
2
star
39

fizzbuzz-latex

2
star
40

arabic-tweets

R
2
star
41

hssas

Haskell
2
star
42

parsing-pdfs-workshop

Python
2
star
43

nyc-data-downloads

Downloads of the NYC data bank
2
star
44

socrata-defederate

2
star
45

n.an

Noncomprehensive Dotfile Archive Network
Shell
2
star
46

how-to-scrape

How to scrape websites
Python
2
star
47

datakind-scy

2
star
48

open-data

R
2
star
49

delaware

Python
2
star
50

treegit

Off-the-self cgit server with some helpers
Shell
2
star
51

couch.thomaslevine.com

Tom's CouchDB server
1
star
52

status.thomaslevine.com

1
star
53

whitbygroup

JavaScript
1
star
54

openlawoakland

Python
1
star
55

geom_doner

R
1
star
56

dumptruck-website

Website for dumptruck
JavaScript
1
star
57

hipstogram

Pretty sure the hipstogram is a better visualization tool than the histogram.
R
1
star
58

socrata-nominate

JavaScript
1
star
59

intherooms-meetings-sf

1
star
60

nypd-xy

R
1
star
61

nfsn-helpers

PHP
1
star
62

bashful-fs

1
star
63

audio-cable-power-controller

1
star
64

meetup-users-meetup

http://www.meetup.com/MeetupUsers/
1
star
65

dep

1
star
66

course-map

1
star
67

formaldehyde

1
star
68

spreadsheet-corpus

1
star
69

closeddada

Python
1
star
70

united-states-middlenames

Python
1
star
71

gkebd

Python
1
star
72

csv-soundsystem

1
star
73

unwatermark

Python
1
star
74

code-against-america

1
star
75

Deleware-Corporations-Scraper

Python
1
star
76

smart_csv_dictreader

Python
1
star
77

bucket-wheel

Bucket-Wheel Excavator
Python
1
star
78

SocialValue

Python
1
star
79

python-daemon-example

Python
1
star
80

htmltable2matrix

Convert an html table to a list of lists in Python
Python
1
star
81

immaterialdigitallabor.net

HTML
1
star
82

github-user-growth

1
star
83

intherooms-meetings-ny

1
star
84

geom_taco

R
1
star
85

chainsaw.thomaslevine.com

1
star
86

scraperwiki-kpi

ScraperWiki key performance indicators
R
1
star
87

chainsaw

Web scraping tools
1
star
88

conveyor-belt

Distributed file-based email archival system
1
star
89

scraperwiki-exp14

1
star
90

ouekque

Haskell
1
star
91

postsecret-downloads

1
star
92

rifidec

Scraping http://www.rifidec.org/membres/infomembres.htm
1
star
93

gastronomification-big-data-talk

HTML
1
star
94

git.thomaslevine.com-fail1

PHP
1
star
95

hyde-h5bp

Hyde starter structure with HTML5Boilerplate post-processing
JavaScript
1
star
96

dwb-neef

1
star
97

euler

Haskell
1
star
98

script_kiddie_bot_confuser

Confuse bots that are trying to get into your site
Python
1
star
99

acpr-banque-de-france

Python
1
star
100

201204-homepage-experiment

R
1
star