• Stars
    star
    10
  • Rank 1,807,489 (Top 36 %)
  • Language
    Python
  • Created about 11 years ago
  • Updated about 11 years ago

Reviews

There are no reviews yet. Be the first to send feedback to the community and the maintainers!

Repository Details

Scraping Workshop for Hacks/Hackers BA

More Repositories

1

dataset

Easy-to-use data handling for SQL data stores with support for implicit table creation, bulk loading, and transactions.
Python
4,766
star
2

normality

A tiny library for Python text normalisation. Useful for ad-hoc text processing.
Python
142
star
3

datafreeze

Dump (freeze) SQL query results from a database into a selection of file formats
Python
89
star
4

pgcsv

Load CSV files into Postgres without explicit schema creation.
Python
74
star
5

twindle

A set of utilities to track and mine Twitter streaming API data
CoffeeScript
47
star
6

databin

A pastebin for tables.
Python
33
star
7

btw13.js

JavaScript implementation of the German electoral system ;)
JavaScript
33
star
8

sedar

Scraping bits of SEDAR
Python
29
star
9

regenesis

Scraper for the GENESIS statistical database
Python
28
star
10

dbcopy

Copy the contents of one SQL database to another
Python
25
star
11

googlesheets

Simple-to-use wrapper for accessing Google Spreadsheets in Python.
Python
22
star
12

edgar-oil-contracts

Ming the SEC's EDGAR system for oil contracts.
Python
20
star
13

ted

Scraper for public public procurement data from the EU's Tenders Electronic Daily (TED)
XSLT
20
star
14

jsongraph

Little JSON object want to be graphs, too!
Python
16
star
15

banal

Commons of stupid, simple Python micro functions. Pull requests very welcome.
Python
16
star
16

thready

Single-function module for multi-threading.
Python
15
star
17

prefixdate

Provide partial dates and retain the date precision through processing
Python
13
star
18

graphkit

Process data based on JSON schema
Python
11
star
19

sparqlquery

A fork of telescope, a SPARQL query building library for Python
Python
11
star
20

typecast

Simple type converters: make ints, floats, bools and dates from your strings!
Python
10
star
21

datapatch

A Python library for defining rule-based overrides on messy data
Python
8
star
22

datastringer

datawi.re Python client library
Python
6
star
23

wahlprogramme

Election platforms and some analysis code for them.
Python
5
star
24

articledata

Mini-metadata format for media content exchange
Python
5
star
25

apikit

A set of utility functions for RESTful Flask applications.
Python
5
star
26

wikidata-ftm

Experiments in converting wikidata to ftm
Python
4
star
27

investor-disputes

Scraping data sources related to investor dispute settlements, specifically the types of companies, experts and treaties which are pertinent to each case.
Python
4
star
28

foerderkatalog

Förderkatalog der Bundesregierung
Python
4
star
29

md-companies

Moldova companies parser
Python
3
star
30

fts

Scraper for data in the EU Financial Transparency System (FTS)
Python
3
star
31

tweetvote

Twitter Classification Webapp
Python
2
star
32

pudo.org

Ein neues blahg, ein besseres blahg, oh Freunde will ich generieren!
HTML
2
star
33

bundeshaushalt

German Federal Budget
Python
2
star
34

newsapps

Various smallish news apps.
JavaScript
2
star
35

dpkg-wb-privatization

Worldbank Privatization Database
Python
2
star
36

morphium

Utility functions for scrapers hosted on morph.io
Python
2
star
37

transparency-register

Extracted scraper for the EU Transparency Register
Python
2
star
38

bafin

BaFin Director's Dealings Scraper
Python
2
star
39

expert-groups

Register of Expert Groups scraper
Python
1
star
40

assets.pudo.org

Re-usable libraries.
JavaScript
1
star
41

uqbar

EXPERIMENT entity name classifier
Python
1
star
42

serbia-news-scrapers

Python
1
star
43

linkage

Prototype
Python
1
star
44

norton.pudo.org

Base docker files for projects
Makefile
1
star