• Stars
    46
  • Rank 613,923 (Top 13 %)
  • Language
    Ruby
  • License
    MIT License
  • Created about 15 years ago
  • Updated about 15 years ago

Repository Details

Uses parselets and rwget to generate CSV files from websites

More Repositories

1

parsley

Parsley is a simple language for extracting structured data from web pages. Parsley consists of a powerful selector language wrapped with a JSON structure that can represent page-wide formatting.
Shell
908
star
2

pquery

A JavaScript port of Parsley
JavaScript
46
star
3

robots

robots.txt parser
Ruby
40
star
4

pyparsley

python binding for parsley
C
40
star
5

sit

streaming index tool
C
34
star
6

thread_pool

Ruby Thread Pool
Ruby
29
star
7

fusefs-osx

Ruby fusefs for OS X
C
15
star
8

parsley-ruby

ruby binding for parsley
Ruby
15
star
9

scala-bootstrapper

Ruby
11
star
10

node-avro

node -> avro bindings
C
9
star
11

libbow-osx

libbow text classifier framework with patches that make it compile (for me) on OS X Snow Leopard
C
7
star
12

libregexp9

Copy of the *nix port of Plan9's regexp engine
C
6
star
13

rwget

a subset of recursive wget's functionality, but with regular expressions and sitemaps.
Ruby
6
star
14

skynet-for-twitter

Fork of tweetero to support Bayes filtering of tweets
Objective-C
6
star
15

parselets_com

JavaScript
6
star
16

pincer

Shell
5
star
17

snow

Solr NOW, realtime search
Scala
5
star
18

jailer

Tiny golang application for parsing log files into json logs
Go
5
star
19

rfeedparser

mirror of rfeedparser
Ruby
4
star
20

sandbox-keyhole

Ruby
4
star
21

goingo

Go
4
star
22

spraytan

Instant sunlight (labs, i.e. for 50 state parsing)
Ruby
3
star
23

date_range

Ruby
3
star
24

scripts

random scripts I find useful
3
star
25

csvget-ec2-recipe

CSVGet on EC2
Ruby
3
star
26

ordered_json

JSON parser to OrderedHash (Ruby)
Shell
3
star
27

collapsed_routes

Gem to clean up rails routes
Ruby
2
star
28

rakeutil

simple rake utilities
Ruby
2
star
29

yawn

Send and forget http library for ruby.
Ruby
2
star
30

empty

2
star
31

mod_api_limit

apache2 module to limit api access
2
star
32

eq

Ruby
2
star
33

osx-devtools-bootstrap

MacPorts, etc basic install for a Ruby-centric developer
2
star
34

http_echo_server

Ruby
2
star
35

twscala

2
star
36

sandbox-html_cleaner

Kinda like tidy, except buggy
Ruby
2
star
37

stfu-hoe

Fake hoe gem
2
star
38

remind

Micro-gem to allow growl notifications when a command finishes.
2
star
39

parsley-on-legs

Parsley + jQuery + 80legs
JavaScript
2
star
40

logs

golang logging
Go
2
star
41

drupal-apachesolr

PHP
2
star
42

git_graphs

Couple graph formats for git logs
2
star
43

sandbox-loggable

making ruby's logger easier to mix into classes
Ruby
2
star
44

assertion_fu

Grab bag of Ruby Test::Unit assertions
Ruby
2
star
45

uberchronic

Like chronic, plus GNU getdate to cover edge cases
2
star
46

tablegrok

HTML Tables to Ruby
Ruby
2
star
47

gizzard-toy

playing around with gizzard on vacation
Scala
2
star
48

redump

convert single-table INSERT-style mysqldumps into tab-separated values (TSV)
C
2
star
49

kylemaxwell_com

My personal site
JavaScript
2
star
50

kylemaxwell

home page
Ruby
1
star
51

govquery

1
star
52

taste

*Abandoned* aborted attempt to make some random distributed toolkit.
Scala
1
star
53

capsh

cap shell isolated
Ruby
1
star
54

docsy

Ruby
1
star
55

dsafadsfsdfads

1
star
56

plaid

Logging
Ruby
1
star
57

sometweets

Ruby
1
star
58

contentlogic

EXPERIMENTAL!
1
star
59

scala-style

command-line utils to maintain
Ruby
1
star
60

pullme

Ruby
1
star
61

solrj-hmac-auth

websolr hmac authentication demo
Java
1
star
62

kissunit

REALLY simple JavaScript testing
JavaScript
1
star
63

geosolr

Java
1
star
64

plain_option_parser

Ruby
1
star
65

straitjacket

Postgres constraints in ruby
Ruby
1
star
66

factorial

*Abandoned* toy for editing web pages.
JavaScript
1
star
67

crapshoot

1
star
68

rolling-restart

1
star
69

solr-toy

scaffold + acts_as_solr
Ruby
1
star
70

hiring

1
star
71

multilingual_tokenizer

multilingual tokenizer for lucene/solr
1
star
72

rmv

regex mv
Ruby
1
star
73

protoc-gen-thrift

Go
1
star
74

squared2csv

Creates a csv file from google squared
Ruby
1
star
75

latch

This is a really simple countdown latch for Ruby.
Ruby
1
star
76

slow

A TCP reverse proxy that responds slowly
Ruby
1
star
77

act

act
JavaScript
1
star
78

sing

Scala Interpreting NailGun (fast scala tools)
1
star
79

pq

just the persistent queue from kestrel
Scala
1
star
80

erb-tidy

Like tidy, but handles erb tags the way I'd prefer
1
star
81

dotfiles

1
star
82

csvtoy

awk-like CSV manipulation in Ruby
Ruby
1
star
83

scala-maven-eclipse-twitter-scalatest-scaffold

Scala
1
star
84

stringset

String intersection calculations
Ruby
1
star
85

jira

Ruby
1
star
86

mailflow

Ruby
1
star
87

jane

An experiment
Swift
1
star
88

should_eventually

should_eventually is an rspec convention for testing async code.
1
star
89

balls

JavaScript
1
star
90

edhd

Ruby
1
star
91

github-contest

Java
1
star