• Stars
    star
    4
  • Rank 3,304,323 (Top 66 %)
  • Language
    Java
  • License
    Other
  • Created about 13 years ago
  • Updated about 12 years ago

Reviews

There are no reviews yet. Be the first to send feedback to the community and the maintainers!

Repository Details

More Repositories

1

cascading

Cascading is a feature rich API for defining and executing complex and fault tolerant data processing flows locally or on a cluster.
Java
344
star
2

cascading.hbase

HBase adapters for Cascading
Java
46
star
3

bash-emr

Simple bash functions for manipulating Amazon Elastic MapReduce clusters
Shell
45
star
4

riffle

Annotations and Classes for managing and executing dependent processes
Java
39
star
5

cascading.samples

Sample applications using Cascading
Java
38
star
6

cascading.jdbc

JDBC adapter for Cascading
Java
24
star
7

cascading.multitool

Cascading.Multitool is a sed and grep command line tool for Apache Hadoop.
Java
21
star
8

cascading.groovy

A Groovy DSL for Cascading
Java
11
star
9

notebook

Random notes on distributed computing and stuff.
10
star
10

cascading.memcached

Memecached/Membase/ElasticSearch integration for Cascading
Java
9
star
11

cascading.load

A simple command line interface for building high load cluster jobs.
Java
4
star
12

feed2torrent

A Python script that will fetch torrent files from an atom/rss feed and download the files they reference.
3
star
13

bash-ec2

A simple script for initializing an EC2 command line environment
Shell
2
star
14

docbook-template

A template for creating new DocBook projects
2
star
15

docbook-framework

A fork of the Velocity DocBook Framework
2
star
16

cascading-local

Now incorporated into Cascading 4.x
Java
2
star
17

cascading.work

Cascading.Work provides a simple framework for creating complex Cascading applications that work with data that must be accessed via multiple data format across multiple systems.
Java
2
star
18

cascading-regression

Groovy
1
star