• Stars
    star
    38
  • Rank 706,870 (Top 14 %)
  • Language
    Java
  • Created almost 16 years ago
  • Updated about 13 years ago

Reviews

There are no reviews yet. Be the first to send feedback to the community and the maintainers!

Repository Details

Sample applications using Cascading

More Repositories

1

cascading

Cascading is a feature rich API for defining and executing complex and fault tolerant data processing flows locally or on a cluster.
Java
344
star
2

cascading.hbase

HBase adapters for Cascading
Java
46
star
3

bash-emr

Simple bash functions for manipulating Amazon Elastic MapReduce clusters
Shell
45
star
4

riffle

Annotations and Classes for managing and executing dependent processes
Java
39
star
5

cascading.jdbc

JDBC adapter for Cascading
Java
24
star
6

cascading.multitool

Cascading.Multitool is a sed and grep command line tool for Apache Hadoop.
Java
21
star
7

cascading.groovy

A Groovy DSL for Cascading
Java
11
star
8

notebook

Random notes on distributed computing and stuff.
10
star
9

cascading.memcached

Memecached/Membase/ElasticSearch integration for Cascading
Java
9
star
10

cascading.load

A simple command line interface for building high load cluster jobs.
Java
4
star
11

cascading.bind

Java
4
star
12

feed2torrent

A Python script that will fetch torrent files from an atom/rss feed and download the files they reference.
3
star
13

bash-ec2

A simple script for initializing an EC2 command line environment
Shell
2
star
14

docbook-template

A template for creating new DocBook projects
2
star
15

docbook-framework

A fork of the Velocity DocBook Framework
2
star
16

cascading-local

Now incorporated into Cascading 4.x
Java
2
star
17

cascading.work

Cascading.Work provides a simple framework for creating complex Cascading applications that work with data that must be accessed via multiple data format across multiple systems.
Java
2
star
18

cascading-regression

Groovy
1
star