• Stars
    star
    134
  • Rank 265,311 (Top 6 %)
  • Language
    Java
  • License
    Other
  • Created almost 13 years ago
  • Updated over 7 years ago

Reviews

There are no reviews yet. Be the first to send feedback to the community and the maintainers!

Repository Details

Library to use Kestrel as a spout within Storm

Library to use Kestrel as a spout from within Storm.

This spout uses Kestrel's Thrift API which became available as of Kestrel 2.2.

Spout usage

The KestrelThriftSpout in this library reads messages off of one or more Kestrel servers. When using the spout, it is recommended that you increase the parallelism of the spout to increase the rate at which you can read messages from Kestrel.

The spout deals gracefully with any errors coming from Kestrel. It will blacklist Kestrel servers for 60 seconds if there is an error or timeout. After the 60 seconds is up, it will try that server again.

By default, KestrelThriftSpout emits 1-tuples containing a byte array as its output. You can provide a Scheme to the KestrelSpout to deserialize those byte arrays into a tuple structure of your choosing. This library comes with a StringScheme that will UTF-decode the byte arrays into Java strings.

Maven

storm-kestrel is hosted on the Clojars maven repo. To include it as a dependency in your project, add Clojars as a Maven repository to your pom.xml with the following snippet:

<repository>
  <id>clojars.org</id>
  <url>http://clojars.org/repo</url>
</repository>

Then, you can add storm-kestrel as a dependency like so:

<dependency>
  <groupId>storm</groupId>
  <artifactId>storm-kestrel</artifactId>
  <version>0.7.2-SNAPSHOT</version>
</dependency>

Previous versions

Previous versions of this spout required a fork of Kestrel, since it modified the memcached protocol used for communication to enable guaranteed message processing. It is highly recommended you upgrade to Kestrel 2.2 and the latest version of this spout.

More Repositories

1

storm

Distributed and fault-tolerant realtime computation: stream processing, continuous computation, distributed RPC, and more
Java
8,849
star
2

cascalog

Data processing on Hadoop without the hassle.
Clojure
1,374
star
3

storm-starter

Learn to use Storm!
Java
942
star
4

storm-contrib

A collection of spouts, bolts, serializers, DSLs, and other goodies to use with Storm
Java
580
star
5

elephantdb

Distributed database specialized in exporting key/value data from Hadoop
Java
557
star
6

storm-deploy

One click deploy for Storm clusters on AWS
Clojure
517
star
7

dfs-datastores

Dead-simple vertical partitioning, compression, appends, and consolidation of data on a distributed filesystem.
Java
216
star
8

kafka-deploy

Automated deploy for Kafka on AWS
Clojure
124
star
9

storm-mesos

Run Storm on top of the Mesos cluster resource manager
Java
68
star
10

nanny

A simple dependency management system for your projects.
Python
46
star
11

cascalog-contrib

Java
45
star
12

trident-memcached

Trident state implementation for Memcached
Java
41
star
13

cascalog-demo

A short Cascalog program that produces a simplified version of a Facebook-like news feed.
Clojure
26
star
14

basic-specter

Implementation of core of Specter without any optimizations – a reference to understand the basics of how Specter works
Clojure
23
star
15

cascading-batch-query

Optimized joins using bloom filters on Hadoop via Cascading.
Java
21
star
16

cascalog-workshop

Materials for Cascalog workshop
Clojure
18
star
17

elephantdb-cascalog

Seamless integration of ElephantDB with Cascalog
Clojure
18
star
18

trident-kafka

NOTE: This project has been moved into storm-kafka in storm-contrib
Java
15
star
19

elephantdb-cascading

Adapters to write to ElephantDB using Cascading
Java
13
star
20

specter-demo

Code for Strange Loop talk on Specter
Clojure
13
star
21

cascalog-conj

Code from my presentation of Cascalog at Clojure/conj 2011
Clojure
10
star
22

storm-website

Source for storm-project.net
CSS
7
star
23

thrift-dev

Apache Thrift + additional patches that I need
C++
6
star
24

specter-clojure-west

Clojure
6
star
25

swarm

Intense Space Invaders-like game with "terminal graphics"
C++
5
star
26

warzone

Turn based strategy game
Java
4
star
27

formula-inverse

A high-speed 3D racing game where the track can curve any which way and your car is bound to the track
C
4
star
28

specter-wiki

Repository for wiki of https://github.com/redplanetlabs/specter
4
star
29

cascalog-workshop-starter

Starter code for Cascalog workshop
Clojure
2
star
30

specter-presentation

Clojure
2
star