Top Rating
- Top Contributors
  Discover the Top Open Source contributors by country or by language
- Interviews
  Discover real stories from Open Source developers
Discover

Discover your Favorite Language
Discover the top trending repositories and projects on Github. Explore the latest trends in your preferred languages.

Dart

Shell

C

C++

R

Solidity

F#

Elixir

More Languages
Awesome

Awesome repositories
Discover the most awesome repositories and projects of your favorite languages. Inspired by the Awesome-* lists trend in GitHub.

Lua

PowerShell

Go

PHP

F#

Perl

Julia

Elixir

More Languages
By Country

Rankings by Country
Discover the community of talented open source contributors in each country.

🇸🇾 Syria

🇦🇴 Angola

🇶🇦 Qatar

🇬🇮 Gibraltar

🇳🇺 Niue

🇹🇿 Tanzania

🇨🇲 Cameroon

🇲🇷 Mauritania

All Countries Compare Countries

twitter-archive/haplocheirus

This repository has been archived on 18/Sep/2021
Stars
133
Rank 272,600 (Top 6 %)
Language
Scala
License
Other
Created over 14 years ago
Updated almost 8 years ago

twitter-archive/haplocheirus

twitter-archive

There are no reviews yet. Be the first to send feedback to the community and the maintainers!

A Redis-backed storage engine for timelines

STATUS

Twitter is no longer maintaining this project or responding to issues or PRs.

Haplocheirus

Haplocheirus is a redis-backed storage engine for timelines.

Disclaimer: This project is an experiment, and is not complete or deployed anywhere yet.

Timelines are lists of 64-bit ids, possibly with a small bit of attached metadata, in preserved order. New entries are always added at the "front". Old entries may be dropped off the "back" if the timeline exceeds a maximum size. Two timelines can be merged by assuming they're roughly sorted by id. A timeline query returns a slice of the timeline, newest first.

Goals

Highly available, partitioned
Structured vectors (each timeline entry is an atomic blob, not necessarily all alike)
Homogeneous service interface
Eliminate client side hashing
Preserve ability to expire timelines that aren't being read
Durable snapshots
Idempotent/commutative

Non-goals

Contain business logic for building timelines from scratch
Transactionally durable timelines

Structure

Gizzard is used to handle partitioning and job queueing, and Redis is used as the backend storage for each shard.

New features from redis 2.2 are required (LPUSHX and LINSERT for example), so for now, you will need to build redis from trunk: http://github.com/antirez/redis

Building

You need:

java 1.6
thrift 0.2.0
sbt 0.7.4
redis-server (2.2 trunk; see above)

You might want:

haplocheirus-client http://github.com/bitbckt/haplocheirus-client

A special build of jredis is used, too, but currently the jar is included in the repo.

Then:

$ sbt clean update package-dist

Running

Start up your local redis server, then:

$ ./dist/haplocheirus-1.0/scripts/setup-env.sh

Community

License: Apache 2 (see included LICENSE file)

IRC: #twinfra on freenode (irc.freenode.net)

snowflake

Snowflake is a network service for generating unique ID numbers at high scale with some simple guarantees.

diffy

Find potential bugs in your services with Diffy

flockdb

A distributed, fault-tolerant graph database

kestrel

simple, distributed message queue system (inactive)

twui

A UI framework for Mac based on Core Animation

CocoaSPDY

SPDY for iOS and OS X

gizzard

[Archived] A flexible sharding framework for creating eventually-consistent distributed datastores

distributedlog

A high performance replicated log service. (The development is moved to Apache Incubator)

recess

A simple and attractive code quality tool for CSS built on top of LESS

commons

Twitter common libraries for python and the JVM (deprecated)

iago

A load generator, built for engineers

twitter-text-js

A JavaScript implementation of Twitter's text processing library

ambrose

A platform for visualization and real-time monitoring of data workflows

twitter-kit-android

Twitter Kit for Android

ostrich

A stats collector & reporter for Scala servers (deprecated)

twitter-kit-ios

Twitter Kit is a native SDK to include Twitter content inside mobile apps.

twitter-text-rb

A library that does auto linking and extraction of usernames, lists and hashtags in tweets

mysos

Cotton (formerly known as Mysos)

twitter-text-objc

An Objective-C implementation of Twitter's text processing library

torch-autograd

Autograd automatically differentiates native Torch code

ospriet

An example audience moderation app built on Twitter

cloudhopper-smpp

Efficient, scalable, and flexible Java implementation of the Short Messaging Peer to Peer Protocol (SMPP)

twitter-text-java

A Java implementation of Twitter's text processing library

jvmgcprof

A simple utility for profile allocation and garbage collection activity in the JVM

css-flip

A CSS BiDi flipper

clockworkraven

Human-Powered Data Analysis with Mechanical Turk

torch-twrl

Torch-twrl is a package that enables reinforcement learning in Torch.

cassie

A Scala client for Cassandra

twemperf

A tool for measuring memcached server performance

hdfs-du

Visualize your HDFS cluster usage

pycascading

A Python wrapper for Cascading

RTLtextarea

Automatically detects RTL and configures a text input

standard-project

A slightly more standard sbt project plugin library

torch-decisiontree

This project implements random forests and gradient boosted decision trees (GBDT). The latter uses gradient tree boosting. Both use ensemble learning to produce ensembles of decision trees (that is, forests).

elephant-twin

Elephant Twin is a framework for creating indexes in Hadoop

torch-ipc

A set of primitives for parallel computation in Torch

torch-distlearn

A set of distributed learning algorithms for Torch

libcrunch

A lightweight mapping framework that maps data objects to a number of nodes, subject to constraints

scribe

A Ruby client library for Scribe

sbt-package-dist

sbt 11 plugin codifying best practices for building, packaging, and publishing

twisitor

A simple and spectacular photo-tweeting birdhouse

flockdb-client

A Ruby client library for FlockDB

code-of-conduct

Open Source Code of Conduct at Twitter

twitter-text-conformance

Conformance testing data for the twitter-text-* repositories

torch-dataset

An extensible and high performance method of reading, sampling and processing data for Torch

cdk

CDK is a tool to quickly generate single-file html slide presentations from AsciiDoc

naggati2

Protocol builder for netty using scala (DEPRECATED)

twitter-kit-unity

Twitter Kit for Unity

plumage.js

Batteries Included App Framework for Data Intensive UIs

gozer

Prototype mesos framework using new low-level API built in Go

bookkeeper

Twitter's fork of Apache BookKeeper (will push changes upstream eventually)

grabby-hands

A JVM Kestrel client that aggregates queues from multiple servers. Implemented in Scala with Java bindings. In use at Twitter for all JVM Search and Streaming Kestrel interactions.

gizzmo

A command-line client for Gizzard

thrift

Twitter's out-of-date, forked thrift

libkestrel

time_constants

Time constants, in seconds, so you don't have to use slow ActiveSupport helpers

sbt-scrooge

An SBT plugin that adds a mixin for doing Thrift code auto-generation during your compile phase

cli-guide.js

CLI Guide JQuery Plugin

sbt-thrift

sbt rules for generating source stubs out of thrift IDLs, for java & scala

jaqen

A type-safe heterogenous Map or a Named field Tuple

spitball

A very simple gem package generation tool built on bundler

torch-thrift

A Thrift codec for Torch

jsr166e

JSR166e for Twitter

unishark

Unishark: Another unittest extension for Python

raggiana

A simple standalone Finagle stats viewer

sekhmet

foundational tools and building blocks for gaining insights and diagnosing system health in real-time

periscope-live-engagement-unity-sdk

Periscope Live Engagement Unity SDK

twitterActors

Improved Scala actors library; used internally at Twitter

finatra-activator-http-seed

Typesafe activator template for constructing a Finatra HTTP server application:

killdeer

Killdeer is a simple server for replaying a sample of responses to sythentically recreate production response characteristics.

elephant-twin-lzo

Elephant Twin LZO uses Elephant Twin to create LZO block indexes

bittern

Bittern Cache uses nvdimm to speed up block io operations

finatra-activator-thrift-seed

Typesafe activator template for constructing a Finatra Thrift server application: https://twitter.github.io/finatra/user-guide/ —

chainsaw

A thin Scala wrapper for SLF4J

PerfTracepoint

Perf tracepoint support for the JVM

oscon-puzzles

OSCON 2014 Puzzle

scala-json

JSON in Scala (deprecated)

scala-csp-config

A Scala library for configuring Content Security Policy headers for HTTP responses.

.github

finatra-misc

Miscellaneous libraries and utils used by Finatra

autolog-clustering

USF Capstone Project for Auto-log Clustering