• Stars
    star
    487
  • Rank 90,247 (Top 2 %)
  • Language
    C++
  • License
    Other
  • Created over 9 years ago
  • Updated about 5 years ago

Reviews

There are no reviews yet. Be the first to send feedback to the community and the maintainers!

Repository Details

**No Longer Maintained** Official RAMCloud repo

RAMCloud

Build Status

For up to date information on how to install and use RAMCloud, see the RAMCloud Wiki: https://ramcloud.atlassian.net/wiki/spaces/RAM/overview

What is RAMCloud?

note: the following is an excerpt copied from the RAMCloud wiki on 1/22/16.

RAMCloud is a new class of super-high-speed storage for large-scale datacenter applications. It is designed for applications in which a large number of servers in a datacenter need low-latency access to a large durable datastore. RAMCloud offers the following properties:

  • Low Latency: RAMCloud keeps all data in DRAM at all times, so applications can read RAMCloud objects remotely over a datacenter network in as little as 5μs. Writes take less than 15μs. Unlike systems such as memcached, applications never have to deal with cache misses or wait for disk/flash accesses. As a result, RAMCloud storage is 10-1000x faster than other available alternatives.
  • Large scale: RAMCloud aggregates the DRAM of thousands of servers to support total capacities of 1PB or more.
  • Durability: RAMCloud replicates all data on nonvolatile secondary storage such as disk or flash, so no data is lost if servers crash or the power fails. One of RAMCloud's unique features is that it recovers very quickly from server crashes (only 1-2 seconds) so the availability gaps after crashes are almost unnoticeable. As a result, RAMCloud combines the durability of replicated disk with the speed of DRAM. If you have used memcached, you have probably experienced the challenges of managing a second durable storage system and maintaining consistency between it and memcached. With RAMCloud, there is no need for a second storage system.
  • Powerful data model: RAMCloud's basic data model is a key-value store, but we have extended it with several additional features, such as:
    • Multiple tables, each with its own key space.
    • Transactional updates that span multiple objects in different tables.
    • Secondary indices.
    • Strong consistency: unlike other NoSQL storage systems, all updates in RAMCloud are consistent, immediately visible, and durable.
  • Easy deployment: RAMCloud is a software package that runs on commodity Intel servers with the Linux operating system. RAMCloud is available freely in open source form.

From a practical standpoint, RAMCloud enables a new class of applications that manipulate large data sets very intensively. Using RAMCloud, an application can combine tens of thousands of items of data in real time to provide instantaneous responses to user requests. Unlike traditional databases, RAMCloud scales to support very large applications, while still providing a high level of consistency. We believe that RAMCloud, or something like it, will become the primary storage system for structured data in cloud computing environments such as Amazon's AWS or Microsoft's Azure. We have built the system not as a research prototype, but as a production-quality software system, suitable for use by real applications.

RAMCloud is also interesting from a research standpoint. Its two most important attributes are latency and scale. The first goal is to provide the lowest possible end-to-end latency for applications accessing the system from within the same datacenter. We currently achieve latencies of around 5μs for reads and 15μs for writes, but hope to improve these in the future. In addition, the system must scale, since no single machine can store enough DRAM to meet the needs of large-scale applications. We have designed RAMCloud to support at least 10,000 storage servers; the system must automatically manage all the information across the servers, so that clients do not need to deal with any distributed systems issues. The combination of latency and scale has created a large number of interesting research issues, such as how to ensure data durability without sacrificing the latency of reads and writes, how to take advantage of the scale of the system to recover very quickly after crashes, how to manage storage in DRAM, and how to provide higher-level features such as secondary indexes and multiple-object transactions without sacrificing the latency or scalability of the system. Our solutions to these problems are described in a series of technical papers.

The RAMCloud project is based in the Department of Computer Science at Stanford University.

Learn More about RAMCloud

https://ramcloud.atlassian.net/wiki/spaces/RAM/overview

More Repositories

1

NanoLog

Nanolog is an extremely performant nanosecond scale logging system for C++ that exposes a simple printf-like API.
C++
2,987
star
2

Arachne

Core aware thread management system
C++
228
star
3

Homa

Low-Latency Data Center Network Transport
C++
191
star
4

HomaModule

A Linux kernel module that implements the Homa transport protocol.
C
175
star
5

PerfUtils

A collection of eclectic tools for measuring performance using the cycle counter and pinning threads.
C++
36
star
6

HomaSimulation

C++
34
star
7

grpc_homa

Allows Homa to be used as a transport with gRPC.
C++
25
star
8

ldbc-snb-impls

A collection of workload implementations for the LDBC SNB benchmark driver
Java
20
star
9

TorcDB

TorcDB: A Low-Latency Graph Database on RAMCloud
Java
17
star
10

mappy

Demo re-implementation of the Hadoop MapReduce scheduler in Python
Java
13
star
11

CoreArbiter

C++
11
star
12

memcached-A

The repo for both original memcached and memcached-A
C
8
star
13

CacheTools

Tools for measuring the speed of various CPU Caches
C++
8
star
14

Ramdis

A RAMCloud-based implementation of Redis
C
6
star
15

Log-Analyzer

Collection of scripts to statically analyze log statements in open-source software.
Python
5
star
16

arachne-all

Arachne thread management together with all dependencies and benchmarks.
Shell
5
star
17

memtier_skewsyn

Memcached skew and colocation workload benchmarking tool (modified from https://github.com/RedisLabs/memtier_benchmark).
C++
4
star
18

ramcloud-ycsb

Runs YCSB benchmarks using RAMCloud for storage
HTML
3
star
19

Roo

Multi-Hop Communication Framework
C++
3
star
20

homa-paper-artifact

Materials for the artifact evaluation of the SIGCOMM'18 Homa paper
R
3
star
21

koho

A robust multipath tunnel
C++
3
star
22

MPSCQueueBenchmarks

A framework for benchmarking multiple-consumer-single-producer queues
C++
2
star
23

consistentRedis

C
2
star
24

gtest

Mirror of gtest for use with RAMCloud
C++
2
star
25

ArachnePerfTests

Microbenchmarks for Arachne threading library.
C++
2
star
26

PlatformLab-Identity

Platform Lab logos, symbols, and design guidelines
2
star
27

x264

Mirror of the x264 repository by VLC.
C
1
star