• Stars
    star
    228
  • Rank 175,115 (Top 4 %)
  • Language
    C++
  • Created almost 8 years ago
  • Updated over 1 year ago

Reviews

There are no reviews yet. Be the first to send feedback to the community and the maintainers!

Repository Details

Core aware thread management system

Arachne: Towards Core-Aware Scheduling

What is core aware scheduling?

In today's large-scale data center systems, there are many complex software components which make a binary trade-off between latency and throughput. They either overprovision their systems to obtain lower latencies and consequently waste resources, or oversubscribe their systems and experience very high latencies due to imbalance between application load and system resources.

Core-aware scheduling is the notion that we can balance an application's offered load to a system's available resources by scheduling threads at user level, and performing coarse-grained core allocation at operating system level.

Under this approach, the kernel no longer preemptively multiplexes between threads without any awareness of what the application is doing. This enables us to avoid the performance degradations caused by slow context switches, priority inversion, and cache pollution from the threads of other processes.

What is Arachne?

According to Greek mythology, Arachne was a mortal weaver who challenged the goddess Athena to a weaving competition. Similarly, the Arachne user threading system attempts to challenge the current dominance of kernel threads in the C++ world.

Arachne is the first step towards core-aware scheduling, allowing an application to run only as many threads in parallel as cores available to it.

Arachne is a user-level, cooperative thread management system written in C++, designed to improve core utilization and maximize throughput in server applications without impacting latency. It performs M:N scheduling over kernel threads running exclusively on CPU cores and features ~200 ns cross-core thread creations and ~100 ns cross-core signals on Nehalem X3470. Arachne also estimates CPU load and adjusts the number of cores accordingly.

How do I use it?

  1. Recursively clone Arachne super repository.

     git clone --recursive https://github.com/PlatformLab/arachne-all.git
    
  2. Build the library with ./buildAll.sh in the top level directory.

     cd arachne-all
     ./buildAll.sh
    
  3. Write your application using the public Arachne API, documented here.

    #include <stdio.h>
    #include "Arachne/Arachne.h"

    void numberPrinter(int n) {
        printf("NumberPrinter says %d\n", n);
    }

    // This is where user code should start running.
    void AppMain(int argc, const char** argv) {
        printf("Arachne says hello world and creates a thread.\n");
        auto tid = Arachne::createThread(numberPrinter, 5);
        Arachne::join(tid);
    }

    // The following bootstrapping code should be copied verbatim into most Arachne
    // applications.
    void AppMainWrapper(int argc, const char** argv) {
        AppMain(argc, argv);
        Arachne::shutDown();
    }
    int main(int argc, const char** argv){
        Arachne::init(&argc, argv);
        Arachne::createThread(&AppMainWrapper, argc, argv);
        Arachne::waitForTermination();
    }
  1. Link your application against Arachne.

     g++ -std=c++11 -o MyApp MyApp.cc  -Iarachne-all/Arachne/include -Iarachne-all/CoreArbiter/include  -Iarachne-all/PerfUtils/include -Larachne-all/Arachne/lib -lArachne -Larachne-all/CoreArbiter/lib -lCoreArbiter -Larachne-all/PerfUtils/lib/ -lPerfUtils  -lpcrecpp -pthread
    

User Threading vs Kernel Threadpool

For those who are unfamiliar with the benefits of user threading, it may seem that a simple kernel thread pool would achieve the same result as a user threading library. However, tasks running in a kernel thread pool generally should not block at user level, so they must run to completion without blocking.

Here is an example of a use case that would require manual stack ripping in a thread pool, but could be implemented as a single function under Arachne.

More Repositories

1

NanoLog

Nanolog is an extremely performant nanosecond scale logging system for C++ that exposes a simple printf-like API.
C++
2,987
star
2

RAMCloud

**No Longer Maintained** Official RAMCloud repo
C++
487
star
3

Homa

Low-Latency Data Center Network Transport
C++
191
star
4

HomaModule

A Linux kernel module that implements the Homa transport protocol.
C
175
star
5

PerfUtils

A collection of eclectic tools for measuring performance using the cycle counter and pinning threads.
C++
36
star
6

HomaSimulation

C++
34
star
7

grpc_homa

Allows Homa to be used as a transport with gRPC.
C++
25
star
8

ldbc-snb-impls

A collection of workload implementations for the LDBC SNB benchmark driver
Java
20
star
9

TorcDB

TorcDB: A Low-Latency Graph Database on RAMCloud
Java
17
star
10

mappy

Demo re-implementation of the Hadoop MapReduce scheduler in Python
Java
13
star
11

CoreArbiter

C++
11
star
12

memcached-A

The repo for both original memcached and memcached-A
C
8
star
13

CacheTools

Tools for measuring the speed of various CPU Caches
C++
8
star
14

Ramdis

A RAMCloud-based implementation of Redis
C
6
star
15

Log-Analyzer

Collection of scripts to statically analyze log statements in open-source software.
Python
5
star
16

arachne-all

Arachne thread management together with all dependencies and benchmarks.
Shell
5
star
17

memtier_skewsyn

Memcached skew and colocation workload benchmarking tool (modified from https://github.com/RedisLabs/memtier_benchmark).
C++
4
star
18

ramcloud-ycsb

Runs YCSB benchmarks using RAMCloud for storage
HTML
3
star
19

Roo

Multi-Hop Communication Framework
C++
3
star
20

homa-paper-artifact

Materials for the artifact evaluation of the SIGCOMM'18 Homa paper
R
3
star
21

koho

A robust multipath tunnel
C++
3
star
22

MPSCQueueBenchmarks

A framework for benchmarking multiple-consumer-single-producer queues
C++
2
star
23

consistentRedis

C
2
star
24

gtest

Mirror of gtest for use with RAMCloud
C++
2
star
25

ArachnePerfTests

Microbenchmarks for Arachne threading library.
C++
2
star
26

PlatformLab-Identity

Platform Lab logos, symbols, and design guidelines
2
star
27

x264

Mirror of the x264 repository by VLC.
C
1
star