• Stars
    star
    84
  • Rank 389,211 (Top 8 %)
  • Language
    Java
  • Created almost 10 years ago
  • Updated about 1 year ago

Reviews

There are no reviews yet. Be the first to send feedback to the community and the maintainers!

Repository Details

Insight Engineering Platform Components

Insight Engineering Platform

Set of base libraries used primarily by the Insight Engineering team at Netflix to support applications that need to run internally and externally.

Overview

The internal platform libraries used at Netflix, provide many useful capabilities that help make the applications more consistent, easier to debug, and generally integrate well into the internal environment. For example it is straightforward to:

  • Examine the properties and jars being used on running instance.
  • Get consistent logging and ability to adjust log levels dynamically for debugging.
  • Configuring JMX so common JVM tooling can be used through the firewall.
  • Register with Eureka and use it to communicate with other services.

Over time some of this functionality was extracted into standalone libraries, many of which have been open sourced as NetflixOSS Common Runtime Services and Libraries. Examples are Archaius (configuration), Eureka (discovery service), Karyon (base server), Ribbon (Eureka aware HTTP client), Governator (dependency injection), and blitz4j (logging).

However, there are still some gaps and for many of the libraries mentioned above an internal wrapper library needs to be used to work well internally. Originally the intent was for the internal wrappers to get deprecated and phased out, but that turned out to be harder than expected and never actually happened. Further some of these libraries like Ribbon and blitz4j are no longer receiving much investment. The IEP libraries were created as part of the work to open source Atlas to allow us to:

  • Have our open source applications be able to work the same way internally and externally.
  • Ensure that the core debugging capabilities and key internal integrations work.
  • Opt-in instead of opt-out. The internal platform provides a lot by default and you typically have to explicitly opt-out to turn off stuff you do not need. To make our apps lighter weight we wanted to explicitly opt-in instead. This has improved with some of the newer internal libraries, but is not available as part of NetflixOSS and thus doesn't satisfy our goal of consistency.

Modules

Libraries named with a prefix of iep-spring- are Spring configurations that can be pulled in. All Insight libraries should work using plain Spring, we do not rely on Governator extensions.

You can pick and choose just the set of modules you need. If one module requires another, then it will install that module explicitly so you do not need to worry about the transitive dependencies.

Module Description
iep-spring-admin Setup admin service for debugging the application.
iep-spring-atlas Configure Spectator to use AtlasRegistry.
iep-spring-aws2 Setting up and injecting AWS clients.
iep-spring-leader Default bindings for the simple leader election API.
[iep-spring-leader-dynamodb] DynamoDB bindings for the simple leader election API.
iep-spring-userservice User service for validating known email addresses.

Libraries

These are standalone libraries used in various Insight products.

Module Description
iep-admin Simple admin service to aid in debugging.
iep-leader-api Simple leader election API with a default implementation.
iep-nflxenv Configuration for accessing context from the environment.
iep-service Simple abstraction for a service that is part of an application.
iep-ses Helper for sending HTML emails with SES.

Compatibility

The IEP libraries follow a semantic versioning scheme. Backwards incompatible changes should be marked with an incremented major version number. Forwards compatibility may work, but is in not required or guaranteed. It is highly recommended that all iep-* versions in the classpath are the same.

Prior to 1.0, it was mostly backwards compatible with major changes resulting in the minor version being incremented.

Common Runtime Libraries Comparison

Where possible we will use the other NetflixOSS Common Runtime Libraries, but as the support for some of those libraries has waned the usage of several have been dropped. This section will provide a quick summary of which parts we use and the differences.

Archaius

Archaius is the primary configuration library used at Netflix. The primary feature over other alternatives is that it can communicate with a property service to allow for properties that can be changed at runtime. This can be used for things like feature flags to enable or quickly disable functionality. Insight no longer uses Archaius 2.

It should be noted that as deployment automation and velocity has increased our (Insight team) interest in runtime properties has waned. Changing runtime properties can be just as risky as a deployment and adding proper checks such as canary analysis, staggered rollout, etc to the property path means there is little advantage to using properties over just doing a new deployment. The base layer for our configuration is using the Typesafe Config library and many new uses inject that directly rather than injecting the Archaius2 Config object.

Blitz4j

Logging library providing performance improvements on top of log4j 1.x and making the logging levels configurable via properties. Most code should be using the slf4j interfaces so the logging framework is pluggable, so this decision should only matter when selecting a binding for running the application. For Insight apps we bind slf4j to vanilla log4j2 when running internally. The logging configuration is mapped to a file that is monitored by log4j2 so we can tune log levels on an instance.

Eureka

Eureka is the Netflix service discovery system. All of the Insight apps should register with Eureka to integrate well with internal systems that check this as part of ensuring the service is healthy. Insight apps will map the healthcheck endpoint to the service state so that healthcheck polling used to populate the Eureka state will accurately reflect the service state.

For client side uses, see section discussing Ribbon.

Governator

IEP is compatible with, but does not directly use or require Governator. All Insight libraries should work using plain Spring, we do not rely on Governator.

Karyon

Karyon provides the server framework and admin for internal applications. Internally it is wrapped by the base-server library, though newer applications can use Karyon 2 directly. For a long time we have only used the admin aspect of Karyon. The IEP admin is much lighter weight and was originally developed as part of Karyon 3. The Runtime team decided to go a different direction so we simplified and inlined the aspects we care about as iep-admin.

Ribbon

Ribbon is the Eureka aware HTTP client layer used heavily at Netflix. Internally it is often wrapped by the platform-ipc library and often referred to as NIWS (Netflix Integrated Web Services). Similar to the story with Karyon, Insight was an early adopter of RxNetty that was intended to become the underlying library powering Ribbon. Insight developed the [iep-rxhttp] library to guinea pig RxNetty internally. It is still heavily used by the Insight team for interacting with services that require middle tier load balancing. However, the Runtime team has since de-prioritized RxNetty and is focusing on gRPC for new use-cases. Ribbon is not used by Insight and we'll likely phase out [iep-rxhttp] over time.

Another trend is that since all Netflix services are now in the VPC, the AWS ELBs can have proper security groups. That was not true in classic and was a big reason for initially building out Eureka. Using ELBs and standard DNS means the selection of client library is less important. The main add-on we want is consistent metrics and access logs which we achieve by using the Spectator HTTP log util.

Related Projects

Open source projects maintained by the Netflix Insight team are:

  • Apps
    • Atlas: in-memory dimensional time series database.
    • Edda: read-only cache of AWS resources.
    • IEP Apps: small example apps using Insight libraries.
  • Libraries
    • AWSObjectMapper: provides mappings for Jackson 2 to allow the AWS Java SDK model objects to easily be converted to/from JSON.
    • Spectator: client library for instrumenting an application to report data into Atlas or similar data stores.
  • Legacy Libraries
    • Edda Client: client library to access Edda via the AWS Java SDK interfaces.
    • Rx AWS Java SDK: experimental implementation of an AWS Java SDK that uses RxNetty internally and automatically handles pagination via RxJava Observables.
    • Servo: legacy library for instrumenting code. Use Spectator for new projects.

More Repositories

1

Hystrix

Hystrix is a latency and fault tolerance library designed to isolate points of access to remote systems, services and 3rd party libraries, stop cascading failure and enable resilience in complex distributed systems where failure is inevitable.
Java
23,594
star
2

chaosmonkey

Chaos Monkey is a resiliency tool that helps applications tolerate random instance failures.
Go
14,410
star
3

zuul

Zuul is a gateway service that provides dynamic routing, monitoring, resiliency, security, and more.
Java
12,993
star
4

conductor

Conductor is a microservices orchestration engine.
Java
12,842
star
5

eureka

AWS Service registry for resilient mid-tier load balancing and failover.
Java
11,991
star
6

falcor

A JavaScript library for efficient data fetching
JavaScript
10,338
star
7

pollyjs

Record, Replay, and Stub HTTP Interactions.
JavaScript
10,184
star
8

metaflow

πŸš€ Build and manage real-life ML, AI, and data science projects with ease!
Python
8,012
star
9

SimianArmy

Tools for keeping your cloud operating in top form. Chaos Monkey is a resiliency tool that helps applications tolerate random instance failures.
Java
7,955
star
10

fast_jsonapi

No Longer Maintained - A lightning fast JSON:API serializer for Ruby Objects.
Ruby
5,078
star
11

vmaf

Perceptual video quality assessment based on multi-method fusion.
Python
4,563
star
12

dispatch

All of the ad-hoc things you're doing to manage incidents today, done for you, and much more!
Python
4,548
star
13

ribbon

Ribbon is a Inter Process Communication (remote procedure calls) library with built in software load balancers. The primary usage model involves REST calls with various serialization scheme support.
Java
4,468
star
14

security_monkey

Security Monkey monitors AWS, GCP, OpenStack, and GitHub orgs for assets and their changes over time.
Python
4,347
star
15

dynomite

A generic dynamo implementation for different k-v storage engines
C
4,104
star
16

vizceral

WebGL visualization for displaying animated traffic graphs
JavaScript
4,047
star
17

vector

Vector is an on-host performance monitoring framework which exposes hand picked high resolution metrics to every engineer’s browser.
JavaScript
3,588
star
18

atlas

In-memory dimensional time series database.
Scala
3,331
star
19

concurrency-limits

Java
3,216
star
20

consoleme

A Central Control Plane for AWS Permissions and Access
Python
3,114
star
21

dgs-framework

GraphQL for Java with Spring Boot made easy.
Kotlin
3,044
star
22

flamescope

FlameScope is a visualization tool for exploring different time ranges as Flame Graphs.
Python
2,979
star
23

bless

Repository for BLESS, an SSH Certificate Authority that runs as a AWS Lambda function
Python
2,722
star
24

archaius

Library for configuration management API
Java
2,435
star
25

asgard

[Asgard is deprecated at Netflix. We use Spinnaker ( www.spinnaker.io ).] Web interface for application deployments and cloud management in Amazon Web Services (AWS). Binary download: http://github.com/Netflix/asgard/releases
Groovy
2,235
star
26

curator

ZooKeeper client wrapper and rich ZooKeeper framework
Java
2,138
star
27

EVCache

A distributed in-memory data store for the cloud
Java
2,001
star
28

titus

1,995
star
29

lemur

Repository for the Lemur Certificate Manager
Python
1,651
star
30

bpftop

bpftop provides a dynamic real-time view of running eBPF programs. It displays the average runtime, events per second, and estimated total CPU % for each program.
Rust
1,647
star
31

genie

Distributed Big Data Orchestration Service
Java
1,635
star
32

metacat

Java
1,555
star
33

netflix.github.com

HTML
1,419
star
34

servo

Netflix Application Monitoring Library
Java
1,408
star
35

mantis

A platform that makes it easy for developers to build realtime, cost-effective, operations-focused applications
Java
1,406
star
36

vectorflow

D
1,287
star
37

hubcommander

A Slack bot for GitHub organization management -- and other things too
Python
1,262
star
38

rend

A memcached proxy that manages data chunking and L1 / L2 caches
Go
1,174
star
39

hollow

Hollow is a java library and toolset for disseminating in-memory datasets from a single producer to many consumers for high performance read-only access.
Java
1,148
star
40

repokid

AWS Least Privilege for Distributed, High-Velocity Deployment
Python
1,104
star
41

astyanax

Cassandra Java Client
Java
1,034
star
42

Priam

Co-Process for backup/recovery, Token Management, and Centralized Configuration management for Cassandra.
Java
1,024
star
43

aminator

A tool for creating EBS AMIs. This tool currently works for CentOS/RedHat Linux images and is intended to run on an EC2 instance.
Python
938
star
44

Turbine

SSE Stream Aggregator
Java
831
star
45

governator

Governator is a library of extensions and utilities that enhance Google Guice to provide: classpath scanning and automatic binding, lifecycle management, configuration to field mapping, field validation and parallelized object warmup.
Java
821
star
46

Fido

C#
816
star
47

suro

Netflix's distributed Data Pipeline
Java
783
star
48

spectator

Client library for collecting metrics.
Java
743
star
49

security-bulletins

Security Bulletins that relate to Netflix Open Source
734
star
50

Fenzo

Extensible Scheduler for Mesos Frameworks
Java
703
star
51

msl

Message Security Layer
C++
687
star
52

unleash

Professionally publish your JavaScript modules in one keystroke
JavaScript
590
star
53

denominator

Portably control DNS clouds using java or bash
Java
573
star
54

blitz4j

Logging framework for fast asynchronous logging
Java
559
star
55

edda

AWS API Read Cache
Scala
554
star
56

PigPen

Map-Reduce for Clojure
Clojure
551
star
57

netflix-graph

Compact in-memory representation of directed graph data
Java
548
star
58

go-env

a golang library to manage environment variables
Go
542
star
59

karyon

The nucleus or the base container for Applications and Services built using the NetflixOSS ecosystem
Java
495
star
60

Prana

A sidecar for your NetflixOSS based services.
Java
492
star
61

iceberg

Iceberg is a table format for large, slow-moving tabular data
Java
465
star
62

Lipstick

Pig Visualization framework
JavaScript
464
star
63

Surus

Java
453
star
64

aws-autoscaling

Tools and Documentation about using Auto Scaling
Shell
429
star
65

go-expect

an expect-like golang library to automate control of terminal or console based programs.
Go
422
star
66

nf-data-explorer

The Data Explorer gives you fast, safe access to data stored in Cassandra, Dynomite, and Redis.
TypeScript
420
star
67

Workflowable

Ruby
370
star
68

osstracker

Github organization OSS metrics collector and metrics dashboard
Scala
365
star
69

vizceral-example

Example Vizceral app
JavaScript
363
star
70

ndbench

Netflix Data Store Benchmark
HTML
360
star
71

Raigad

Co-Process for backup/recovery, Auto Deployments and Centralized Configuration management for ElasticSearch
Java
346
star
72

recipes-rss

RSS Reader Recipes that uses several of the Netflix OSS components
Java
339
star
73

aegisthus

A Bulk Data Pipeline out of Cassandra
Java
323
star
74

weep

The ConsoleMe CLI utility
Go
322
star
75

metaflow-ui

🎨 UI for monitoring your Metaflow executions!
TypeScript
318
star
76

titus-control-plane

Titus is the Netflix Container Management Platform that manages containers and provides integrations to the infrastructure ecosystem.
Java
316
star
77

dyno-queues

Dyno Queues is a recipe that provides task queues utilizing Dynomite.
Java
264
star
78

image_compression_comparison

Image Compression Comparison Framework
Python
258
star
79

falcor-express-demo

Demonstration Falcor end point for a Netflix-style Application using express
HTML
246
star
80

gradle-template

Java
244
star
81

ember-nf-graph

Composable graphing component library for EmberJS.
JavaScript
241
star
82

falcor-router-demo

A demonstration of how to build a Router for a Netflix-like application
JavaScript
236
star
83

titus-executor

Titus Executor is the container runtime/executor implementation for Titus
Go
233
star
84

photon

Photon is a Java implementation of the Interoperable Master Format (IMF) standard. IMF is a SMPTE standard whose core constraints are defined in the specification st2067-2:2013
Java
233
star
85

dial-reference

C
228
star
86

s3mper

s3mper - Consistent Listing for S3
Java
218
star
87

ReactiveLab

Experiments and prototypes with reactive application design.
Java
208
star
88

inviso

JavaScript
205
star
89

NfWebCrypto

Web Cryptography API Polyfill
C++
205
star
90

staash

A language-agnostic as well as storage-agnostic web interface for storing data into persistent storage systems, the metadata layer abstracts a lot of storage details and the pattern automation APIs take care of automating common data access patterns.
Java
204
star
91

zeno

Netflix's In-Memory Data Propagation Framework
Java
200
star
92

brutal

A multi-network asynchronous chat bot framework using twisted
Python
200
star
93

vizceral-react

JavaScript
199
star
94

dispatch-docker

Shell
193
star
95

metaflow-service

πŸš€ Metadata tracking and UI service for Metaflow!
Python
187
star
96

pytheas

Web Resources and UI Framework
JavaScript
187
star
97

dyno

Java client for Dynomite
Java
184
star
98

hal-9001

Hal-9001 is a Go library that offers a number of facilities for creating a bot and its plugins.
Go
178
star
99

Nicobar

Java
171
star
100

lemur-docker

Docker files for the Lemur certificate orchestration tool
Python
170
star