🇺🇸 Made in United States

Discover the United States' leading open source projects: explore top open source initiatives from the United States tech community.

Top Scala Projects

1
twitter/the-algorithm

Source code for Twitter's Recommendation Algorithm
Scala
61,982
star
2
prisma/prisma1

💾 Database Tools incl. ORM, Migrations and Admin UI (Postgres, MySQL & MongoDB) [deprecated]
Scala
16,552
star
3
twitter/finagle

A fault tolerant, protocol-agnostic RPC system
Scala
8,769
star
4
twitter-archive/snowflake

Snowflake is a network service for generating unique ID numbers at high scale with some simple guarantees.
Scala
7,648
star
5
enso-org/enso

Hybrid visual and textual functional programming.
Scala
7,342
star
6
microsoft/SynapseML

Simple and Distributed Machine Learning
Scala
5,041
star
7
airbnb/aerosolve

A machine learning package built for humans.
Scala
4,795
star
8
mesos/chronos

Fault tolerant job scheduler for Mesos which handles dependencies and ISO8601 based schedules
Scala
4,388
star
9
mesosphere/marathon

Deploy and manage containers (including Docker) on top of Apache Mesos at scale.
Scala
4,065
star
10
twitter-archive/diffy

Find potential bugs in your services with Diffy
Scala
3,825
star
11
twitter/scalding

A Scala API for Cascading
Scala
3,497
star
12
twitter-archive/flockdb

A distributed, fault-tolerant graph database
Scala
3,337
star
13
Netflix/atlas

In-memory dimensional time series database.
Scala
3,331
star
14
awslabs/deequ

Deequ is a library built on top of Apache Spark for defining "unit tests for data", which measure data quality in large datasets.
Scala
2,871
star
15
twitter-archive/kestrel

simple, distributed message queue system (inactive)
Scala
2,774
star
16
twitter/util

Wonderful reusable code from Twitter
Scala
2,686
star
17
databricks/Spark-The-Definitive-Guide

Spark: The Definitive Guide's Code Repository
Scala
2,678
star
18
twitter/algebird

Abstract Algebra for Scala
Scala
2,288
star
19
twitter/finatra

Fast, testable, Scala services built on TwitterServer and Finagle
Scala
2,272
star
20
twitter-archive/gizzard

[Archived] A flexible sharding framework for creating eventually-consistent distributed datastores
Scala
2,256
star
21
twitter/summingbird

Streaming MapReduce with Scalding and Storm
Scala
2,138
star
22
metarank/metarank

A low code Machine Learning personalized ranking service for articles, listings, search results, recommendations that boosts user engagement. A friendly Learn-to-Rank engine
Scala
2,042
star
23
feathr-ai/feathr

Feathr – A scalable, unified data and AI engineering platform for enterprise
Scala
1,978
star
24
sangria-graphql/sangria

Scala GraphQL implementation
Scala
1,962
star
25
riscv-boom/riscv-boom

SonicBOOM: The Berkeley Out-of-Order Machine
Scala
1,710
star
26
ucb-bar/chipyard

An Agile RISC-V SoC Design Framework with in-order cores, out-of-order cores, accelerators, and more
Scala
1,601
star
27
ThoughtWorksInc/Binding.scala

Reactive data-binding for Scala
Scala
1,579
star
28
twitter/twitter-server

Twitter-Server defines a template from which services at Twitter are built
Scala
1,567
star
29
GravityLabs/goose

Html Content / Article Extractor in Scala - open sourced from Gravity Labs
Scala
1,528
star
30
sryza/aas

Code to accompany Advanced Analytics with Spark from O'Reilly Media
Scala
1,518
star
31
holdenk/spark-testing-base

Base classes to use when writing tests with Spark
Scala
1,513
star
32
combust/mleap

MLeap: Deploy ML Pipelines to Production
Scala
1,479
star
33
pathikrit/better-files

Simple, safe and intuitive Scala I/O
Scala
1,473
star
34
vkostyukov/scalacaster

Purely Functional Algorithms and Data Structures in Scala
Scala
1,455
star
35
paypal/squbs

Akka Streams & Akka HTTP for Large-Scale Production Deployments
Scala
1,433
star
36
mauricio/postgresql-async

Async, Netty based, database drivers for PostgreSQL and MySQL written in Scala
Scala
1,429
star
37
locationtech/geomesa

GeoMesa is a suite of tools for working with big geo-spatial data in a distributed fashion.
Scala
1,428
star
38
mesos/spark

Lightning-fast cluster computing in Java, Scala and Python.
Scala
1,426
star
39
twitter-archive/iago

A load generator, built for engineers
Scala
1,347
star
40
locationtech/geotrellis

GeoTrellis is a geographic data processing engine for high performance applications.
Scala
1,339
star
41
twitter/rsc

Experimental Scala compiler focused on compilation speed
Scala
1,243
star
42
sryza/spark-timeseries

A library for time series analysis on Apache Spark
Scala
1,192
star
43
wavesplatform/Waves

⛓️ Reference Waves Blockchain Node (client) implementation on Scala
Scala
1,169
star
44
databricks/LearningSparkV2

This is the github repo for Learning Spark: Lightning-Fast Data Analytics [2nd Edition]
Scala
1,158
star
45
tumblr/colossus

I/O and Microservice library for Scala
Scala
1,144
star
46
pauljamescleary/scala-pet-store

An implementation of the java pet store using FP techniques in scala
Scala
1,068
star
47
sifive/freedom

Source files for SiFive's Freedom platforms
Scala
1,058
star
48
databricks/spark-csv

CSV Data Source for Apache Spark 1.x
Scala
1,051
star
49
twitter/cassovary

Cassovary is a simple big graph processing library for the JVM
Scala
1,046
star
50
TIBCOSoftware/snappydata

Project SnappyData - memory optimized analytics database, based on Apache Spark™ and Apache Geode™. Stream, Transact, Analyze, Predict in one cluster
Scala
1,041
star
51
lensesio/stream-reactor

A collection of open source Apache 2.0 Kafka Connector maintained by Lenses.io.
Scala
1,005
star
52
twosigma/flint

A Time Series Library for Apache Spark
Scala
999
star
53
cloudera/livy

Livy is an open source REST interface for interacting with Apache Spark from anywhere
Scala
996
star
54
amplab/shark

Development in Shark has been ended.
Scala
994
star
55
broadinstitute/cromwell

Scientific workflow engine designed for simplicity & scalability. Trivially transition between one off use cases to massive scale production environments
Scala
990
star
56
h2oai/sparkling-water

Sparkling Water provides H2O functionality inside Spark cluster
Scala
958
star
57
eaplatanios/tensorflow_scala

TensorFlow API for the Scala Programming Language
Scala
937
star
58
wzhe06/SparkCTR

CTR prediction model based on spark(LR, GBDT, DNN)
Scala
902
star
59
twitter/twitter-korean-text

Korean tokenizer
Scala
857
star
60
precog/matryoshka

Generalized recursion schemes and traversals for Scala.
Scala
810
star
61
NVIDIA/spark-rapids

Spark RAPIDS plugin - accelerate Apache Spark with GPUs
Scala
800
star
62
ucb-bar/gemmini

Berkeley's Spatial Array Generator
Scala
793
star
63
twitter/scrooge

A Thrift parser/generator
Scala
790
star
64
twitter-archive/ostrich

A stats collector & reporter for Scala servers (deprecated)
Scala
773
star
65
ThoughtWorksInc/DeepLearning.scala

A simple library for creating complex neural networks
Scala
766
star
66
databricks/tensorframes

[DEPRECATED] Tensorflow wrapper for DataFrames on Apache Spark
Scala
749
star
67
MrPowers/spark-daria

Essential Spark extensions and helper methods ✨😲
Scala
746
star
68
gregdurrett/berkeley-doc-summarizer

The Berkeley Document Summarizer is a learning-based, single-document summarization system that extracts source document content, exploits syntactic information to compress it, and uses coreference constraints to ensure clarity.
Scala
741
star
69
jdegoes/blueeyes

A lightweight Web 3.0 framework for Scala, featuring a purely asynchronous architecture, extremely high-performance, massive scalability, high usability, and a functional, composable design.
Scala
738
star
70
p2t2/figaro

Figaro Programming Language and Core Libraries
Scala
737
star
71
airbnb/chronon

Chronon is a data platform for serving for AI/ML applications.
Scala
731
star
72
ezhulenev/orderbook-dynamics

Modeling high-frequency limit order book dynamics with support vector machines
Scala
718
star
73
sksamuel/avro4s

Avro schema generation and serialization / deserialization for Scala
Scala
717
star
74
rchain/rchain

Blockchain (smart contract) platform using CBC-Casper proof of stake + Rholang for concurrent execution.
Scala
693
star
75
ucb-bar/riscv-sodor

educational microarchitectures for risc-v isa
Scala
673
star
76
actionml/universal-recommender

Highly configurable recommender based on PredictionIO and Mahout's Correlated Cross-Occurrence algorithm
Scala
666
star
77
sameeragarwal/blinkdb

BlinkDB: Sub-Second Approximate Queries on Very Large Data.
Scala
659
star
78
twitter/bijection

Reversible conversions between types
Scala
657
star
79
databricks/reference-apps

Spark reference applications
Scala
648
star
80
deanwampler/programming-scala-book-code-examples

The code examples used in Programming Scala, 2nd and 3rd Editions (O'Reilly)
Scala
643
star
81
ucb-bar/chisel-tutorial

chisel tutorial exercises and answers
Scala
643
star
82
shadaj/slinky

Write Scala.js React apps just like you would in ES6
Scala
632
star
83
open-korean-text/open-korean-text

Open Korean Text Processor - An Open-source Korean Text Processor
Scala
610
star
84
twitter/chill

Scala extensions for the Kryo serialization library
Scala
608
star
85
amplab/SparkNet

Distributed Neural Networks for Spark
Scala
605
star
86
databricks/spark-redshift

Redshift data source for Apache Spark
Scala
598
star
87
allenai/pdffigures2

Given a scholarly PDF, extract figures, tables, captions, and section titles.
Scala
593
star
88
tpolecat/tut

doc/tutorial generator for scala
Scala
584
star
89
tumblr/collins

groovy kind of love
Scala
572
star
90
enragedginger/akka-quartz-scheduler

Quartz Extension and utilities for cron-style scheduling in Akka
Scala
559
star
91
Netflix/edda

AWS API Read Cache
Scala
554
star
92
jsuereth/scala-arm

This project aims to be the Scala Incubator project for Automatic-Resource-Management in the scala library
Scala
549
star
93
hyperledger-labs/Scorex

Scorex 2.0 Core
Scala
545
star
94
databricks/spark-sql-perf

Scala
543
star
95
ucb-bar/riscv-mini

Simple RISC-V 3-stage Pipeline in Chisel
Scala
538
star
96
databricks/spark-avro

Avro Data Source for Apache Spark
Scala
538
star
97
Stratio/sparta

Real Time Analytics and Data Pipelines based on Spark Streaming
Scala
525
star
98
outr/scribe

The fastest logging library in the world. Built from scratch in Scala and programmatically configurable.
Scala
524
star
99
guardrail-dev/guardrail

Principled code generation from OpenAPI specifications
Scala
519
star
100
orbeon/orbeon-forms

Orbeon Forms is an open source web forms solution. It includes an XForms engine, the Form Builder web-based form editor, and the Form Runner runtime.
Scala
514
star