πŸ‡ΊπŸ‡Έ Made in United States

Discover United States's Leading Open Source Projects: Explore top-notch open source initiatives hailing from the vibrant tech community of United States.

TOP Scala Projects

1
twitter/the-algorithm

twitter/the-algorithm

Source code for Twitter's Recommendation Algorithm
Scala
60,968
star
2
prisma/prisma1

prisma/prisma1

πŸ’Ύ Database Tools incl. ORM, Migrations and Admin UI (Postgres, MySQL & MongoDB) [deprecated]
Scala
16,608
star
3
twitter/finagle

twitter/finagle

A fault tolerant, protocol-agnostic RPC system
Scala
8,742
star
4
twitter-archive/snowflake

twitter-archive/snowflake

Snowflake is a network service for generating unique ID numbers at high scale with some simple guarantees.
Scala
7,566
star
5
enso-org/enso

enso-org/enso

Hybrid visual and textual functional programming.
Scala
7,287
star
6
airbnb/aerosolve

airbnb/aerosolve

A machine learning package built for humans.
Scala
4,792
star
7
mesos/chronos

mesos/chronos

Fault tolerant job scheduler for Mesos which handles dependencies and ISO8601 based schedules
Scala
4,366
star
8
microsoft/SynapseML

microsoft/SynapseML

Simple and Distributed Machine Learning
Scala
4,335
star
9
mesosphere/marathon

mesosphere/marathon

Deploy and manage containers (including Docker) on top of Apache Mesos at scale.
Scala
4,068
star
10
twitter-archive/diffy

twitter-archive/diffy

Find potential bugs in your services with Diffy
Scala
3,827
star
11
twitter/scalding

twitter/scalding

A Scala API for Cascading
Scala
3,469
star
12
Netflix/atlas

Netflix/atlas

In-memory dimensional time series database.
Scala
3,331
star
13
twitter-archive/flockdb

twitter-archive/flockdb

A distributed, fault-tolerant graph database
Scala
3,326
star
14
awslabs/deequ

awslabs/deequ

Deequ is a library built on top of Apache Spark for defining "unit tests for data", which measure data quality in large datasets.
Scala
2,871
star
15
twitter-archive/kestrel

twitter-archive/kestrel

simple, distributed message queue system (inactive)
Scala
2,780
star
16
twitter/util

twitter/util

Wonderful reusable code from Twitter
Scala
2,679
star
17
databricks/Spark-The-Definitive-Guide

databricks/Spark-The-Definitive-Guide

Spark: The Definitive Guide's Code Repository
Scala
2,678
star
18
twitter/algebird

twitter/algebird

Abstract Algebra for Scala
Scala
2,288
star
19
twitter/finatra

twitter/finatra

Fast, testable, Scala services built on TwitterServer and Finagle
Scala
2,271
star
20
twitter-archive/gizzard

twitter-archive/gizzard

[Archived] A flexible sharding framework for creating eventually-consistent distributed datastores
Scala
2,255
star
21
twitter/summingbird

twitter/summingbird

Streaming MapReduce with Scalding and Storm
Scala
2,136
star
22
tpolecat/doobie

tpolecat/doobie

Functional JDBC layer for Scala.
Scala
2,117
star
23
sangria-graphql/sangria

sangria-graphql/sangria

Scala GraphQL implementation
Scala
1,965
star
24
metarank/metarank

metarank/metarank

A low code Machine Learning personalized ranking service for articles, listings, search results, recommendations that boosts user engagement. A friendly Learn-to-Rank engine
Scala
1,964
star
25
feathr-ai/feathr

feathr-ai/feathr

Feathr – A scalable, unified data and AI engineering platform for enterprise
Scala
1,929
star
26
riscv-boom/riscv-boom

riscv-boom/riscv-boom

SonicBOOM: The Berkeley Out-of-Order Machine
Scala
1,596
star
27
ThoughtWorksInc/Binding.scala

ThoughtWorksInc/Binding.scala

Reactive data-binding for Scala
Scala
1,579
star
28
twitter/twitter-server

twitter/twitter-server

Twitter-Server defines a template from which services at Twitter are built
Scala
1,542
star
29
GravityLabs/goose

GravityLabs/goose

Html Content / Article Extractor in Scala - open sourced from Gravity Labs
Scala
1,528
star
30
sryza/aas

sryza/aas

Code to accompany Advanced Analytics with Spark from O'Reilly Media
Scala
1,514
star
31
holdenk/spark-testing-base

holdenk/spark-testing-base

Base classes to use when writing tests with Spark
Scala
1,493
star
32
combust/mleap

combust/mleap

MLeap: Deploy ML Pipelines to Production
Scala
1,479
star
33
pathikrit/better-files

pathikrit/better-files

Simple, safe and intuitive Scala I/O
Scala
1,462
star
34
vkostyukov/scalacaster

vkostyukov/scalacaster

Purely Functional Algorithms and Data Structures in Scala
Scala
1,455
star
35
mauricio/postgresql-async

mauricio/postgresql-async

Async, Netty based, database drivers for PostgreSQL and MySQL written in Scala
Scala
1,433
star
36
mesos/spark

mesos/spark

Lightning-fast cluster computing in Java, Scala and Python.
Scala
1,426
star
37
paypal/squbs

paypal/squbs

Akka Streams & Akka HTTP for Large-Scale Production Deployments
Scala
1,423
star
38
ucb-bar/chipyard

ucb-bar/chipyard

An Agile RISC-V SoC Design Framework with in-order cores, out-of-order cores, accelerators, and more
Scala
1,415
star
39
locationtech/geomesa

locationtech/geomesa

GeoMesa is a suite of tools for working with big geo-spatial data in a distributed fashion.
Scala
1,391
star
40
twitter-archive/iago

twitter-archive/iago

A load generator, built for engineers
Scala
1,351
star
41
locationtech/geotrellis

locationtech/geotrellis

GeoTrellis is a geographic data processing engine for high performance applications.
Scala
1,317
star
42
twitter/rsc

twitter/rsc

Experimental Scala compiler focused on compilation speed
Scala
1,245
star
43
sryza/spark-timeseries

sryza/spark-timeseries

A library for time series analysis on Apache Spark
Scala
1,188
star
44
wavesplatform/Waves

wavesplatform/Waves

⛓️ Reference Waves Blockchain Node (client) implementation on Scala
Scala
1,170
star
45
tumblr/colossus

tumblr/colossus

I/O and Microservice library for Scala
Scala
1,143
star
46
databricks/LearningSparkV2

databricks/LearningSparkV2

This is the github repo for Learning Spark: Lightning-Fast Data Analytics [2nd Edition]
Scala
1,077
star
47
sifive/freedom

sifive/freedom

Source files for SiFive's Freedom platforms
Scala
1,058
star
48
pauljamescleary/scala-pet-store

pauljamescleary/scala-pet-store

An implementation of the java pet store using FP techniques in scala
Scala
1,052
star
49
databricks/spark-csv

databricks/spark-csv

CSV Data Source for Apache Spark 1.x
Scala
1,051
star
50
TIBCOSoftware/snappydata

TIBCOSoftware/snappydata

Project SnappyData - memory optimized analytics database, based on Apache Sparkβ„’ and Apache Geodeβ„’. Stream, Transact, Analyze, Predict in one cluster
Scala
1,041
star
51
twitter/cassovary

twitter/cassovary

Cassovary is a simple big graph processing library for the JVM
Scala
1,039
star
52
cloudera/livy

cloudera/livy

Livy is an open source REST interface for interacting with Apache Spark from anywhere
Scala
996
star
53
amplab/shark

amplab/shark

Development in Shark has been ended.
Scala
994
star
54
twosigma/flint

twosigma/flint

A Time Series Library for Apache Spark
Scala
989
star
55
lensesio/stream-reactor

lensesio/stream-reactor

A collection of open source Apache 2.0 Kafka Connector maintained by Lenses.io.
Scala
976
star
56
h2oai/sparkling-water

h2oai/sparkling-water

Sparkling Water provides H2O functionality inside Spark cluster
Scala
954
star
57
broadinstitute/cromwell

broadinstitute/cromwell

Scientific workflow engine designed for simplicity & scalability. Trivially transition between one off use cases to massive scale production environments
Scala
953
star
58
eaplatanios/tensorflow_scala

eaplatanios/tensorflow_scala

TensorFlow API for the Scala Programming Language
Scala
933
star
59
wzhe06/SparkCTR

wzhe06/SparkCTR

CTR prediction model based on spark(LR, GBDT, DNN)
Scala
894
star
60
twitter/twitter-korean-text

twitter/twitter-korean-text

Korean tokenizer
Scala
834
star
61
precog/matryoshka

precog/matryoshka

Generalized recursion schemes and traversals for Scala.
Scala
810
star
62
twitter/scrooge

twitter/scrooge

A Thrift parser/generator
Scala
785
star
63
twitter-archive/ostrich

twitter-archive/ostrich

A stats collector & reporter for Scala servers (deprecated)
Scala
774
star
64
ThoughtWorksInc/DeepLearning.scala

ThoughtWorksInc/DeepLearning.scala

A simple library for creating complex neural networks
Scala
763
star
65
databricks/tensorframes

databricks/tensorframes

[DEPRECATED] Tensorflow wrapper for DataFrames on Apache Spark
Scala
751
star
66
MrPowers/spark-daria

MrPowers/spark-daria

Essential Spark extensions and helper methods ✨😲
Scala
742
star
67
gregdurrett/berkeley-doc-summarizer

gregdurrett/berkeley-doc-summarizer

The Berkeley Document Summarizer is a learning-based, single-document summarization system that extracts source document content, exploits syntactic information to compress it, and uses coreference constraints to ensure clarity.
Scala
742
star
68
jdegoes/blueeyes

jdegoes/blueeyes

A lightweight Web 3.0 framework for Scala, featuring a purely asynchronous architecture, extremely high-performance, massive scalability, high usability, and a functional, composable design.
Scala
738
star
69
p2t2/figaro

p2t2/figaro

Figaro Programming Language and Core Libraries
Scala
737
star
70
NVIDIA/spark-rapids

NVIDIA/spark-rapids

Spark RAPIDS plugin - accelerate Apache Spark with GPUs
Scala
722
star
71
ezhulenev/orderbook-dynamics

ezhulenev/orderbook-dynamics

Modeling high-frequency limit order book dynamics with support vector machines
Scala
714
star
72
sksamuel/avro4s

sksamuel/avro4s

Avro schema generation and serialization / deserialization for Scala
Scala
713
star
73
rchain/rchain

rchain/rchain

Blockchain (smart contract) platform using CBC-Casper proof of stake + Rholang for concurrent execution.
Scala
688
star
74
ucb-bar/gemmini

ucb-bar/gemmini

Berkeley's Spatial Array Generator
Scala
668
star
75
actionml/universal-recommender

actionml/universal-recommender

Highly configurable recommender based on PredictionIO and Mahout's Correlated Cross-Occurrence algorithm
Scala
665
star
76
sameeragarwal/blinkdb

sameeragarwal/blinkdb

BlinkDB: Sub-Second Approximate Queries on Very Large Data.
Scala
661
star
77
twitter/bijection

twitter/bijection

Reversible conversions between types
Scala
657
star
78
databricks/reference-apps

databricks/reference-apps

Spark reference applications
Scala
648
star
79
ucb-bar/chisel-tutorial

ucb-bar/chisel-tutorial

chisel tutorial exercises and answers
Scala
643
star
80
deanwampler/programming-scala-book-code-examples

deanwampler/programming-scala-book-code-examples

The code examples used in Programming Scala, 2nd and 3rd Editions (O'Reilly)
Scala
643
star
81
ucb-bar/riscv-sodor

ucb-bar/riscv-sodor

educational microarchitectures for risc-v isa
Scala
641
star
82
shadaj/slinky

shadaj/slinky

Write Scala.js React apps just like you would in ES6
Scala
632
star
83
twitter/chill

twitter/chill

Scala extensions for the Kryo serialization library
Scala
607
star
84
amplab/SparkNet

amplab/SparkNet

Distributed Neural Networks for Spark
Scala
605
star
85
airbnb/chronon

airbnb/chronon

Chronon is a data platform for serving for AI/ML applications.
Scala
600
star
86
databricks/spark-redshift

databricks/spark-redshift

Redshift data source for Apache Spark
Scala
598
star
87
open-korean-text/open-korean-text

open-korean-text/open-korean-text

Open Korean Text Processor - An Open-source Korean Text Processor
Scala
597
star
88
tpolecat/tut

tpolecat/tut

doc/tutorial generator for scala
Scala
584
star
89
tumblr/collins

tumblr/collins

groovy kind of love
Scala
571
star
90
enragedginger/akka-quartz-scheduler

enragedginger/akka-quartz-scheduler

Quartz Extension and utilities for cron-style scheduling in Akka
Scala
559
star
91
Netflix/edda

Netflix/edda

AWS API Read Cache
Scala
554
star
92
jsuereth/scala-arm

jsuereth/scala-arm

This project aims to be the Scala Incubator project for Automatic-Resource-Management in the scala library
Scala
551
star
93
hyperledger-labs/Scorex

hyperledger-labs/Scorex

Scorex 2.0 Core
Scala
545
star
94
databricks/spark-sql-perf

databricks/spark-sql-perf

Scala
543
star
95
databricks/spark-avro

databricks/spark-avro

Avro Data Source for Apache Spark
Scala
538
star
96
Stratio/sparta

Stratio/sparta

Real Time Analytics and Data Pipelines based on Spark Streaming
Scala
525
star
97
allenai/pdffigures2

allenai/pdffigures2

Given a scholarly PDF, extract figures, tables, captions, and section titles.
Scala
514
star
98
orbeon/orbeon-forms

orbeon/orbeon-forms

Orbeon Forms is an open source web forms solution. It includes an XForms engine, the Form Builder web-based form editor, and the Form Runner runtime.
Scala
512
star
99
guardrail-dev/guardrail

guardrail-dev/guardrail

Principled code generation from OpenAPI specifications
Scala
512
star
100
outr/scribe

outr/scribe

The fastest logging library in the world. Built from scratch in Scala and programmatically configurable.
Scala
502
star