Awesome Scala Big Data

  • almond almond 1,561
    star
    updated 25 days ago BSD 3-Clause "New...

    A Scala kernel for Jupyter

  • updated 3 months ago Other

    Alpakka Kafka connector - Alpakka is a Reactive Enterprise Integration library for Java and Scala, based on Reactive Streams and Akka.

  • updated over 1 year ago BSD 3-Clause "New...

    CPU and GPU-accelerated Machine Learning Library

  • breeze breeze 3,413
    star
    updated over 1 year ago Apache License 2.0

    Breeze is a numerical processing library for Scala.

  • updated about 2 years ago Apache License 2.0

    Lightweight real-time big data streaming engine over Akka

  • updated about 2 months ago GNU Affero Genera...

    Scala library for accessing various file, batch systems, job schedulers and grid middlewares.

  • hail hail 934
    star
    updated 11 days ago MIT License

    Cloud-native genomic dataframes and batch computing

  • updated 3 months ago MIT License

    A simplified, lightweight ETL Framework based on Apache Spark

  • updated almost 6 years ago Other

    Spark DataFrames for earth observation data

  • scalding scalding 3,467
    star
    updated 11 months ago Apache License 2.0

    A Scala API for Cascading

  • updated about 4 years ago Apache License 2.0

    Schema registry for CSV, TSV, JSON, AVRO and Parquet schema. Supports schema inference and GraphQL API.

  • scoobi scoobi 482
    star
    updated almost 2 years ago

    A Scala productivity framework for Hadoop.

  • updated almost 9 years ago Other

    Scala DSL on top of Oozie XML

  • spark spark 36,719
    star
    updated 7 months ago Apache License 2.0

    Apache Spark - A unified analytics engine for large-scale data processing

  • updated over 7 years ago Apache License 2.0

    Deploy Spark cluster in an easy way.

  • updated over 7 years ago Apache License 2.0

    Spark library for easy MongoDB access

  • updated almost 4 years ago Apache License 2.0

    Spark package to "plug" holes in data using SQL based rules ⚑️ πŸ”Œ

  • updated about 3 years ago MIT License

    Executable Apache Spark Tools: Format Converter & SQL Processor

  • updated about 1 year ago MIT License

    Basic framework utilities to quickly start writing production ready Apache Spark applications

  • sparta sparta 525
    star
    updated over 4 years ago Apache License 2.0

    Real Time Analytics and Data Pipelines based on Spark Streaming

  • updated over 2 years ago Apache License 2.0

    Streaming MapReduce with Scalding and Storm

  • Vegas Vegas 729
    star
    updated about 2 years ago MIT License

    The missing MatPlotLib for Scala + Spark