Awesome Scala Big Data

  • updated about 1 year ago BSD 3-Clause "New...

    CPU and GPU-accelerated Machine Learning Library

  • updated over 7 years ago Apache License 2.0

    Spark library for easy MongoDB access

  • Vegas, 731 stars
    updated almost 2 years ago MIT License

    The missing MatPlotLib for Scala + Spark

  • almond, 1,554 stars
    updated 10 days ago BSD 3-Clause "New...

    A Scala kernel for Jupyter

  • updated 22 days ago Other

    Alpakka Kafka connector - Alpakka is a Reactive Enterprise Integration library for Java and Scala, based on Reactive Streams and Akka.

  • breeze, 3,413 stars
    updated over 1 year ago Apache License 2.0

    Breeze is a numerical processing library for Scala.

  • updated almost 2 years ago Apache License 2.0

    Lightweight real-time big data streaming engine over Akka

  • updated 5 months ago GNU Affero Genera...

    Scala library for accessing various file, batch systems, job schedulers and grid middlewares.

  • hail, 895 stars
    updated 19 days ago MIT License

    Cloud-native genomic dataframes and batch computing

  • updated 7 months ago MIT License

    A simplified, lightweight ETL Framework based on Apache Spark

  • updated over 5 years ago Other

    Spark DataFrames for earth observation data

  • scalding, 3,454 stars
    updated 6 months ago Apache License 2.0

    A Scala API for Cascading

  • updated 7 months ago Apache License 2.0

    Schema registry for CSV, TSV, JSON, AVRO and Parquet schema. Supports schema inference and GraphQL API.

  • scoobi, 482 stars
    updated over 1 year ago

    A Scala productivity framework for Hadoop.

  • updated over 8 years ago Other

    Scala DSL on top of Oozie XML

  • spark, 36,719 stars
    updated 3 months ago Apache License 2.0

    Apache Spark - A unified analytics engine for large-scale data processing

  • updated about 7 years ago Apache License 2.0

    Deploy a Spark cluster the easy way.

  • updated over 2 years ago MIT License

    Executable Apache Spark Tools: Format Converter & SQL Processor

  • updated 7 months ago MIT License

    Basic framework utilities to quickly start writing production-ready Apache Spark applications

  • updated 11 months ago Apache License 2.0

    Spark package to "plug" holes in data using SQL based rules ⚡️ 🔌

  • sparta, 525 stars
    updated about 4 years ago Apache License 2.0

    Real Time Analytics and Data Pipelines based on Spark Streaming

  • updated almost 2 years ago Apache License 2.0

    Streaming MapReduce with Scalding and Storm