There are no reviews yet. Be the first to send feedback to the community and the maintainers!
flowman
Flowman is an ETL framework powered by Apache Spark. With its declarative approach, Flowman simplifies the development of complex data pipelines.spark-training
Repository used for Spark Trainingsdocker-jupyter-spark
Docker image for Jupyter notebooks with PySparkterraform-emr-training
Terraform script for launching multiple EMR clusters for training purposes.weather-analysis
pyspark-advanced
Jupyter Notebooks for PySpark Advanced Workshoppyspark-ml-crashcourse
flowman-demo-weather
docker-hive
Docker container running the Hive Metastoredocker-spark
Repository for building Docker containers for Sparkvagrant-cloudera
A Vagrant setup to run a virtual Cloudera clusterpyspark-ml-taxis
Jupyter Notebooks for PySpark Workshop using NYC Taxi Trip datadocker-alluxio
Docker image for Apache Alluxiovagrant-druid
docker-jupyter-anaconda
Docker Jupyter image based on Anaconda distributiondocker-hadoop
Repository for building Docker containers for Hadoopdocker-presto
Repository for building Docker containers for PrestoLove Open Source and this site? Check out how you can help us