There are no reviews yet. Be the first to send feedback to the community and the maintainers!
flowman
Flowman is an ETL framework powered by Apache Spark. With its declarative approach, Flowman simplifies the development of complex data pipelines.spark-training
Repository used for Spark Trainingsdocker-jupyter-spark
Docker image for Jupyter notebooks with PySparkterraform-emr-training
Terraform script for launching multiple EMR clusters for training purposes.weather-analysis
pyspark-advanced
Jupyter Notebooks for PySpark Advanced Workshoppyspark-ml-crashcourse
flowman-demo-weather
docker-hive
Docker container running the Hive Metastoredocker-spark
Repository for building Docker containers for Sparkvagrant-cloudera
A Vagrant setup to run a virtual Cloudera clusterdocker-alluxio
Docker image for Apache Alluxiovagrant-druid
docker-miniconda
Miniconda base imagedocker-jupyter-anaconda
Docker Jupyter image based on Anaconda distributiondocker-hadoop
Repository for building Docker containers for Hadoopdocker-presto
Repository for building Docker containers for PrestoLove Open Source and this site? Check out how you can help us