hudi-on-glue-quick-start
AWS Glue PySpark - Apache Hudi Quick Start Guide
airflow-pyspark-emr
This project demonstrates how to process data stored in a data lake, transforming it into an OLAP-optimized structure using PySpark. The PySpark job runs on AWS EMR, and the data pipeline is orchestrated by Apache Airflow, including the infrastructure creation and the EMR cluster termination.
datasprints-open-spaces
Repository for the code demoed in the talk
flame
Flame 🔥 Opinionated Flask & MongoDB backend boilerplate.
delta-lake-on-glue-quickstart
This is a quick start guide for the Delta Lake (delta.io) Python Spark connector, running on AWS Glue.
pyspark-emr-s3-datalake
aws-cft-samples
Sample CloudFormation Template YAML configurations
code-pipeline-python-test
data-modeling-postgresql
Data Modeling and ETL with PostgreSQL and Python