There are no reviews yet. Be the first to send feedback to the community and the maintainers!
Repository Details
In this project I have created an end to end Big data ETL pipeline which comprised of Hadoop HDFS as storage layer, Apache Hive as Datawarehouse, Apache spark as ETL engine, Apache airflow as data orchestration and presto for query analysis.