• Stars
    star
    1
  • Language
    Python
  • Created 5 months ago
  • Updated 5 months ago

Reviews

There are no reviews yet. Be the first to send feedback to the community and the maintainers!

Repository Details

In this project I have created an end to end Big data ETL pipeline which comprised of Hadoop HDFS as storage layer, Apache Hive as Datawarehouse, Apache spark as ETL engine, Apache airflow as data orchestration and presto for query analysis.