There are no reviews yet. Be the first to send feedback to the community and the maintainers!
Data-Pipeline-with-dbt-using-Airflow-on-GCP
This project demonstrates how to build and automate an ETL pipeline using DAGs in Airflow and load the transformed data to Bigquery. There are different tools that have been used in this project such as Astro, DBT, GCP, Airflow, Metabase.Python-ETL-pipeline-using-Airflow-on-AWS
This project demonstrates how to build and automate an ETL pipeline written in Python and schedule it using open source Apache Airflow orchestration tool on AWS EC2 instance.Youtube-video-data-analytics-using-AWS
This project aims to leverage Amazon Web Services to create trending Youtube videos analytics service. Project contains different data engineering, data analysis and data science parts.Microsoft-Azure-Medallion-Data-pipeline
In this project we are going to create an end-to-end data platform right from Data Ingestion, Data Transformation, Data Loading and Reporting.End-to-end-machine-learning
The idea of this project is to apply statistical methods learned in university lectures to find patterns in the data and use machine learning to solve a supervised classification problemSnowflake-data-ingestion-hands-on-tutorial
This repo covers the two most widely used and recommended file based data ingestion approaches: COPY INTO and Snowpipe.Vision-Transformer-Research
The purpose of this research project is to compare traditional CNNs to vision transformers, can transformers give a higher AUC when classifying Atypical Femoral Fracture / Normal Femoral Fracture?Neural-Networks-and-Learning-Systems
Solving problems using different machine learning algorithms. Machine learning, classification, pattern recognition and high-dimensional data analysis.Advanced-Regression-Techniques-for-Ames-housing-data-prediction
Prediction of Ames house prices using advanced regression techniques and ML algorithms.GPT-and-LangChain-for-Data-analysis
Prediction-of-used-car-prices-using-various-regression-techniques
It is a work on a regression problem in which our objective is to predict the prices of used cars given a number of features/predictors about themData-Science-Apache-Spark-Databricks-ETL-Project
Using Databricks community edition to build multiple end-to-end ETL pipelines using PySpark for different file formats such as CSV, Parquet, Delta table. Predictive modeling is performed using different machine learning algorithms.dbt_learn_fundamentals
dbt is a SQL-first transformation workflow that lets teams quickly and collaboratively deploy analytics code following software engineering best practices like modularity, portability, CI/CD, and documentation.Love Open Source and this site? Check out how you can help us