  • Stars - 208
  • Rank - 189,015 (Top 4%)
  • Language - Jupyter Notebook
  • Created - over 1 year ago
  • Updated - 4 months ago

Repository Details

Uber Data Analytics | Modern Data Engineering GCP Project

Introduction

The goal of this project is to perform data analytics on Uber data using GCP Storage, Python, Compute Instance, the Mage data pipeline tool, BigQuery, and Looker Studio.

Architecture

Technology Used

  • Programming Language - Python

Google Cloud Platform

  1. Google Storage
  2. Compute Instance
  3. BigQuery
  4. Looker Studio

Modern Data Pipeline Tool - https://www.mage.ai/

Contribute to this open source project - https://github.com/mage-ai/mage-ai

Dataset Used

TLC Trip Record Data - Yellow and green taxi trip records include fields capturing pick-up and drop-off dates/times, pick-up and drop-off locations, trip distances, itemized fares, rate types, payment types, and driver-reported passenger counts.

Here is the dataset used in the video - https://github.com/darshilparmar/uber-etl-pipeline-data-engineering-project/blob/main/data/uber_data.csv

More information about the dataset can be found here:

  1. Website - https://www.nyc.gov/site/tlc/about/tlc-trip-record-data.page
  2. Data Dictionary - https://www.nyc.gov/assets/tlc/downloads/pdf/data_dictionary_trip_records_yellow.pdf
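
As a rough illustration of the fields described above, here is a minimal sketch that reads trip records with Python's standard library and derives a trip duration. The column names follow the TLC yellow taxi data dictionary; the two sample rows, the timestamp format, and the helper name `parse_trips` are assumptions made for this example and are not taken from the project's own pipeline.

```python
import csv
import io
from datetime import datetime

# Invented two-row sample for illustration only; the real file has more columns.
SAMPLE_CSV = """tpep_pickup_datetime,tpep_dropoff_datetime,passenger_count,trip_distance,payment_type,fare_amount
2016-03-01 00:00:00,2016-03-01 00:07:55,1,2.50,1,9.0
2016-03-01 00:10:00,2016-03-01 00:31:30,2,6.10,2,21.5
"""

# Assumed timestamp layout; adjust if the downloaded file differs.
TIME_FORMAT = "%Y-%m-%d %H:%M:%S"


def parse_trips(csv_file):
    """Yield one dict per trip with typed fields and a derived duration in minutes."""
    for row in csv.DictReader(csv_file):
        pickup = datetime.strptime(row["tpep_pickup_datetime"], TIME_FORMAT)
        dropoff = datetime.strptime(row["tpep_dropoff_datetime"], TIME_FORMAT)
        yield {
            "pickup": pickup,
            "dropoff": dropoff,
            "passenger_count": int(row["passenger_count"]),
            "trip_distance": float(row["trip_distance"]),
            "payment_type": int(row["payment_type"]),
            "fare_amount": float(row["fare_amount"]),
            "duration_minutes": (dropoff - pickup).total_seconds() / 60,
        }


trips = list(parse_trips(io.StringIO(SAMPLE_CSV)))
for trip in trips:
    print(trip["pickup"], trip["trip_distance"], round(trip["duration_minutes"], 2))
```

To try the same function on the downloaded dataset, pass an open file handle instead of the in-memory sample, for instance `open("uber_data.csv", newline="")`, assuming the file's headers and timestamp format match the ones used here.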

Data Model

Complete Video Tutorial

Video Link - https://youtu.be/WpQECq5Hx9g

More Repositories

  1. python-for-data-engineering (Jupyter Notebook, 201 stars) - This repo contains all the code used in the Python for Data Engineering Course
  2. stock-market-kafka-data-engineering-project (Jupyter Notebook, 163 stars)
  3. dataengineering-youtube-analysis-project (Python, 132 stars) - Data Engineering YouTube Analysis Project by Darshil Parmar
  4. tokyo-olympic-azure-data-engineering-project (Jupyter Notebook, 130 stars)
  5. twitter-airflow-data-engineering-project (Python, 92 stars) - YouTube tutorial project
  6. apache-spark-with-data-bricks-for-data-engineering (Jupyter Notebook, 45 stars)
  7. amazon-web-scraping-python-project (Jupyter Notebook, 44 stars)
  8. Data-Engineer-Tutorial-Series (Jupyter Notebook, 21 stars)
  9. sql-for-data-engineering-course (Jupyter Notebook, 16 stars)
  10. Exam_Notes_Detection (Python, 13 stars) - During exams, students often share their notes over social media, and once the exams are over it becomes really difficult to delete all those images manually. To address this problem, I created a system that detects exam notes (pictures taken with a mobile camera) and deletes them.
  11. uber-data-engineering-mage-project (Jupyter Notebook, 13 stars) - Uber Data Engineering Pipeline using Mage AI and BigQuery
  12. ipl-data-analysis-apache-spark-project (Jupyter Notebook, 12 stars)
  13. data-warehouse-snowflake-for-data-engineering (PLpgSQL, 11 stars)
  14. Amazon_Website_Scraping_Scrapy (Python, 7 stars) - Uses the Scrapy Python library to scrape the Amazon website and store titles, ratings, and reviews
  15. darshilparmar (6 stars) - About Me
  16. python-tutorial (Jupyter Notebook, 4 stars)
  17. Face_Recognition_System_FaceNet (Python, 4 stars) - A facial recognition system is a technology capable of identifying or verifying a person from a digital image or a video frame from a video source. There are multiple methods by which facial recognition systems work, but in general they compare selected facial features from a given image with faces in a database.
  18. Keywords_and_keyphrases_extraction (Jupyter Notebook, 4 stars) - Keyword extraction is tasked with the automatic identification of terms that best describe the subject of a document.
  19. kafka-in-10min-video-code (Jupyter Notebook, 4 stars)
  20. data-engineering-sql-tutorial (3 stars)
  21. Step-by-step-DataScience (Jupyter Notebook, 3 stars) - Data Science Workshop
  22. Loan-Predicition (Jupyter Notebook, 3 stars) - AnalyticsVidhya Hackathon: Loan Prediction System
  23. Clustering-Indian-Postal-code-based-on-most-visited-Venues (Jupyter Notebook, 3 stars)
  24. Face_Landmarks_Detection (Python, 3 stars) - A facial landmarks detection system is a technology capable of detecting a person from a digital image or a video frame from a video source. There are multiple methods by which facial detection systems work, but in general they compare selected facial features from a given image with faces in a database.
  25. workflow-orchestration-apache-airflow-for-data-engineering (Python, 3 stars)
  26. scraping_tutorialspoint (Python, 2 stars) - Scraping all data from tutorialspoint
  27. food_classification-FoodPal (PHP, 1 star)
  28. School-Management (JavaScript, 1 star) - School-Management diploma mini project
  29. Blood-Donation (JavaScript, 1 star) - Blood Donation diploma mini project