  • Stars: 201
  • Rank 194,491 (Top 4%)
  • Language: Jupyter Notebook
  • Created almost 2 years ago
  • Updated 7 months ago

Repository Details

This repo contains all the code used in the Python for Data Engineering course.

Python for Data Engineering

The code is free to use. If you want an in-depth understanding of the concepts, check out my Python course, designed to teach you how to think programmatically and perform data engineering tasks using Python.

Module 1: iPhone Data Analysis Project

Get right into it: instead of spending time on the basics first, execute a complete project to boost your confidence.

Work on a Python data analysis project using an Apple iPhone dataset, and perform some basic transformations and analysis.

Module 2: Python Basics

Build strong programming fundamentals by learning the basics of Python.

  • Working with Strings
  • Working with Numbers
  • Understanding Data Types
  • Python Data Structures (List, Dictionary, Sets, Tuples)
  • Working with Loops (For & While)
  • Conditional Statements (if-else)

Module 3: Python Intermediate

Let's learn some intermediate concepts to build a strong foundation.

  • List Comprehension
  • Understand Python Packages
  • Exception Handling
  • Working with a text file
  • Working with Functions
  • Lambda functions basics
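
As a rough illustration of how a few of these topics fit together (functions, exception handling, list comprehensions, and lambdas), here is a minimal sketch. The sample data and the helper name parse_price are hypothetical and are not taken from the course notebooks.

```python
# Hypothetical sample data; not taken from the course notebooks.
raw_prices = ["10.5", "n/a", "7", "3.25"]


def parse_price(text):
    """Convert a string to a float, returning None when it is not numeric."""
    try:
        return float(text)
    except ValueError:
        # Exception handling: bad input is reported as None instead of crashing.
        return None


# List comprehension: parse every value, then keep only the valid ones.
parsed = [parse_price(p) for p in raw_prices]
valid = [p for p in parsed if p is not None]

# Lambda function: a small anonymous function used here as a sort key.
descending = sorted(valid, key=lambda p: -p)

print(valid)
print(descending)
```

Returning None for unparseable values is only one possible design; raising a custom exception or logging the bad row may suit a real pipeline better.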

Module 4: Python Advanced

Let's learn some advanced topics and packages used by data engineers on a daily basis.

  • Py Package - NumPy
  • Py Package - Pandas (Basics and Understanding Transformation with Project)

Module 5: Python for Data Engineering

Python for data engineering: learn how to work with different file formats and handle DateTime columns.

  • Working with DateTime Formats
  • Working with different file formats (JSON, CSV, Excel, Avro)
  • Scrape Amazon Data Using Python
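
To give a feel for the DateTime and file-format topics, the sketch below parses timestamp strings from JSON and writes the result as CSV using only the standard library. The records and field names (order_id, ordered_at, order_month) are made up for illustration; the course itself may use different data and tools such as pandas.

```python
import csv
import io
import json
from datetime import datetime

# Hypothetical records; the field names are illustrative only.
raw_json = (
    '[{"order_id": 1, "ordered_at": "2023-01-15 08:30:00"},'
    ' {"order_id": 2, "ordered_at": "2023-02-01 17:45:10"}]'
)

records = json.loads(raw_json)

for record in records:
    # Parse the text timestamp into a datetime object.
    parsed = datetime.strptime(record["ordered_at"], "%Y-%m-%d %H:%M:%S")
    # Store it back in ISO 8601 form and derive a year-month column.
    record["ordered_at"] = parsed.isoformat()
    record["order_month"] = parsed.strftime("%Y-%m")

# Write the cleaned records as CSV into an in-memory buffer.
buffer = io.StringIO()
writer = csv.DictWriter(buffer, fieldnames=["order_id", "ordered_at", "order_month"])
writer.writeheader()
writer.writerows(records)
csv_text = buffer.getvalue()

print(csv_text)
```

An in-memory buffer keeps the example self-contained; swapping it for a file opened with newline="" would write the same rows to disk.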

Module 6: Spotify Data Pipeline End-To-End Python Data Engineering Project

Implement a complete data pipeline as an end-to-end data engineering project using Spotify data.

  • Integrating with Spotify API and extracting Data
  • Deploying code on AWS Lambda for Data Extraction
  • Adding trigger to run the extraction automatically
  • Writing transformation function
  • Building automated trigger on transformation function
  • Store files on S3 properly
  • Building Analytics Tables on data files using Glue and Athena
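
The transformation step above could look roughly like the following sketch, which is not the project's actual code. It assumes the raw payload has an "items" list whose entries contain a "track" object with "id", "name", and "artists" fields, and that the payload is passed in the event under a hypothetical "payload" key. Reading from and writing to S3 is left out so the example runs without AWS credentials.

```python
import json


def transform_tracks(payload):
    """Flatten nested track items into one flat dict per track."""
    rows = []
    for item in payload.get("items", []):
        track = item.get("track") or {}
        rows.append(
            {
                "track_id": track.get("id"),
                "track_name": track.get("name"),
                # Join all artist names into a single comma-separated string.
                "artists": ", ".join(a["name"] for a in track.get("artists", [])),
            }
        )
    return rows


def lambda_handler(event, context):
    """Entry point using the (event, context) signature of Python Lambda handlers."""
    # Assumption: the raw payload arrives in the event under "payload";
    # a real pipeline would more likely read it from an S3 object.
    rows = transform_tracks(event.get("payload", {}))
    return {"statusCode": 200, "body": json.dumps(rows)}


# Local usage example with a hypothetical payload.
sample_event = {
    "payload": {
        "items": [
            {"track": {"id": "t1", "name": "Song A", "artists": [{"name": "X"}, {"name": "Y"}]}},
            {"track": {"id": "t2", "name": "Song B", "artists": [{"name": "Z"}]}},
        ]
    }
}
result = lambda_handler(sample_event, None)
print(result["body"])
```

Keeping the transformation in a plain function separate from the handler makes it easy to test locally before deploying and attaching a trigger.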

More Repositories

1. uber-etl-pipeline-data-engineering-project
   Jupyter Notebook, 208 stars

2. stock-market-kafka-data-engineering-project
   Jupyter Notebook, 163 stars

3. dataengineering-youtube-analysis-project
   Data Engineering YouTube Analysis Project by Darshil Parmar
   Python, 132 stars

4. tokyo-olympic-azure-data-engineering-project
   Jupyter Notebook, 130 stars

5. twitter-airflow-data-engineering-project
   YouTube tutorial project
   Python, 92 stars

6. apache-spark-with-data-bricks-for-data-engineering
   apache-spark-with-databricks-for-data-engineering
   Jupyter Notebook, 45 stars

7. amazon-web-scraping-python-project
   Jupyter Notebook, 44 stars

8. Data-Engineer-Tutorial-Series
   Jupyter Notebook, 21 stars

9. sql-for-data-engineering-course
   Jupyter Notebook, 16 stars

10. Exam_Notes_Detection
    During exams, students often share their notes via social media, and after the exams are over it becomes really difficult to delete all those images manually. To address this problem, I created this system, which detects exam notes (pictures taken with a mobile camera) and deletes them.
    Python, 13 stars

11. uber-data-engineering-mage-project
    Uber Data Engineering Pipeline using Mage AI and BigQuery
    Jupyter Notebook, 13 stars

12. ipl-data-analysis-apache-spark-project
    Jupyter Notebook, 12 stars

13. data-warehouse-snowflake-for-data-engineering
    PLpgSQL, 11 stars

14. Amazon_Website_Scraping_Scrapy
    Uses the Scrapy Python library to scrape the Amazon website and store titles, ratings, and reviews.
    Python, 7 stars

15. darshilparmar
    About Me
    6 stars

16. python-tutorial
    Jupyter Notebook, 4 stars

17. Face_Recognition_System_FaceNet
    A facial recognition system is a technology capable of identifying or verifying a person from a digital image or a video frame from a video source. There are multiple methods by which facial recognition systems work, but in general, they work by comparing selected facial features from a given image with faces within a database.
    Python, 4 stars

18. Keywords_and_keyphrases_extraction
    Keyword extraction is tasked with the automatic identification of terms that best describe the subject of a document.
    Jupyter Notebook, 4 stars

19. kafka-in-10min-video-code
    Jupyter Notebook, 4 stars

20. data-engineering-sql-tutorial
    3 stars

21. Step-by-step-DataScience
    Data Science Workshop
    Jupyter Notebook, 3 stars

22. Loan-Predicition
    AnalyticsVidhya Hackathon: Loan Prediction System
    Jupyter Notebook, 3 stars

23. Clustering-Indian-Postal-code-based-on-most-visited-Venues
    Clustering Indian Postal Code Based on Most Visited Venues
    Jupyter Notebook, 3 stars

24. Face_Landmarks_Detection
    A facial landmarks detection system is a technology capable of detecting a person from a digital image or a video frame from a video source. There are multiple methods by which facial detection systems work, but in general, they work by comparing selected facial features from a given image with faces within a database.
    Python, 3 stars

25. workflow-orchestration-apache-airflow-for-data-engineering
    Python, 3 stars

26. scraping_tutorialspoint
    Scraping all data from tutorialspoint
    Python, 2 stars

27. food_classification-FoodPal
    PHP, 1 star

28. School-Management
    School-Management diploma mini project
    JavaScript, 1 star

29. Blood-Donation
    Blood Donation diploma mini project
    JavaScript, 1 star