• Stars
    star
    3,496
  • Rank 12,732 (Top 0.3 %)
  • Language
  • Created over 5 years ago
  • Updated 6 months ago

Reviews

There are no reviews yet. Be the first to send feedback to the community and the maintainers!

Repository Details

A list of useful resources to learn Data Engineering from scratch

How To Become a Data Engineer

Useful articles

Talks

Algorithms & Data Structures

SQL

Programming

Databases

Distributed Systems

Books

Courses

Blogs

  • Martin Kleppmann author of Designing Data-Intensive Application
  • BaseDS by Vaidehi Joshi about Distributed Systems

Tools

  • Apache Airflow is a platform to programmatically author, schedule and monitor workflows in Python
  • Apache Spark is a unified analytics engine for large-scale data processing
  • Apache Kafka is a distributed streaming platform
  • Luigi is a Python package that helps you build complex pipelines of batch jobs.
  • Dagster.io is a system for building modern data applications.
  • Prefect includes everything you need to create and run data applications.
  • Metaflow build and manage real-life data science projects with ease
  • lakeFS build repeatable, atomic and versioned data lake operations – from complex ETL jobs to data science and analytics.

Cloud Platforms

Communities

Data Engineering Jobs

Other

Newsletters & Digests

More Repositories

1

planetpython_telegrambot

Django App Planet Python Telegram Bot
Python
56
star
2

apache-airflow-course-materials

Курс про Apache Airflow 2.0
Python
31
star
3

django-telegram-auth-example

Telegram Login Widget Django example
Python
25
star
4

django-trix-editor

Django Trix WYSIWYG Editor integration
Python
18
star
5

django_channels_demo

Demonstration of Django Channels using WebSocket
Python
17
star
6

apache-airflow-intro

Python
17
star
7

notion

Simple Django blog engine
HTML
12
star
8

luigi-course-materials

Материалы для курса Введение в Data Engineering: дата пайплайны
Python
12
star
9

VKMixin

Vkontakte Oauth 2.0 Tornado Mixin
Python
6
star
10

qiwi

Python QIWI API Client
Python
5
star
11

vagrant_demo

Vagrant + Demo Django app
Python
5
star
12

pycon-ru-2019-etl-examples

Luigi, Airflow, Prefect Examples for PyCon RU 2019 Presentation
Python
4
star
13

pyPdf417

PDF417 Barcode Generator in Python
Python
3
star
14

django-cbv-webinar-yandex

Code samples for the Django Class Based Views Webinar in Yandex
Python
3
star
15

awesome-data-engineering

A curated list of resources to navigate data engineering
3
star
16

luigi-telegram

Luigi Tasks status notifications to Telegram
Python
3
star
17

python-alfabank

Alfabank Payment Gateway Python Client
Python
2
star
18

apache-airflow-xcom-examples

XCom code examples for Apache Airflow 2.0
Python
2
star
19

airflow-taskflow-api-examples

Python
2
star
20

django-qiwi-kassa

Reusable Django App for Qiwi Kassa
Python
2
star
21

adilkhash

1
star
22

data-engineering-blogs

Data Engineering Blogs
1
star
23

leetcode

Solutions to LeetCode problems in Python
Python
1
star
24

appmetrica-logs-api

AppMetrica Logs API client
Python
1
star