• Stars
    star
    11,082
  • Rank 3,038 (Top 0.06 %)
  • Language
    Jupyter Notebook
  • Created about 3 years ago
  • Updated 3 months ago

Reviews

There are no reviews yet. Be the first to send feedback to the community and the maintainers!

Repository Details

Free MLOps course from DataTalks.Club

MLOps Zoomcamp

Our MLOps Zoomcamp course

Taking the course

2023 Cohort

Self-paced mode

All the materials of the course are freely available, so that you can take the course at your own pace

  • Follow the suggested syllabus (see below) week by week
  • You don't need to fill in the registration form. Just start watching the videos and join Slack
  • Check FAQ if you have problems
  • If you can't find a solution to your problem in FAQ, ask for help in Slack

Overview

Objective

Teach practical aspects of productionizing ML services — from training and experimenting to model deployment and monitoring.

Target audience

Data scientists and ML engineers. Also software engineers and data engineers interested in learning about putting ML in production.

Pre-requisites

  • Python
  • Docker
  • Being comfortable with command line
  • Prior exposure to machine learning (at work or from other courses, e.g. from ML Zoomcamp)
  • Prior programming experience (at least 1+ year)

Asking for help in Slack

The best way to get support is to use DataTalks.Club's Slack. Join the #course-mlops-zoomcamp channel.

To make discussions in Slack more organized:

Syllabus

Module 1: Introduction

  • What is MLOps
  • MLOps maturity model
  • Running example: NY Taxi trips dataset
  • Why do we need MLOps
  • Course overview
  • Environment preparation
  • Homework

More details

Module 2: Experiment tracking and model management

  • Experiment tracking intro
  • Getting started with MLflow
  • Experiment tracking with MLflow
  • Saving and loading models with MLflow
  • Model registry
  • MLflow in practice
  • Homework

More details

Weights and biases workshop

Module 3: Orchestration and ML Pipelines

  • Workflow orchestration
  • Prefect 2.0
  • Turning a notebook into a pipeline
  • Deployment of Prefect flow
  • Homework

More details

Module 4: Model Deployment

  • Three ways of model deployment: Online (web and streaming) and offline (batch)
  • Web service: model deployment with Flask
  • Streaming: consuming events with AWS Kinesis and Lambda
  • Batch: scoring data offline
  • Homework

More details

Module 5: Model Monitoring

  • Monitoring ML-based services
  • Monitoring web services with Prometheus, Evidently, and Grafana
  • Monitoring batch jobs with Prefect, MongoDB, and Evidently

More details

Module 6: Best Practices

  • Testing: unit, integration
  • Python: linting and formatting
  • Pre-commit hooks and makefiles
  • CI/CD (GitHub Actions)
  • Infrastructure as code (Terraform)
  • Homework

More details

Project

  • End-to-end project with all the things above

More details

Instructors

  • Cristian Martinez
  • Jeff Hale
  • Alexey Grigorev
  • Emeli Dral
  • Sejal Vaidya

Other courses from DataTalks.Club:

FAQ

I want to start preparing for the course. What can I do?

If you haven't used Flask or Docker

If you have no previous experience with ML

  • Check Module 1 from ML Zoomcamp for an overview
  • Module 3 will also be helpful if you want to learn Scikit-Learn (we'll use it in this course)
  • We'll also use XGBoost. You don't have to know it well, but if you want to learn more about it, refer to module 6 of ML Zoomcamp

I registered but haven't received an invite link. Is it normal?

Yes, we haven't automated it. You'll get a mail from us eventually, don't worry.

If you want to make sure you don't miss anything:

Is it going to be live?

No and yes. There will be two parts:

  • Lectures: Pre-recorded, you can watch them when it's convenient for you.
  • Office hours: Live on Mondays (17:00 CET), but recorded, so you can watch later.

I just joined. Can I still get a certificate?

  • To get a certificate, you need to complete a project
  • There will be two attempts to do a project
  • First: in July, second: in August
  • If you manage to finish all the materials till August, and successfully finish the project, you'll get the certificate

Supporters and partners

Thanks to the course sponsors for making it possible to create this course

More Repositories

1

data-engineering-zoomcamp

Free Data Engineering course!
Jupyter Notebook
24,859
star
2

machine-learning-zoomcamp

Learn ML engineering for free in 4 months!
Jupyter Notebook
9,429
star
3

llm-zoomcamp

LLM Zoomcamp - a free online course about building a Q&A system
Jupyter Notebook
2,813
star
4

stock-markets-analytics-zoomcamp

Course Materials for Analytics in Stock Markets Zoomcamp
Jupyter Notebook
397
star
5

project-of-the-week

Learn by doing: DIY project groups at DataTalks.Club
375
star
6

awesome-data-podcasts

A list of awesome data podcasts
362
star
7

datatalksclub.github.io

The web page for DataTalks.Club
Python
180
star
8

nyc-tlc-data

Backup for NYC TLC data for the DE Zoomcamp course
151
star
9

data-paths

Learning paths for data roles
123
star
10

data-analytics-interviews

Data analytics interview questions and answers
55
star
11

course-management-platform

Django-based course management platform for Zoomcamps
Python
46
star
12

zoomcamp-analytics

Public data and analytics for our open course
Jupyter Notebook
29
star
13

kaggle-qa-challenge-starter

The getting started notebook for the DTC Zoomcamp Q&A challenge
Jupyter Notebook
28
star
14

mlzoomcamp.com

The page for the ML Zoomcamp course
25
star
15

reading-club-nlp

Notes from our NLP reading club!
16
star
16

kitchenware-competition-starter

A starter notebook for the Kitchenware classification competition on Kaggle
Jupyter Notebook
14
star
17

whylogs-workshop

The code from the whylogs workshop in DataTalks.Club on 29 March 2022
Jupyter Notebook
13
star
18

reading-club-books

12
star
19

llm-zoomcamp-saturncloud

Saturn Cloud starter code for LLM Zoomcamp
Jupyter Notebook
8
star
20

website-django

The DTC website in Django
Jupyter Notebook
4
star
21

course-management-platform-old

A platform for hosting our courses
Python
4
star
22

fashion

Python
2
star