• Stars
    star
    165
  • Rank 227,666 (Top 5 %)
  • Language
    Jupyter Notebook
  • Created over 2 years ago
  • Updated over 1 year ago

Reviews

There are no reviews yet. Be the first to send feedback to the community and the maintainers!

Repository Details

A simple guide to MLOps through ZenML and its various integrations.

ZenBytes

ZenBytes is a series of short practical MLOps lessons through ZenML and its various integrations. It is intended for people looking to learn about MLOps generally, and also for ML practitioners who want to get started with ZenML.

πŸ’‘ What you will learn

  • Define an MLOps stack tailored to your project requirements.
  • Build transparent and reproducible data-centric ML pipelines with automated artifact versioning, tracking, caching, and more.
  • Deploy ML pipelines with tooling and infrastructure of your choice (e.g. as a serverless microservice in the cloud).
  • Monitor and address production issues like training-serving skew and data drift.
  • Use some of the most popular MLOps tools like ZenML, Kubeflow, MLflow, Weights & Biases, Evidently, Seldon, Feast, and many more.

In the end, you will be able to take any of your ML models from experimentation to a customized, fully fleshed-out production-grade MLOps setup in a matter of minutes!

Sam

πŸ§‘β€πŸ« Syllabus

The series is structured into four chapters with several lessons each. Click on any of the links below to open the respective lesson directly in Colab.

🍑 1. ML Pipelines ♻️ 2. Training / Serving πŸ“ 3. Data Management πŸš€ More Coming Soon!
1.1 ML Pipelines 2.1 Experiment Tracking 3.1 Data Skew
1.2 Artifact Lifecycle 2.2 Local Deployment
2.3 Inference Pipelines

πŸ™ About ZenML

ZenML is an extensible, open-source MLOps framework for creating production-ready ML pipelines. Built for data scientists, it has a simple, flexible syntax, is cloud- and tool-agnostic, and has interfaces/abstractions that are catered towards ML workflows.

If you enjoy these courses and want to learn more:

πŸ’» Setup

System Requirements

  • Linux or MacOS
  • Python 3.7, 3.8, 3.9, or 3.10
  • Jupyter notebook and ZenML: pip install zenml notebook

Integrations

As you progress through the course, you will need to install additional packages for the various other MLOps tools we will use. You will find corresponding instructions in the respective notebooks, but we recommend you install all integrations ahead of time with the following command:

zenml integration install sklearn wandb evidently mlflow -y

πŸš€ Getting Started

If you haven't done so already, clone ZenBytes to your local machine. Then, use Jupyter Notebook to go through the course lesson-by-lesson, starting with 1-1_Pipelines.ipynb:

git clone https://github.com/zenml-io/zenbytes
cd zenbytes
jupyter notebook

❓ FAQ

1. ZenML cannot find a component even though I have it in my stack

Updating or switching your ZenML stack is sometimes not immediately loaded in Jupyter notebooks.

Solution: First, make sure you really have the correct component installed and registered in your currently active stack with zenml stack describe. If the component is indeed there, restart the kernel of your Jupyter notebook, which will also reload the stack.

More Repositories

1

zenml

ZenML πŸ™: Build portable, production-ready MLOps pipelines. https://zenml.io.
Python
3,654
star
2

awesome-open-data-annotation

Open Source Data Annotation & Labeling Tools
390
star
3

mlops-stacks

A series of Terraform based recipes to provision popular MLOps stacks on the cloud.
HCL
170
star
4

zenml-projects

A repository for all ZenML projects that are specific production use-cases.
Python
120
star
5

zenml-gitflow

A repository that showcases how you can use ZenML with Git
Python
47
star
6

zenml-dashboard

The repository that contains all source code for the ZenML UI.
TypeScript
24
star
7

vscode-zenml

VSCode extension for ZenML
TypeScript
7
star
8

template-nlp

A copier template repository for an e2e Natural language processing (NLP) ZenML MLOps project
Python
5
star
9

cengine

Core Engine CLI/ Python SDK/ Python API Client repository
Python
5
star
10

github-actions-orchestrator-tutorial

A repository for the ZenML GitHub Actions Orchestrator Tutorial
Python
5
star
11

zenml-hub-plugins

Python
5
star
12

zenml-project-templates

Templates for your base ZenML projects!
Python
4
star
13

template-e2e-batch

A copier template repository for a e2e batch ZenML MLOps pipeline.
Python
3
star
14

blog.zenml.io

JavaScript
3
star
15

zenml-hub-plugin-template

Template for ZenML Hub Plugins
1
star
16

terraform-aws-remote-state

Terraform module to create remote state backend for AWS provider
HCL
1
star
17

terraform-gcp-remote-state

Terraform module to create remote state backend for GCP provider
HCL
1
star
18

terraform-azurerm-remote-state

Terraform module to create remote state backend for Azure provider
HCL
1
star