• Stars
    star
    392
  • Rank 109,735 (Top 3 %)
  • Language
    TypeScript
  • License
    MIT License
  • Created over 3 years ago
  • Updated over 1 year ago

Reviews

There are no reviews yet. Be the first to send feedback to the community and the maintainers!

Repository Details

🍫 Example code for a basic ML Platform based on Pulumi, FastAPI, DVC, MLFlow and more

ML Platform Workshop

This repo contains example code for a (very basic) ML platform.

  • The model-template directory contains an example for a Cookiecutter-based template that data scientists can clone to start a new project.
  • The infra directory contains Pulumi code that spins up the shared infrastructure of the ML platform, such as Kubernetes, MLFlow, etc.

Made with ❀️ by Aporia

The YouTube Video

IMAGE ALT TEXT HERE

Why?

As data science teams become more mature with models reaching actual production, the need for a proper infrastructure becomes crucial. Leading companies in the field with massive engineering teams like Uber, Netflix and Airbnb had created multiple solutions for their infrastructure and named the combination of them as β€œML Platform”.

We hope this repo can help you get started with building your own ML platform ❀️

Architecture

Based on the following projects:

When building your own ML platform, do not take these tools for granted! Check out alternatives and find the best tools that solve each one of your problems.

What's missing from this?

Well... a lot actually. Here's a partial list:

  • HTTPS & Authentication
  • Environments (staging, production)
  • Common library for preprocessing, postprocessing, etc
  • Model input & validation
  • Training orchestration
  • and probably much more!

We would love your help!