BentoML (@bentoml)

Top repositories

1

OpenLLM

Run any open-source LLMs, such as Llama 2, Mistral, as OpenAI compatible API endpoint in the cloud.
Python
9,124
star
2

BentoML

The easiest way to serve AI/ML models in production - Build Model Inference Service, LLM APIs, Multi-model Inference Graph/Pipelines, LLM/RAG apps, and more!
Python
6,714
star
3

Yatai

Model Deployment at Scale on Kubernetes πŸ¦„οΈ
TypeScript
771
star
4

OneDiffusion

OneDiffusion: Run any Stable Diffusion models and fine-tuned weights with ease
Python
325
star
5

stable-diffusion-server

Deploy Your Own Stable Diffusion Service
Python
191
star
6

bentoctl

Fast model deployment on any cloud πŸš€
Python
172
star
7

gallery

BentoML Example Projects 🎨
Python
134
star
8

OCR-as-a-Service

Turn any OCR models into online inference API endpoint πŸš€ πŸŒ–
Python
47
star
9

transformers-nlp-service

Online Inference API for NLP Transformer models - summarization, text classification, sentiment analysis and more
Python
41
star
10

CLIP-API-service

CLIP as a service - Embed image and sentences, object recognition, visual reasoning, image classification and reverse image search
Jupyter Notebook
36
star
11

BentoVLLM

Self-host LLMs with vLLM and BentoML
Python
32
star
12

simple_di

Simple dependency injection framework for Python
Python
19
star
13

yatai-deployment

πŸš€ Launching Bento in a Kubernetes cluster
Go
16
star
14

Fraud-Detection-Model-Serving

Online model serving with Fraud Detection model trained with XGBoost on IEEE-CIS dataset
Jupyter Notebook
14
star
15

aws-sagemaker-deploy

Fast model deployment on AWS Sagemaker
Python
14
star
16

yatai-image-builder

🐳 Build OCI images for Bentos in k8s
Go
14
star
17

sentence-embedding-bento

Sentence Embedding as a Service
Jupyter Notebook
14
star
18

google-cloud-run-deploy

Fast model deployment on Google Cloud Run
Python
13
star
19

aws-lambda-deploy

Fast model deployment on AWS Lambda
Python
13
star
20

aws-ec2-deploy

Fast model deployment on AWS EC2
Python
13
star
21

IF-multi-GPUs-demo

Python
13
star
22

rag-tutorials

a series of tutorials implementing rag service with BentoML and LlamaIndex
Python
11
star
23

diffusers-examples

API serving for your diffusers models
Python
10
star
24

BentoSVD

Python
9
star
25

Pneumonia-Detection-Demo

Pneumonia Detection - Healthcare Imaging Application built with BentoML and fine-tuned Vision Transformer (ViT) model
Python
8
star
26

yatai-chart

Helm Chart for installing Yatai on Kubernetes ⎈
Mustache
7
star
27

benchmark

BentoML Performance Benchmark πŸ†š
Jupyter Notebook
7
star
28

plugins

the swish knife to all things bentoml.
Starlark
6
star
29

bentoctl-operator-template

Python
6
star
30

heroku-deploy

Deploy BentoML bundled models to Heroku
Python
6
star
31

BentoLMDeploy

Self-host LLMs with LMDeploy and BentoML
Python
5
star
32

bentoml-core

Rust
5
star
33

BentoControlNet

Python
4
star
34

BentoWhisperX

Python
4
star
35

google-compute-engine-deploy

HCL
4
star
36

BentoCLIP

building a CLIP application using BentoML
Python
4
star
37

BentoRAG

Tutorial: Build RAG Apps with Custom Models Served with BentoML
Python
4
star
38

quickstart

BentoML Quickstart Example
Python
4
star
39

deploy-bento-action

A GitHub Action to deploy bento to cloud
3
star
40

azure-functions-deploy

Fast model deployment on Azure Functions
Python
3
star
41

azure-container-instances-deploy

Fast model deployment on Azure container instances
Python
3
star
42

containerize-push-action

docker's build-and-push-action equivalent for bentoml
TypeScript
3
star
43

BentoSentenceTransformers

how to build a sentence embedding application using BentoML
Python
2
star
44

BentoTRTLLM

Python
2
star
45

bentoml-arize-fraud-detection-workshop

Jupyter Notebook
2
star
46

BentoSDXLTurbo

how to build an image generation application using BentoML
Python
2
star
47

yatai-schemas

Go
1
star
48

bentoctl-workshops

Python
1
star
49

llm-bench

Python
1
star
50

bentocloud-homepage-news

1
star
51

yatai-common

Go
1
star
52

BentoBLIP

how to build an image captioning application on top of a BLIP model with BentoML
Python
1
star
53

BentoYolo

BentoML service of YOLO v8
Python
1
star
54

.github

βœ¨πŸ±πŸ¦„οΈ
1
star
55

BentoBark

Python
1
star
56

BentoMLCLLM

Python
1
star
57

BentoTGI

Python
1
star
58

openllm-benchmark

Python
1
star