  • Stars 3
  • Rank 3,963,521 (Top 79 %)
  • Language MDX
  • Created 9 months ago
  • Updated about 1 month ago

Repository Details

Configuration for generating SDKs and Documentation.

More Repositories

1

Apple-M1-BERT

3X speedup over Apple’s TensorFlow plugin by using Apache TVM on M1
Python
135
star
2

octoml-profile

Home for OctoML PyTorch Profiler
105
star
3

synr

A library for syntactically rewriting Python programs, pronounced (sinner).
Python
70
star
4

octoai-textgen-cookbook

Simple getting-started code examples for LLM applications powered by OctoAI
Python
42
star
5

deformable-attention-kernel

TVMScript kernel for deformable attention
Python
24
star
6

triton-client-rs

A client library in Rust for Nvidia Triton.
Rust
23
star
7

octoml-llm-qa

A code sample that shows how to use 🦜️🔗langchain, 🦙llama_index and a hosted LLM endpoint to do a standard chat or Q&A about a pdf document
Python
18
star
8

tvm2onnx

An open-source tool created by OctoML that converts TVM-optimized models to code runnable in ONNX Runtime.
Python
15
star
9

relax

A fork of tvm/unity
Python
15
star
10

octoml-cli-tutorials

A repository containing full end-to-end examples of the OctoML CLI workflow.
Python
14
star
11

TransparentAI

An example of building your own ML cloud app using OctoML.
Python
13
star
12

public-tvm-docker

Build TVM docker image for production compilation deployments
13
star
13

qualcomm

C
8
star
14

dockercon23-octoai

DockerCon 2023 OctoAI AI/ML Workshop GitHub Repo
Jupyter Notebook
7
star
15

tvm-build

A library for building TVM programmatically.
Rust
7
star
16

mlops

CK MLOps components
6
star
17

octoml-examples

A collection of test models for the OctoML AI acceleration service
5
star
18

octoai-apps

A collection of OctoAI-based demos.
TypeScript
5
star
19

macho-dyld

Custom dyld version inherited from original Apple dyld implementation
C++
4
star
20

cm-mlops

Collective Mind repository with unified automations to automatically co-design, optimize and deploy intelligent and Pareto-efficient systems across continuously changing software and hardware stacks.
Python
4
star
21

mlperf-loadgen-harness

A simple Python harness to run an ONNX model in various concurrency and replication configurations against MLCommons' LoadGen to measure throughput.
Python
4
star
22

octoai-template-apps

Python
3
star
23

mlcommons-inference

Fork of MLCommons inference repository to test TVM integration
Python
2
star
24

azsphere

TVM on Azure Sphere Platform
C
2
star
25

venv

CK virtual environment
Python
2
star
26

octoai-launch-examples

Examples of how to build Generative AI applications powered by the OctoAI compute service.
Jupyter Notebook
1
star
27

octocloud-templates

Python
1
star
28

.github

1
star