• Stars
    star
    119
  • Rank 288,016 (Top 6 %)
  • Language
    Jupyter Notebook
  • License
    MIT License
  • Created 4 months ago
  • Updated 4 months ago

Reviews

There are no reviews yet. Be the first to send feedback to the community and the maintainers!

Repository Details

Annotated version of the Mamba paper

Go to Blog Notebook Colab

image

Mamba: Linear-Time Sequence Modeling with Selective State Spaces https://arxiv.org/abs/2312.00752

Challenge

The triton version is still a lot slower than the mamba custom kernel. If anyone has ideas for speeding it up, I would love to hear them. The file final.py has a minimal version with benchmarking.

pip install mamba-ssm
pip install -U http://kermit.bounceme.net:8900/triton-3.0.0-cp310-cp310-linux_x86_64.whl

More Repositories

1

GPU-Puzzles

Solve puzzles. Learn CUDA.
Jupyter Notebook
4,778
star
2

Tensor-Puzzles

Solve puzzles. Improve your pytorch.
Jupyter Notebook
2,276
star
3

MiniChain

A tiny library for coding with large language models.
Python
1,160
star
4

llama2.rs

A fast llama2 decoder in pure Rust.
Rust
865
star
5

LLM-Training-Puzzles

What would you do with 1000 H100s...
Jupyter Notebook
636
star
6

annotated-s4

Implementation of https://srush.github.io/annotated-s4
Python
394
star
7

streambook

Live Python Notebooks with any Editor
Jupyter Notebook
273
star
8

Autodiff-Puzzles

Jupyter Notebook
260
star
9

raspy

An interactive exploration of Transformer programming.
Jupyter Notebook
212
star
10

Transformer-Puzzles

Puzzles for exploring transformers
Jupyter Notebook
207
star
11

parallax

Python
157
star
12

do-we-need-attention

TeX
152
star
13

awesome-ml-tracking

102
star
14

GPTWorld

A puzzle to learn about prompting
Jupyter Notebook
80
star
15

torch-queue

Python
62
star
16

triton-autodiff

Experiment of using Tangent to autodiff triton
Python
55
star
17

torch-golf

Silly twitter torch implementations.
Python
43
star
18

PyDecode

A dynamic programming toolkit.
C++
39
star
19

VirtualTeaching

DIY setup for virtual teaching on ubuntu
39
star
20

LLM-Talk

38
star
21

learns-dex

33
star
22

text2table

Python
31
star
23

jax-lda

Python
31
star
24

ProbTalk

HTML
28
star
25

Hierarchical-Bayes-Compiler

Hal Daume's hbc
Haskell
20
star
26

g9py

HTML
18
star
27

transformers-bet

HTML
11
star
28

relax-decode

Java
10
star
29

mamba-scans

Blog post
9
star
30

drop7

Jupyter Notebook
8
star
31

aima-arguments

7
star
32

Triton-Puzzles

Jupyter Notebook
7
star
33

minitorch-rust

7
star
34

torch-mechanics

Amateur experiments with autodiff mechanics simulators
6
star
35

cs5781

Machine Learning Engineering
6
star
36

postgres-provanence

C
6
star
37

SemiRings

Holder for a bunch of semirings used in ChartParsing
Haskell
6
star
38

MRF-LM

Shell
5
star
39

TextBook

Command-line Facebook
Haskell
5
star
40

hsNLP-

Combined repo for nlp libs
Haskell
4
star
41

provenance

4
star
42

icfp2009

when I was 4 years old I was maimed by a giant pig
Haskell
4
star
43

Tensor-Puzzles-Penzai

HTML
4
star
44

PowerEdit

A super-minimal Python-based video editor âš¡
Python
4
star
45

configure

some configuration file
Emacs Lisp
3
star
46

clustering

C++
3
star
47

BT-AI

Jupyter Notebook
3
star
48

annotated-transformer.github.io

Annotated Transformer Blog Post
3
star
49

srush-blog

Haskell
2
star
50

Eisner-Parser

An implementation of the Eisner Parser (described in "Bilexical Grammars and a Cubic-time parsing algorithm" ) in Haskell
Haskell
2
star
51

FSM

Finite State Machine lib for haskell
Haskell
2
star
52

hplay

2
star
53

opennmt-gen

Shell
2
star
54

PhraseDep

C++
2
star
55

srush-wiki

2
star
56

tf-fork

Python
2
star
57

icfp2003

Race Car
Haskell
2
star
58

icfp2008

2
star
59

hypergraph

Hypergraph specification
Python
2
star
60

triton

2
star
61

learns-triton

2
star
62

bipartite-sampler

Implementation of Huber-Law rejection sampling for bipartite graphs
C
1
star
63

Training

Haskell
1
star
64

ezTVM

1
star
65

test_grade

Python
1
star
66

Chart-Parsing-

haskell library for basic chart parsers
Haskell
1
star
67

blog-twitter

1
star
68

sigmoidfit

Jupyter Notebook
1
star
69

evernote

Command line bindings for evernote
1
star
70

transforest

transforest
Python
1
star
71

decoding-methods

1
star
72

blog

Jupyter Notebook
1
star
73

nlp-course

Go
1
star
74

beamer-animation

Create animations for LaTeX Beamer presentations.
Python
1
star
75

Duel

Python
1
star
76

nlp

1
star
77

twitter-simmons-sports

1
star
78

monadnack-project

Art project for monadnack
1
star
79

peoplesounds

Python
1
star
80

osgai

JavaScript
1
star
81

CutParse

C++
1
star
82

Penn-Treebank

Haskell library for the penn treebank management
1
star
83

Lattice

lattice protobuffer
Python
1
star
84

icfp2020

Python
1
star
85

twittersports

1
star
86

ProbDist

Tools for probabality distributions focusing on estimation, conditioning, and smoothing
Haskell
1
star
87

mamba-primer

1
star