@mlfoundations
  • Stars
    star
    18,028
  • Global Org. Rank 1,247 (Top 0.4 %)
  • Registered over 3 years ago
  • Most used languages
    Python
    73.9 %
    HTML
    8.7 %
    CSS
    4.3 %

Top repositories

1

open_clip

An open source implementation of CLIP.
Python
9,941
star
2

open_flamingo

An open-source framework for training large multimodal models.
Python
3,716
star
3

dclm

DataComp for Language Models
HTML
1,073
star
4

MINT-1T

MINT-1T: A one trillion token multimodal interleaved dataset.
749
star
5

datacomp

DataComp: In search of the next generation of multimodal datasets
Python
628
star
6

wise-ft

Robust fine-tuning of zero-shot models
Python
618
star
7

open_lm

A repository for research on medium sized language models.
Python
475
star
8

model-soups

Model soups: averaging weights of multiple fine-tuned models improves accuracy without increasing inference time
Python
412
star
9

task_vectors

Editing Models with Task Arithmetic
Python
397
star
10

open-diffusion

Simple large-scale training of stable diffusion with multi-node support.
Python
120
star
11

scaling

Language models scale reliably with over-training and on downstream tasks
Jupyter Notebook
90
star
12

patching

Patching open-vocabulary models by interpolating weights
Python
87
star
13

VisIT-Bench

Python
46
star
14

imagenet-captions

Release of ImageNet-Captions
45
star
15

tableshift

A benchmark for distribution shift in tabular data
Python
38
star
16

clip_quality_not_quantity

Python
28
star
17

rtfm

Research on Tabular Foundation Models
Python
20
star
18

dataset2metadata

Python
19
star
19

spark-commoncrawl

Jupyter Notebook
6
star
20

datacomp_site

HTML
6
star
21

tabliblib

A Python library for processing and filtering TabLib
Python
5
star
22

webdataset-resharder

Efficiently process webdatasets
Python
4
star
23

imagenet-applications-transfer

Python
2
star
24

au21

Jupyter Notebook
1
star
25

advancedml-sp23

CSS
1
star