flash-attention
Fast and memory-efficient exact attention

deepdive
DeepDive

ThunderKittens
Tile primitives for speedy kernels

state-spaces
Sequence Modeling with Structured State Spaces

data-centric-ai
Resources for Data Centric AI

safari
Convolutions for Sequence Modeling

meerkat
Creative interactive views of any dataset.

hgcn
Hyperbolic Graph Convolutional Networks in PyTorch.

hyena-dna
Official implementation for HyenaDNA, a long-range genomic foundation model built with Hyena

ama_prompting
Ask Me Anything language model prompting

m2
Repo for "Monarch Mixer: A Simple Sub-Quadratic GEMM-Based Architecture"

H3
Language Modeling with the H3 State Space Model

evaporate
This repo contains data and code for the paper "Language Models Enable Simple Systems for Generating Structured Views of Heterogeneous Data Lakes"

manifest
Prompt programming with FMs.

pdftotree
🌲 A tool for converting PDF into hOCR with text, tables, and figures being recognized and preserved.

metal
Snorkel MeTaL: A framework for training models with multi-task weak supervision

fonduer
A knowledge base construction engine for richly formatted data

aisys-building-blocks
Building blocks for foundation models.

hyperbolics
Hyperbolic Embeddings

legalbench
An open science effort to benchmark legal reasoning in foundation models

flyingsquid
More interactive weak supervision with FlyingSquid

flash-fft-conv
FlashFFTConv: Efficient Convolutions for Long Sequences with Tensor Cores

KGEmb
Hyperbolic Knowledge Graph embeddings.

bootleg
Self-Supervision for Named Entity Disambiguation at the Tail

based
Code for exploring Based models from "Simple linear attention language models balance the recall-throughput tradeoff"

HypHC
Hyperbolic Hierarchical Clustering.

fly

TART
TART: A plug-and-play Transformer module for task-agnostic reasoning

tanda
Learning to Compose Domain-Specific Transformations for Data Augmentation

hippo-code

butterfly
Butterfly matrix multiplication in PyTorch

spacetime
Code for SpaceTime 🌌⏱️. Proposed in Effectively Modeling Time Series with Simple Discrete State Spaces, ICLR 2023.

zoology
Understand and test language model architectures on synthetic tasks.

lolcats
Repo for "LoLCATs: On Low-Rank Linearizing of Large Language Models"

babble
A system for generating training labels via natural language explanations

EmptyHeaded
Your worst case is our best case.

domino

blocking-tutorial

mindbender
Tools for iterative knowledge base development with DeepDive

reef
Automatically labeling training data

fm_data_tasks
Foundation Models for Data Tasks

fonduer-tutorials
A collection of simple tutorials for using Fonduer

eclair-agents
Automating enterprise workflows with multimodal agents

TreeStructure
Table Extraction Tool

CaffeConTroll

epoxy
Interactive Model Iteration with Weak Supervision and Pre-Trained Embeddings

HoroPCA
Hyperbolic PCA via Horospherical Projections

structured-nets
Structured matrices for compressing neural networks

hidden-stratification
Combating hidden stratification with GEORGE

numbskull
Numba-based version of DimmWitted Gibbs sampler

model-patching
Model Patching: Closing the Subgroup Performance Gap with Data Augmentation

skill-it
Skill-It! A Data-Driven Skills Framework for Understanding and Training Language Models

cs145-notebooks-2016
Public materials for the Fall 2016 offering of CS145

mandoline
(ICML 2021) Mandoline: Model Evaluation under Distribution Shift

mongoose
A Learnable LSH Framework for Efficient NN Training

thanos-code
Code release for the paper Perfectly Balanced: Improving Transfer and Robustness of Supervised Contrastive Learning

ukb-cardiac-mri
Weakly Supervised MRI Series Classification for the UK Biobank

tuffy
Tuffy, a Markov Logic Network solver

snorkel-superglue
Applying Snorkel to SuperGLUE

correct-n-contrast
Official code repository for Correct-N-Contrast

ludwig-benchmarking-toolkit
Ludwig benchmark

smallfry

tabi
Code release for Type-Aware Bi-Encoders for Open-Domain Entity Retrieval

lp_rffs
Low precision random Fourier features for kernel approximation

ddlog
Compiler for writing DeepDive applications in a Datalog-like language. ⚠️🚧🛑 REPO MOVED TO DEEPDIVE 👇🏿

wonderbread
WONDERBREAD benchmark + dataset for BPM tasks

augmentation_code
Reproducible code for Augmentation paper

sampler
DimmWitted Gibbs Sampler in C++. ⚠️🚧🛑 REPO MOVED TO DEEPDIVE 👉🏿

random_embedding

snorkel-biocorpus

ddbiolib
DeepDive Biomedical Tools

bazaar

Omnivore
Omnivore Optimizer and Distributed CcT

anchor-stability
A study of the downstream instability of word embeddings

medical-ned-integration
Cross-domain data integration for named entity disambiguation in biomedical text

dd-genomics
The Genomics DeepDive project

embroid
Embroid: Unsupervised Prediction Smoothing Can Improve Few-Shot Classification

torchhalp

dimmwitted

Accelerated-PCA
Accelerated Stochastic Power Iteration with Momentum

liger
Liger: Fusing Weak Supervision and Model Embeddings

cross-modal-ws-demo

hyperE

treedlib

ivy-tutorial
An Introductory Tutorial for Ivy

observational
Observational Supervision for Medical Image Classification using Gaze Data

chinstrap

quadrature-features
Code to generate kernel features using Gaussian quadrature

icij-maude
Weakly supervised classification of adverse event reports from the FDA's MAUDE database.

librarian
DeepDive Librarian for managing all data sets we publish and receive

halp

bert-pretraining

d3m-model-search
D3M Model Search Component

elementary
Data services and APIs

dependency_model
Structure learning code from [ICML'19 paper](https://arxiv.org/abs/1903.05844)