There are no reviews yet. Be the first to send feedback to the community and the maintainers!
flash-attention
Fast and memory-efficient exact attentiondeepdive
DeepDiveThunderKittens
Tile primitives for speedy kernelsstate-spaces
Sequence Modeling with Structured State Spacesdata-centric-ai
Resources for Data Centric AIsafari
Convolutions for Sequence Modelingmeerkat
Creative interactive views of any dataset.hgcn
Hyperbolic Graph Convolutional Networks in PyTorch.hyena-dna
Official implementation for HyenaDNA, a long-range genomic foundation model built with Hyenaama_prompting
Ask Me Anything language model promptingm2
Repo for "Monarch Mixer: A Simple Sub-Quadratic GEMM-Based Architecture"H3
Language Modeling with the H3 State Space Modelevaporate
This repo contains data and code for the paper "Language Models Enable Simple Systems for Generating Structured Views of Heterogeneous Data Lakes"manifest
Prompt programming with FMs.metal
Snorkel MeTaL: A framework for training models with multi-task weak supervisionpdftotree
🌲 A tool for converting PDF into hOCR with text, tables, and figures being recognized and preserved.fonduer
A knowledge base construction engine for richly formatted datahyperbolics
Hyperbolic Embeddingsflyingsquid
More interactive weak supervision with FlyingSquidlegalbench
An open science effort to benchmark legal reasoning in foundation modelsaisys-building-blocks
Building blocks for foundation models.flash-fft-conv
FlashFFTConv: Efficient Convolutions for Long Sequences with Tensor CoresKGEmb
Hyperbolic Knowledge Graph embeddings.bootleg
Self-Supervision for Named Entity Disambiguation at the Tailbased
Code for exploring Based models from "Simple linear attention language models balance the recall-throughput tradeoff"HypHC
Hyperbolic Hierarchical Clustering.TART
TART: A plug-and-play Transformer module for task-agnostic reasoningfly
tanda
Learning to Compose Domain-Specific Transformations for Data Augmentationspacetime
Code for SpaceTime 🌌⏱️. Proposed in Effectively Modeling Time Series with Simple Discrete State Spaces, ICLR 2023.butterfly
Butterfly matrix multiplication in PyTorchzoology
Understand and test language model architectures on synthetic tasks.hippo-code
babble
A system for generating training labels via natural language explanationsEmptyHeaded
Your worst case is our best case.domino
blocking-tutorial
mindbender
Tools for iterative knowledge base development with DeepDivereef
Automatically labeling training datafonduer-tutorials
A collection of simple tutorials for using Fonduerfm_data_tasks
Foundation Models for Data TasksTreeStructure
Table Extraction ToolCaffeConTroll
eclair-agents
Automating enterprise workflows with multimodal agentsepoxy
Interactive Model Iteration with Weak Supervision and Pre-Trained EmbeddingsHoroPCA
Hyperbolic PCA via Horospherical Projectionsstructured-nets
Structured matrices for compressing neural networkshidden-stratification
Combating hidden stratification with GEORGEnumbskull
Numba-based version of DimmWitted Gibbs samplermodel-patching
Model Patching: Closing the Subgroup Performance Gap with Data Augmentationskill-it
Skill-It! A Data-Driven Skills Framework for Understanding and Training Language Modelscs145-notebooks-2016
Public materials for the Fall 2016 offering of CS145mandoline
(ICML 2021) Mandoline: Model Evaluation under Distribution Shiftmongoose
A Learnable LSH Framework for Efficient NN Trainingthanos-code
Code release for the paper Perfectly Balanced: Improving Transfer and Robustness of Supervised Contrastive Learningprefix-linear-attention
tuffy
Tuffy, a Markov Logic Network solverukb-cardiac-mri
Weakly Supervised MRI Series Classification for the UK Biobanksnorkel-superglue
Applying Snorkel to SuperGLUEcorrect-n-contrast
Official code repository for Correct-N-Contrastludwig-benchmarking-toolkit
Ludwig benchmarkddlog
Compiler for writing DeepDive applications in a Datalog-like language — ⚠️🚧🛑 REPO MOVED TO DEEPDIVE 👇🏿augmentation_code
Reproducible code for Augmentation papersmallfry
tabi
Code release for Type-Aware Bi-Encoders for Open-Domain Entity Retrievallp_rffs
Low precision random Fourier features for kernel approximationsampler
DimmWitted Gibbs Sampler in C++ — ⚠️🚧🛑 REPO MOVED TO DEEPDIVE 👉🏿random_embedding
snorkel-biocorpus
bazaar
ddbiolib
DeepDive Biomedical Toolsanchor-stability
A study of the downstream instability of word embeddingsOmnivore
Omnivore Optimizer and Distributed CcTembroid
Embroid: Unsupervised Prediction Smoothing Can Improve Few-Shot Classificationdd-genomics
The Genomics DeepDive projectmedical-ned-integration
Cross-domain data integration for named entity disambiguation in biomedical textdimmwitted
torchhalp
cross-modal-ws-demo
hyperE
treedlib
Accelerated-PCA
Accelerated Stochastic Power Iteration with Momentumliger
Liger: Fusing Weak Supervision and Model Embeddingsivy-tutorial
An Introductory Tutorial for Ivychinstrap
observational
Observational Supervision for Medical Image Classification using Gaze Dataquadrature-features
Code to generate kernel features using Gaussian quadratureicij-maude
Weakly supervised classification of adverse event reports from the FDA's MAUDE database.librarian
DeepDive Librarian for managing all data sets we publish and receivehalp
bert-pretraining
d3m-model-search
D3M Model Search Componentelementary
Data services and APIsdependency_model
Structure learning code from [ICML'19 paper](https://arxiv.org/abs/1903.05844)Love Open Source and this site? Check out how you can help us