flash-attention
Fast and memory-efficient exact attention
deepdive
DeepDive
state-spaces
Sequence Modeling with Structured State Spaces
data-centric-ai
Resources for Data Centric AI
safari
Convolutions for Sequence Modeling
meerkat
Creative interactive views of any dataset.
hgcn
Hyperbolic Graph Convolutional Networks in PyTorch.
ama_prompting
Ask Me Anything language model prompting
m2
Repo for "Monarch Mixer: A Simple Sub-Quadratic GEMM-Based Architecture"
H3
Language Modeling with the H3 State Space Model
hyena-dna
Official implementation for HyenaDNA, a long-range genomic foundation model built with Hyena
evaporate
This repo contains data and code for the paper "Language Models Enable Simple Systems for Generating Structured Views of Heterogeneous Data Lakes"
manifest
Prompt programming with FMs.
metal
Snorkel MeTaL: A framework for training models with multi-task weak supervision
pdftotree
🌲 A tool for converting PDF into hOCR with text, tables, and figures being recognized and preserved.
fonduer
A knowledge base construction engine for richly formatted data
hyperbolics
Hyperbolic Embeddings
flyingsquid
More interactive weak supervision with FlyingSquid
legalbench
An open science effort to benchmark legal reasoning in foundation models
KGEmb
Hyperbolic Knowledge Graph embeddings.
aisys-building-blocks
Building blocks for foundation models.
flash-fft-conv
FlashFFTConv: Efficient Convolutions for Long Sequences with Tensor Cores
bootleg
Self-Supervision for Named Entity Disambiguation at the Tail
HypHC
Hyperbolic Hierarchical Clustering.
TART
TART: A plug-and-play Transformer module for task-agnostic reasoning
fly
based
Code for exploring Based models from "Simple linear attention language models balance the recall-throughput tradeoff"
tanda
Learning to Compose Domain-Specific Transformations for Data Augmentation
spacetime
Code for SpaceTime 🌌⏱️. Proposed in Effectively Modeling Time Series with Simple Discrete State Spaces, ICLR 2023.
butterfly
Butterfly matrix multiplication in PyTorch
babble
A system for generating training labels via natural language explanations
zoology
Understand and test language model architectures on synthetic tasks.
hippo-code
EmptyHeaded
Your worst case is our best case.
domino
blocking-tutorial
mindbender
Tools for iterative knowledge base development with DeepDive
reef
Automatically labeling training data
fonduer-tutorials
A collection of simple tutorials for using Fonduer
fm_data_tasks
Foundation Models for Data Tasks
TreeStructure
Table Extraction Tool
CaffeConTroll
HoroPCA
Hyperbolic PCA via Horospherical Projections
structured-nets
Structured matrices for compressing neural networks
hidden-stratification
Combating hidden stratification with GEORGE
numbskull
Numba-based version of DimmWitted Gibbs sampler
model-patching
Model Patching: Closing the Subgroup Performance Gap with Data Augmentation
cs145-notebooks-2016
Public materials for the Fall 2016 offering of CS145
skill-it
Skill-It! A Data-Driven Skills Framework for Understanding and Training Language Models
mandoline
(ICML 2021) Mandoline: Model Evaluation under Distribution Shift
mongoose
A Learnable LSH Framework for Efficient NN Training
thanos-code
Code release for the paper Perfectly Balanced: Improving Transfer and Robustness of Supervised Contrastive Learning
tuffy
Tuffy, a Markov Logic Network solver
snorkel-superglue
Applying Snorkel to SuperGLUE
ukb-cardiac-mri
Weakly Supervised MRI Series Classification for the UK Biobank
correct-n-contrast
Official code repository for Correct-N-Contrast
ludwig-benchmarking-toolkit
Ludwig benchmark
ddlog
Compiler for writing DeepDive applications in a Datalog-like language - ⚠️🚧🛑 REPO MOVED TO DEEPDIVE 👇🏿
augmentation_code
Reproducible code for Augmentation paper
smallfry
tabi
Code release for Type-Aware Bi-Encoders for Open-Domain Entity Retrieval
lp_rffs
Low precision random Fourier features for kernel approximation
sampler
DimmWitted Gibbs Sampler in C++ - ⚠️🚧🛑 REPO MOVED TO DEEPDIVE 👉🏿
random_embedding
snorkel-biocorpus
bazaar
ddbiolib
DeepDive Biomedical Tools
anchor-stability
A study of the downstream instability of word embeddings
Omnivore
Omnivore Optimizer and Distributed CcT
embroid
Embroid: Unsupervised Prediction Smoothing Can Improve Few-Shot Classification
dd-genomics
The Genomics DeepDive project
dimmwitted
medical-ned-integration
Cross-domain data integration for named entity disambiguation in biomedical text
torchhalp
cross-modal-ws-demo
liger
Liger: Fusing Weak Supervision and Model Embeddings
treedlib
Accelerated-PCA
Accelerated Stochastic Power Iteration with Momentum
hyperE
chinstrap
ivy-tutorial
An Introductory Tutorial for Ivy
quadrature-features
Code to generate kernel features using Gaussian quadrature
icij-maude
Weakly supervised classification of adverse event reports from the FDA's MAUDE database.
observational
Observational Supervision for Medical Image Classification using Gaze Data
librarian
DeepDive Librarian for managing all data sets we publish and receive
halp
bert-pretraining
d3m-model-search
D3M Model Search Component
elementary
Data services and APIs
dependency_model
Structure learning code from [ICML'19 paper](https://arxiv.org/abs/1903.05844)