Open-Assistant
OpenAssistant is a chat-based assistant that understands tasks, can interact with third-party systems, and retrieve information dynamically to do so.
audio-dataset
Audio Dataset for training CLAP and other models
CLIP_benchmark
CLIP-like model evaluation
dalle2-laion
Pretrained Dalle2 from laion
CLAP
Contrastive Language-Audio Pretraining
natural_voice_assistant
laion-3d
Collect large 3d dataset and build models
phenaki
A phenaki reproduction using pytorch.
aesthetic-predictor
A linear estimator on top of clip to predict the aesthetic quality of pictures
Open-Instruction-Generalist
Open Instruction Generalist is an assistant trained on massive synthetic instructions to perform many millions of tasks
ldm-finetune
Home of `erlich` and `ongo`. Finetune latent-diffusion/glid-3-xl text2image on your own data.
scaling-laws-openclip
Reproducible scaling laws for contrastive language-image learning (https://arxiv.org/abs/2212.07143)
CLIP-based-NSFW-Detector
laion-datasets
Description and pointers of laion datasets
laion-dreams
Aim for the moon. If you miss, you may hit a star.
laion.ai
AIW
Alice in Wonderland code base for experiments and raw experiments data
LAION-5B-WatermarkDetection
video-clip
Let's make a video clip
Open-GIA
O-GIA is an umbrella for a research, infrastructure and projects ecosystem that should provide open source, reproducible datasets, models, applications & safety tools for Open Generalist Interactive Agents (O-GIA). O-GIA systems will act in collaboration with humans or autonomously, supporting various kinds of validated decision making and assistance.
General-GPT
Discord-Scrapers
Implementation of a discord channel scraper to generate datasets.
Text-to-speech
Big-Interleaved-Dataset
Big-Interleaved-Dataset
riverbed
Tools for content datamining and NLP at scale
OCR-ensemble
Conditional-Pretraining-of-Large-Language-Models
interesting-text-datasets
blade2blade
Adversarial Training and SFT for Bot Safety Models
temporal-embedding-aggregation
Aggregating embeddings over time
deep-image-diffusion-prior
Inverts CLIP text embeds to image embeds and visualizes with deep-image-prior.
watermark-detection
A repository containing datasets and tools to train a watermark classifier.
medical
This repository will be a summary and outlook on all our open, medical, AI advancements.
Anh
Anh - LAION's multilingual assistant datasets and models
laion50BU
Un-*** 50 billions multimodality dataset
conditioned-prior
(wip) Use LAION-AI's CLIP "conditioned prior" to generate CLIP image embeds from CLIP text embeds.
LAION-SAFETY
An open toolbox for NSFW & toxicity detection
opendream
Frontend (and soon also middleware and backend) for a new, open-source image generation platform.
laion5B-paper
Building the laion5B paper
laion-dedup
notebooks
A collection of generative and training notebooks getting mirrored to google colab.
laionide
This repository contains training code and checkpoints for finetuning glide.
super-resolution
This is the LAION repository for creating open super-resolution models with the help of LAION-5B subsets.
dataset-spec
Describe the format of image/text datasets
LAION-PEOPLE
This project provides a data set with bounding boxes, body poses, 3D face meshes & captions of people from our LAION-2.2B. Additionally it provides clusters based on the poses and face meshes and pose-related captions based on these cluster assignments.
image-deduplication-testset
project-menu
Projects at LAION
laion-ai.github.io
laion github website
dataset-usage
This repository is a summary of all systems and scientific papers that use LAION datasets.
repository-overview
This repository will give a quick overview of all projects and repositories from LAION.
LionizeR
Experiments with Summarization, Long Context and Retrieval
KAISER
Knowledge Acquisition and Interlinking via Semantic Embeddings and Reasoning
decentralized-learning
A basic setup for decentralized-learning that can be used for training future DALLE/CLIP/CLAP models.
diffusion-prior
DALL-E2 diffusion prior
GIF
General / Global Inference Framework
website
This is the development repository of the LAION-AI website.
safety-pipeline
A collection of safety classifiers and models to process images and text.
NeoGen
laion5b-subsets
Creating subsets from laion5b via embeddings search
human_artifacts
A repo containing images for artifact annotation.
public-relations
All media / publicity on LAION and related stuff!
public-domain-images
A collection of public domain images donated for ML training.
math_problems-step-by-step_solutions
Here we provide and collect many functions to generate math problems and step-by-step solutions for LLM training
language-models
dataset-inference
The new repository for the general inference pipeline.
introduction-resources
Recommended intro resources
balanced-laion5b
This repository aims to help find a good distribution for huge datasets like LAION-5B for more efficient training.
hand-inference
A model to run hand inference on a cluster.
BUD-E_V1.0
BUD-E (Buddy) is an open-source voice assistant framework that facilitates seamless interaction with AI models and APIs, enabling the creation and integration of diverse skills for educational and research applications.
laion5b-bias
This repository is a collection of found biases in the LAION-5B dataset.
dataset-tasks
Datasets that should be downloaded & converted to our standard training format.
LAION-AUDIO
This repository contains prompts & best practices to annotate audio clips with a very high degree of detail using Audio-Language-Models
AIW_webpage
Alice in Wonderland project and initiative webpage