There are no reviews yet. Be the first to send feedback to the community and the maintainers!
scikit-lego
Extra blocks for scikit-learn pipelines.human-learn
Natural Intelligence is still a pretty good idea.drawdata
Draw datasets from within Jupyter.doubtlab
Doubt your data, find bad labels.whatlies
Toolkit to help understand "what lies" in word embeddings. Also benchmarking!bulk
A Simple Bulk Labelling Toolembetter
just a bunch of useful embeddingscluestar
Gain clues from clustering!calm-notebooks
notebooks that are used at calmcode.ioclumper
A small python library that can clump lists of data together.simsity
Super Simple Similarities Servicememo
Decorators that logs stats.mktestdocs
Run pytest against markdown files/docstrings.spacy-youtube-material
Here are the notebooks used during the spacy youtube series.tuilwindcss
Very much like Tailwind, but for TUI frameworks in Textual.tokenwiser
Bag of, not words, but tricks!skedulord
captures logs and makes cron more funpytest-duration-insights
A mini dashboard to help find slow tests in pytest.arxiv-frontpage
My personal frontpage appscikit-partial
Pipeline components that support partial_fit.spacy-report
Generate reports for spaCy models.brent
bayesian graphical modelling and a bit of do-calculus for discrete data.icepickle
It's a cooler way to store simple linear models.koaning
justcharts
Just charts. Really.scikit-prune
Prune your sklearn modelsthismonth.rocks
motivational website to do something special this monthsentimany
Just another sentiment wrapper.kadro
A friendly pandas wrapper with a more composable grammar support.prodigy-tui
A textual TUI for Prodigycalmcode-feedback
A repo to collect issues with calmcode.ioopen_notebooks
Some notebooks that I've shared.sentence-models
A different, but useful, textcat approach.paftdunk
Recommendin' all night to get lucky.proglang-project
scikit-teach
Active Learning Benchmarkstexttoolz
tools and tricks that are good to have aroundmakefile-demo
just a demo of a makefile in actiongitlit
Streamlit App on Github Actionskolektor
Let's give this git-scraping a try.optimal-on-paper
broken in realityliBERTy
A benchmark to compare BERT against sklearn.classycookie
cookiecutter to run standard text classifierslazylines
Pipelines for JSONL filessalary-bias
just another dangerous situationdql101
A 101 repo with some code for openai Deep Q Learningboondoc
lightweight Python API docs for markdownsubspacy
BPEmb embeddings for spaCyakin
Some text similarity utilitiescalm-stats
Some GitScraperscalmcode-datasets
Just a Collection of Datasetskoaning-old.github.io
my personal blogsushigo
An OpenAi-like environment for the sushi go card game.featherbed
Very lightweight text vectors via tf/idf + SVDonnx-demo
onnx seems interestingbenchmarks
Collection of benchmarksbaseliner
baseliner offers simple models that can act as a baseline to compare againstspacy-intent-example
intent prediction example on spaCy v3scikit-bloom
Bloom tricks for text pipelines in scikit-learn.github-slideshow
A robot powered training repository 🤖wordlists
Just a bunch of potentially useful wordlists.gli
my gleeful scripts for the clilabeltable
Things for bulk labelling.fusebox
Finetune-able Universal Sentence Encodersubsette
A dash-boarding environment for datasette.manyterms
Many terms for whatever purposes (weak labelling)sentency
Lightweight SpaCy pipeline to detect sentences.pydata-slovenia-talk
Bag of NLP Tricks!helloworld
a helloworld package that should just workuvnb
Have UV deal with all your Jupyter deps.blackjack
a simple pytest demodemopkg
a demo pkg in R with github actionslamarl
sushigo simulations on an aws backendwow-avatar-datasets
A place to host some parquet files.python_data_intro
A beginner notebook for people who want to get started with python and data. Joy ensues!buggingface
Let's see what we can learn from poking huggingface models.digital-potato
gha-demo
Demo application for GitHub Actions tutorial.fastfood-bot
a rasa demo that can find you a fast food locationecosystem-watcher
Just keeping an eye on the ecosystem.git-scrape-unravel
CLI to unravel git-scraped code.scikit-prodigy
Helpers to leverage scikit-learn pipelines in Prodigy.skooba
less weak supervisionrasa-nlu-deploy
A demo that can run Rasa NLU in a container.datasette-parcoords
Parallel coordinates chart for datasettenlu-cluster-demo
Upload your model file and talk to it!tjek
tjek changes with the main branchkatacoda-scenarios
Katacoda Scenariosbulk-datasets
Helpers for the download command.there-are-no-bad-labels
Repo for the PyData 2023 Workshoptokenvolt
Populate an embedding cache quickly and get on with your day.rusty
Learning how to Rstuvtrick
I really outdid myself with this hack.ollama-railway
Just to see if this might work out well.Love Open Source and this site? Check out how you can help us