• Stars
    star
    54
  • Rank 544,902 (Top 11 %)
  • Language
    Python
  • License
    MIT License
  • Created over 4 years ago
  • Updated over 1 year ago

Reviews

There are no reviews yet. Be the first to send feedback to the community and the maintainers!

Repository Details

☄️ Parallel and distributed training with spaCy and Ray

More Repositories

1

spaCy

💫 Industrial-strength Natural Language Processing (NLP) in Python
Python
29,546
star
2

thinc

🔮 A refreshing functional take on deep learning, compatible with your favorite libraries
Python
2,813
star
3

spacy-course

👩‍🏫 Advanced NLP with spaCy: A free online course
Python
2,299
star
4

sense2vec

🦆 Contextually-keyed word vectors
Python
1,615
star
5

spacy-models

💫 Models for the spaCy Natural Language Processing (NLP) library
Python
1,589
star
6

spacy-transformers

🛸 Use pretrained transformers like BERT, XLNet and GPT-2 in spaCy
Python
1,334
star
7

projects

🪐 End-to-end NLP workflows from prototype to production
Python
1,285
star
8

spacy-llm

🦙 Integrating LLMs into structured NLP pipelines
Python
1,049
star
9

curated-transformers

🤖 A PyTorch library of curated Transformer models and their composable components
Python
858
star
10

spacy-streamlit

👑 spaCy building blocks and visualizers for Streamlit apps
Python
787
star
11

spacy-stanza

💥 Use the latest Stanza (StanfordNLP) research models directly in spaCy
Python
722
star
12

prodigy-recipes

🍳 Recipes for the Prodigy, our fully scriptable annotation tool
Jupyter Notebook
477
star
13

wasabi

🍣 A lightweight console printing and formatting toolkit
Python
444
star
14

cymem

💥 Cython memory pool for RAII-style memory management
Cython
436
star
15

srsly

🦉 Modern high-performance serialization utilities for Python (JSON, MessagePack, Pickle)
Python
422
star
16

displacy

💥 displaCy.js: An open-source NLP visualiser for the modern web
JavaScript
343
star
17

lightnet

🌓 Bringing pjreddie's DarkNet out of the shadows #yolo
C
319
star
18

prodigy-openai-recipes

✨ Bootstrap annotation with zero- & few-shot learning via OpenAI GPT-3
Python
318
star
19

spacy-notebooks

💫 Jupyter notebooks for spaCy examples and tutorials
Jupyter Notebook
285
star
20

spacy-services

💫 REST microservices for various spaCy-related tasks
Python
240
star
21

cython-blis

💥 Fast matrix-multiplication as a self-contained Python library – no system dependencies!
C
215
star
22

displacy-ent

💥 displaCy-ent.js: An open-source named entity visualiser for the modern web
CSS
197
star
23

jupyterlab-prodigy

🧬 A JupyterLab extension for annotating data with Prodigy
TypeScript
188
star
24

spacymoji

💙 Emoji handling and meta data for spaCy with custom extension attributes
Python
180
star
25

tokenizations

Robust and Fast tokenizations alignment library for Rust and Python https://tamuhey.github.io/tokenizations/
Rust
180
star
26

wheelwright

🎡 Automated build repo for Python wheels and source packages
Python
174
star
27

catalogue

Super lightweight function registries for your library
Python
171
star
28

confection

🍬 Confection: the sweetest config system for Python
Python
169
star
29

spacy-dev-resources

💫 Scripts, tools and resources for developing spaCy
Python
125
star
30

radicli

🕊️ Radically lightweight command-line interfaces
Python
100
star
31

spacy-lookups-data

📂 Additional lookup tables and data resources for spaCy
Python
98
star
32

spacy-experimental

🧪 Cutting-edge experimental spaCy components and features
Python
94
star
33

talks

💥 Browser-based slides or PDFs of our talks and presentations
JavaScript
94
star
34

thinc-apple-ops

🍏 Make Thinc faster on macOS by calling into Apple's native Accelerate library
Cython
90
star
35

healthsea

Healthsea is a spaCy pipeline for analyzing user reviews of supplementary products for their effects on health.
Python
87
star
36

preshed

💥 Cython hash tables that assume keys are pre-hashed
Cython
82
star
37

weasel

🦦 weasel: A small and easy workflow system
Python
62
star
38

spacy-huggingface-pipelines

💥 Use Hugging Face text and token classification pipelines directly in spaCy
Python
61
star
39

ml-datasets

🌊 Machine learning dataset loaders for testing and example scripts
Python
45
star
40

murmurhash

💥 Cython bindings for MurmurHash2
C++
44
star
41

assets

💥 Explosion Assets
43
star
42

spacy-huggingface-hub

🤗 Push your spaCy pipelines to the Hugging Face Hub
Python
42
star
43

wikid

Generate a SQLite database from Wikipedia & Wikidata dumps.
Python
30
star
44

vscode-prodigy

🧬 A VS Code extension for annotating data with Prodigy
TypeScript
30
star
45

spacy-alignments

💫 A spaCy package for Yohei Tamura's Rust tokenizations library
Python
26
star
46

spacy-vscode

spaCy extension for Visual Studio Code
Python
24
star
47

spacy-curated-transformers

spaCy entry points for Curated Transformers
Python
22
star
48

spacy-benchmarks

💫 Runtime performance comparison of spaCy against other NLP libraries
Python
20
star
49

prodigy-hf

Train huggingface models on top of Prodigy annotations
Python
19
star
50

prodigy-pdf

A Prodigy plugin for PDF annotation
Python
18
star
51

spacy-vectors-builder

🌸 Train floret vectors
Python
17
star
52

os-signpost

Wrapper for the macOS signpost API
Cython
12
star
53

spacy-loggers

📟 Logging utilities for spaCy
Python
12
star
54

prodigy-evaluate

🔎 A Prodigy plugin for evaluating spaCy pipelines
Python
12
star
55

prodigy-segment

Select pixels in Prodigy via Facebook's Segment-Anything model.
Python
11
star
56

curated-tokenizers

Lightweight piece tokenization library
Cython
11
star
57

conll-2012

A slightly cleaned up version of the scripts & data for the CoNLL 2012 Coreference task.
Python
10
star
58

thinc_gpu_ops

🔮 GPU kernels for Thinc
C++
9
star
59

prodigy-ann

A Prodigy pluging for ANN techniques
Python
4
star
60

prodigy-whisper

Audio transcription with OpenAI's whisper model in the loop.
Python
4
star
61

princetondh

Code for our presentation in Princeton DH 2023 April.
Jupyter Notebook
4
star
62

spacy-legacy

🕸️ Legacy architectures and other registered spaCy v3.x functions for backwards-compatibility
Python
4
star
63

ec2buildwheel

Python
2
star
64

aiGrunn-2023

Materials for the aiGrunn 2023 talk on spaCy Transformer pipelines
Python
1
star
65

spacy-io-binder

📒 Repository used to build Binder images for the interactive spaCy code examples
Jupyter Notebook
1
star
66

prodigy-lunr

A Prodigy plugin for document search via LUNR
Python
1
star
67

.github

:octocat: GitHub settings
1
star
68

span-labeling-datasets

Loaders for various span labeling datasets
Python
1
star
69

spacy-biaffine-parser

Python
1
star