  • Stars
    53
  • Rank 534,782 (Top 11 %)
  • Language
    Python
  • License
    MIT License
  • Created almost 4 years ago
  • Updated 10 months ago

Repository Details

☄️ Parallel and distributed training with spaCy and Ray

More Repositories

1

spaCy

💫 Industrial-strength Natural Language Processing (NLP) in Python
Python
28,700
star
2

thinc

🔮 A refreshing functional take on deep learning, compatible with your favorite libraries
Python
2,777
star
3

spacy-course

👩‍🏫 Advanced NLP with spaCy: A free online course
Python
2,268
star
4

sense2vec

🦆 Contextually-keyed word vectors
Python
1,595
star
5

spacy-models

💫 Models for the spaCy Natural Language Processing (NLP) library
Python
1,516
star
6

spacy-transformers

🛸 Use pretrained transformers like BERT, XLNet and GPT-2 in spaCy
Python
1,320
star
7

projects

🪐 End-to-end NLP workflows from prototype to production
Python
1,249
star
8

spacy-llm

🦙 Integrating LLMs into structured NLP pipelines
Python
950
star
9

curated-transformers

🤖 A PyTorch library of curated Transformer models and their composable components
Python
837
star
10

spacy-streamlit

👑 spaCy building blocks and visualizers for Streamlit apps
Python
765
star
11

spacy-stanza

💥 Use the latest Stanza (StanfordNLP) research models directly in spaCy
Python
715
star
12

prodigy-recipes

🍳 Recipes for Prodigy, our fully scriptable annotation tool
Jupyter Notebook
464
star
13

wasabi

🍣 A lightweight console printing and formatting toolkit
Python
438
star
14

cymem

💥 Cython memory pool for RAII-style memory management
Cython
434
star
15

srsly

🦉 Modern high-performance serialization utilities for Python (JSON, MessagePack, Pickle)
Python
414
star
16

displacy

💥 displaCy.js: An open-source NLP visualiser for the modern web
JavaScript
344
star
17

lightnet

🌓 Bringing pjreddie's DarkNet out of the shadows #yolo
C
319
star
18

prodigy-openai-recipes

✨ Bootstrap annotation with zero- & few-shot learning via OpenAI GPT-3
Python
315
star
19

spacy-notebooks

💫 Jupyter notebooks for spaCy examples and tutorials
Jupyter Notebook
284
star
20

spacy-services

💫 REST microservices for various spaCy-related tasks
Python
239
star
21

cython-blis

💥 Fast matrix-multiplication as a self-contained Python library – no system dependencies!
C
209
star
22

displacy-ent

💥 displaCy-ent.js: An open-source named entity visualiser for the modern web
CSS
196
star
23

jupyterlab-prodigy

🧬 A JupyterLab extension for annotating data with Prodigy
TypeScript
187
star
24

tokenizations

Robust and Fast tokenizations alignment library for Rust and Python https://tamuhey.github.io/tokenizations/
Rust
179
star
25

spacymoji

💙 Emoji handling and meta data for spaCy with custom extension attributes
Python
177
star
26

wheelwright

🎡 Automated build repo for Python wheels and source packages
Python
173
star
27

catalogue

Super lightweight function registries for your library
Python
170
star
28

confection

🍬 Confection: the sweetest config system for Python
Python
165
star
29

spacy-dev-resources

💫 Scripts, tools and resources for developing spaCy
Python
125
star
30

radicli

🕊️ Radically lightweight command-line interfaces
Python
96
star
31

spacy-experimental

🧪 Cutting-edge experimental spaCy components and features
Python
93
star
32

spacy-lookups-data

📂 Additional lookup tables and data resources for spaCy
Python
93
star
33

talks

💥 Browser-based slides or PDFs of our talks and presentations
JavaScript
90
star
34

thinc-apple-ops

🍏 Make Thinc faster on macOS by calling into Apple's native Accelerate library
Cython
89
star
35

healthsea

Healthsea is a spaCy pipeline for analyzing user reviews of supplementary products for their effects on health.
Python
84
star
36

preshed

💥 Cython hash tables that assume keys are pre-hashed
Cython
78
star
37

spacy-huggingface-pipelines

💥 Use Hugging Face text and token classification pipelines directly in spaCy
Python
57
star
38

ml-datasets

🌊 Machine learning dataset loaders for testing and example scripts
Python
45
star
39

assets

💥 Explosion Assets
43
star
40

murmurhash

💥 Cython bindings for MurmurHash2
C++
42
star
41

weasel

🦦 weasel: A small and easy workflow system
Python
41
star
42

spacy-huggingface-hub

🤗 Push your spaCy pipelines to the Hugging Face Hub
Python
39
star
43

vscode-prodigy

🧬 A VS Code extension for annotating data with Prodigy
TypeScript
29
star
44

wikid

Generate a SQLite database from Wikipedia & Wikidata dumps.
Python
26
star
45

spacy-alignments

💫 A spaCy package for Yohei Tamura's Rust tokenizations library
Python
26
star
46

spacy-vscode

spaCy extension for Visual Studio Code
Python
23
star
47

spacy-benchmarks

💫 Runtime performance comparison of spaCy against other NLP libraries
Python
20
star
48

spacy-curated-transformers

spaCy entry points for Curated Transformers
Python
19
star
49

prodigy-hf

Train Hugging Face models on top of Prodigy annotations
Python
17
star
50

spacy-vectors-builder

🌸 Train floret vectors
Python
15
star
51

os-signpost

Wrapper for the macOS signpost API
Cython
11
star
52

prodigy-pdf

A Prodigy plugin for PDF annotation
Python
11
star
53

spacy-loggers

📟 Logging utilities for spaCy
Python
11
star
54

prodigy-evaluate

🔎 A Prodigy plugin for evaluating spaCy pipelines
Python
11
star
55

prodigy-segment

Select pixels in Prodigy via Facebook's Segment-Anything model.
Python
10
star
56

curated-tokenizers

Lightweight piece tokenization library
Cython
10
star
57

conll-2012

A slightly cleaned up version of the scripts & data for the CoNLL 2012 Coreference task.
Python
10
star
58

thinc_gpu_ops

🔮 GPU kernels for Thinc
C++
9
star
59

princetondh

Code for our presentation at Princeton DH, April 2023.
Jupyter Notebook
4
star
60

spacy-legacy

🕸️ Legacy architectures and other registered spaCy v3.x functions for backwards-compatibility
Python
4
star
61

prodigy-ann

A Prodigy plugin for ANN techniques
Python
3
star
62

prodigy-whisper

Audio transcription with OpenAI's whisper model in the loop.
Python
3
star
63

ec2buildwheel

Python
2
star
64

aiGrunn-2023

Materials for the aiGrunn 2023 talk on spaCy Transformer pipelines
Python
1
star
65

spacy-io-binder

📒 Repository used to build Binder images for the interactive spaCy code examples
Jupyter Notebook
1
star
66

prodigy-lunr

A Prodigy plugin for document search via LUNR
Python
1
star
67

.github

:octocat: GitHub settings
1
star
68

span-labeling-datasets

Loaders for various span labeling datasets
Python
1
star