• Stars
    star
    176
  • Rank 216,987 (Top 5 %)
  • Language
    Python
  • License
    Apache License 2.0
  • Created almost 5 years ago
  • Updated over 1 year ago

Reviews

There are no reviews yet. Be the first to send feedback to the community and the maintainers!

Repository Details

Label data using HuggingFace's transformers and automatically get a prediction service

Label Studio for Hugging Face's Transformers

Website • Docs • Twitter • Join Slack Community


Transfer learning for NLP models by annotating your textual data without any additional coding.

This package provides a ready-to-use container that links together:


Quick Usage

Install Label Studio and other dependencies

pip install -r requirements.txt
Create ML backend with BERT classifier
label-studio-ml init my-ml-backend --script models/bert_classifier.py
cp models/utils.py my-ml-backend/utils.py

# Start ML backend at http://localhost:9090
label-studio-ml start my-ml-backend

# Start Label Studio in the new terminal with the same python environment
label-studio start
  1. Create a project with Choices and Text tags in the labeling config.
  2. Connect the ML backend in the Project settings with http://localhost:9090
Create ML backend with BERT named entity recognizer
label-studio-ml init my-ml-backend --script models/ner.py
cp models/utils.py my-ml-backend/utils.py

# Start ML backend at http://localhost:9090
label-studio-ml start my-ml-backend

# Start Label Studio in the new terminal with the same python environment
label-studio start
  1. Create a project with Labels and Text tags in the labeling config.
  2. Connect the ML backend in the Project settings with http://localhost:9090

Training and inference

The browser opens at http://localhost:8080. Upload your data on Import page then annotate by selecting Labeling page. Once you've annotate sufficient amount of data, go to Model page and press Start Training button. Once training is finished, model automatically starts serving for inference from Label Studio, and you'll find all model checkpoints inside my-ml-backend/<ml-backend-id>/ directory.

Click here to read more about how to use Machine Learning backend and build Human-in-the-Loop pipelines with Label Studio

License

This software is licensed under the Apache 2.0 LICENSE © Heartex. 2020

More Repositories

1

labelImg

LabelImg is now part of the Label Studio community. The popular image annotation tool created by Tzutalin is no longer actively being developed, but you can check out Label Studio, the open source data labeling tool for images, text, hypertext, audio, video and time-series data.
Python
20,885
star
2

label-studio

Label Studio is a multi-type data labeling and annotation tool with standardized output format
JavaScript
16,524
star
3

awesome-data-labeling

A curated list of awesome data labeling tools
3,470
star
4

label-studio-frontend

Data labeling react app that is backend agnostic and can be embedded into your applications — distributed as an NPM package
JavaScript
318
star
5

label-studio-ml-backend

Configs and boilerplates for Label Studio's Machine Learning backend
Python
263
star
6

label-studio-converter

Tools for converting Label Studio annotations into common dataset formats
Python
253
star
7

RLHF

Collection of links, tutorials and best practices of how to collect the data and build end-to-end RLHF system to finetune Generative AI models
Jupyter Notebook
62
star
8

label-studio-sdk

Label Studio SDK
Python
51
star
9

dm2

Full-fledged Data Exploration Tool for Label Studio
JavaScript
35
star
10

pyheartex

Heartex Python SDK - Connect your own models to Heartex Data Labeling
Python
28
star
11

brand-sentiment-analysis

Scripts utilizing Heartex platform to build brand sentiment analysis from the news
CSS
22
star
12

label-studio-evalme

Evaluation metrics package
Python
7
star
13

label-studio-terraform

HCL
5
star
14

label-studio-examples

Example Code to Supplement the Label Studio Blog
Python
5
star
15

label-studio-tools

Python
4
star
16

text-classifier

Tensorflow-based text classifier that could be integrated with Heartex/Label Studio
Python
4
star
17

awesome-human-in-the-loop

Awesome List of Human in the Loop resources and references for retraining models.
4
star
18

smartfew

SmartFew is your swiss knife for semi-supervised structuring of unlabeled data using Few Shot Learning.
Python
4
star
19

charts

3
star
20

heartexlabs.github.io

Label Studio website with the documentation
HTML
2
star
21

awesome-active-learning

A curated list of awesome active learning related topics
2
star
22

label-studio-addon-dicom

DICOM format annotation and labeling support for Label Studio
2
star
23

articles

Materials we publish on Medium and other resources about labeling, machine learning, active learning, etc
1
star