  • Stars: 3,470
  • Rank 12,842 (Top 0.3 %)
  • Language
  • Created over 5 years ago
  • Updated about 1 year ago

Repository Details

awesome-data-labeling

A curated list of awesome data labeling tools

Images

  • labelImg - LabelImg is a graphical image annotation tool for labeling object bounding boxes in images
  • CVAT - Powerful and efficient Computer Vision Annotation Tool
  • labelme - Image Polygonal Annotation with Python
  • VoTT - An open source annotation and labeling tool for image and video assets
  • imglab - A web based tool to label images for objects that can be used to train dlib or other object detectors
  • Yolo_mark - GUI for marking bounding boxes of objects in images for training the Yolo v3 and v2 neural networks
  • PixelAnnotationTool - Software that allows you to manually and quickly annotate images in directories
  • OpenLabeling - Label images and video for Computer Vision applications
  • imagetagger - An open source online platform for collaborative image labeling
  • Alturos.ImageAnnotation - A collaborative tool for labeling image data
  • deeplabel - A cross-platform image annotation tool for machine learning
  • MedTagger - A collaborative framework for annotating medical datasets using crowdsourcing.
  • Labelbox - Labelbox is the fastest way to annotate data to build and ship computer vision applications
  • turktool - A modern React app for scalable bounding box annotation of images
  • Pixie - Pixie is a GUI annotation tool which provides the bounding box, polygon, free drawing and semantic segmentation object labelling
  • OpenLabeler - OpenLabeler is an open source desktop application for annotating objects for AI applications
  • Anno-Mage - A semi-automatic image annotation tool that helps you annotate images by suggesting annotations for 80 object classes using a pre-trained model
  • CATMAID - Collaborative Annotation Toolkit for Massive Amounts of Image Data
  • make-sense - makesense.ai is a free to use online tool for labelling photos
  • LOST - Design your own smart Image Annotation process in a web-based environment
  • Annotorious - A JavaScript library for image annotation.
  • Sloth - Tool for labeling image and video data for computer vision research.

Text

  • YEDDA - A Lightweight Collaborative Text Span Annotation Tool (Chunking, NER, etc.). ACL best demo nomination.
  • ML-Annotate - Label text data for machine learning purposes. ML-Annotate supports binary, multi-label and multi-class labeling.
  • TagEditor - Annotation tool for spaCy
  • SMART - Smarter Manual Annotation for Resource-constrained collection of Training data
  • PIAF - A Question-Answering annotation tool

Audio

  • EchoML - Play, visualize, and annotate your audio files
  • audio-annotator - A JavaScript interface for annotating and labeling audio files.
  • audio-labeler - An in-browser app for labeling audio clips at random, using Docker and Flask.
  • wavesurfer.js - Simple annotations tool, check the example.
  • peaks.js - Browser-based audio waveform visualisation and UI component for interacting with audio waveforms, developed by the BBC.
  • Praat - Doing Phonetics By Computer
  • Aubio - Tool designed for the extraction of annotations from audio signals.

Video

  • UltimateLabeling - A multi-purpose Video Labeling GUI in Python with integrated SOTA detector and tracker
  • VATIC - VATIC is an online video annotation tool for computer vision research that crowdsources work to Amazon's Mechanical Turk.

Time Series

  • Curve - Curve is an open-source tool to help label anomalies on time-series data
  • TagAnomaly - Anomaly detection analysis and labeling tool, specifically for multiple time series (one time series per category)
  • time-series-annotator - The CrowdCurio Time Series Annotation Library implements classification tasks for time series.
  • WDK - The Wearables Development Toolkit (WDK) is a set of tools to facilitate the development of activity recognition applications with wearable devices.

3D

  • webKnossos - webKnossos is an open-source web-based tool for visualizing, annotating, and sharing large 3D image datasets. It features fast 3D data browsing, skeleton (line-segment) annotations, segmentation and proof-reading tools, mesh visualization, and collaboration features. The public instance webknossos.org hosts a collection of published datasets and can be used without a local setup.
  • KNOSSOS - KNOSSOS is a software tool for the visualization and annotation of 3D image data and was developed for the rapid reconstruction of neural morphology and connectivity.

Lidar

MultiDomain

  • Label Studio - Label Studio is a configurable data annotation tool that works with different data types
  • Dataturks - Dataturks supports end-to-end tagging of data items such as video, images (classification, segmentation and labelling) and text (full-length document annotation for PDF, Doc, Text, etc.) for ML projects.

More Repositories

  • labelImg (Python, 20,885 stars) - LabelImg is now part of the Label Studio community. The popular image annotation tool created by Tzutalin is no longer actively being developed, but you can check out Label Studio, the open source data labeling tool for images, text, hypertext, audio, video and time-series data.
  • label-studio (JavaScript, 16,524 stars) - Label Studio is a multi-type data labeling and annotation tool with standardized output format
  • label-studio-frontend (JavaScript, 318 stars) - Data labeling React app that is backend agnostic and can be embedded into your applications, distributed as an NPM package
  • label-studio-ml-backend (Python, 263 stars) - Configs and boilerplates for Label Studio's Machine Learning backend
  • label-studio-converter (Python, 253 stars) - Tools for converting Label Studio annotations into common dataset formats
  • label-studio-transformers (Python, 176 stars) - Label data using HuggingFace's transformers and automatically get a prediction service
  • RLHF (Jupyter Notebook, 62 stars) - Collection of links, tutorials and best practices on how to collect data and build an end-to-end RLHF system to fine-tune generative AI models
  • label-studio-sdk (Python, 51 stars) - Label Studio SDK
  • dm2 (JavaScript, 35 stars) - Full-fledged Data Exploration Tool for Label Studio
  • pyheartex (Python, 28 stars) - Heartex Python SDK - Connect your own models to Heartex Data Labeling
  • brand-sentiment-analysis (CSS, 22 stars) - Scripts utilizing Heartex platform to build brand sentiment analysis from the news
  • label-studio-evalme (Python, 7 stars) - Evaluation metrics package
  • label-studio-terraform (HCL, 5 stars)
  • label-studio-examples (Python, 5 stars) - Example Code to Supplement the Label Studio Blog
  • label-studio-tools (Python, 4 stars)
  • text-classifier (Python, 4 stars) - Tensorflow-based text classifier that could be integrated with Heartex/Label Studio
  • awesome-human-in-the-loop (4 stars) - Awesome List of Human in the Loop resources and references for retraining models.
  • smartfew (Python, 4 stars) - SmartFew is your Swiss Army knife for semi-supervised structuring of unlabeled data using Few Shot Learning.
  • charts (3 stars)
  • heartexlabs.github.io (HTML, 2 stars) - Label Studio website with the documentation
  • awesome-active-learning (2 stars) - A curated list of awesome active learning related topics
  • label-studio-addon-dicom (2 stars) - DICOM format annotation and labeling support for Label Studio
  • articles (1 star) - Materials we publish on Medium and other resources about labeling, machine learning, active learning, etc.