• Stars
    star
    747
  • Rank 60,741 (Top 2 %)
  • Language
    Python
  • Created about 2 years ago
  • Updated 3 months ago

Reviews

There are no reviews yet. Be the first to send feedback to the community and the maintainers!

Repository Details

Data annotation toolbox supports image, audio and video data.

简体中文 | English

Introduction

LabelU is an open source data annotation tool that supports Chinese. At present, it has image annotation capabilities such as rectangle, polygon, point, line, classification, and caption. It can support detection, classification, segmentation, text transcription, Line detection, key point detection and other computer vision task scenarios. You can customize the annotation task by freely combining tools, and support COCO and MASK format data export.

Getting started

Install locally with miniconda

# Download and Install miniconda
# https://docs.conda.io/en/latest/miniconda.html

# Create virtual environment(python = 3.7)
conda create -n labelu python=3.7

# Activate virtual environment
conda activate labelu

# Install labelu
pip install labelu

# Start labelu, server: http://localhost:8000
labelu

Install for local development

# Download and Install miniconda
# https://docs.conda.io/en/latest/miniconda.html

# Create virtual environment(python = 3.7)
conda create -n labelu python=3.7

# Activate virtual environment
conda activate labelu

# Install peotry
# https://python-poetry.org/docs/#installing-with-the-official-installer

# Install all package dependencies
poetry install

# Start labelu, server: http://localhost:8000
uvicorn labelu.main:app --reload

# Update submodule
git submodule update --remote --merge

feature

  • Uniform, Six image annotation tools are provided, which can be configured through simple visualization or Yaml
  • Unlimited, Multiple tools can be freely combined to meet most image annotation requirements

  • Universal, Support multiple data export formats, including LabelU, COCO, Mask

Scenes

Computer Vision

  • Detection: Detection scenes for vehicles, license plates, pedestrians, faces, industrial parts, etc.
  • Classification: Detection of object classification, target characteristics, right and wrong judgments, and other classification scenarios
  • Semantic segmentation: Human body segmentation, panoramic segmentation, drivable area segmentation, vehicle segmentation, etc.
  • Text transcription: Text detection and recognition of license plates, invoices, insurance policies, signs, etc.
  • Contour detection: positioning line scenes such as human contour lines, lane lines, etc.
  • Key point detection: positioning scenes such as human face key points, vehicle key points, road edge key points, etc.

Usage

Annotation Format

Communication

Welcome to the Opendatalab Wechat group!

Links

LICENSE

This project is released under the Apache 2.0 license.

More Repositories

1

MinerU

A one-stop, open-source, high-quality data extraction tool, supports PDF/webpage/e-book extraction.一站式开源高质量数据提取工具,支持PDF/网页/多格式电子书提取。
Python
13,269
star
2

PDF-Extract-Kit

A Comprehensive Toolkit for High-Quality PDF Content Extraction
Python
5,020
star
3

WanJuan1.0

万卷1.0多模态语料
525
star
4

LabelLLM

The Open-Source Data Annotation Platform
TypeScript
384
star
5

magic-doc

Python
342
star
6

magic-html

Python
241
star
7

UniMERNet

UniMERNet: A Universal Network for Real-World Mathematical Expression Recognition
Python
175
star
8

VIGC

AAAI 2024: Visual Instruction Generation and Correction
Python
87
star
9

opendatalab-datasets

datasets resource
65
star
10

CLIP-Parrot-Bias

ECCV2024_Parrot Captions Teach CLIP to Spot Text
Python
60
star
11

opendatalab-python-sdk

SDK of OpenDataLab - https://opendatalab.org.cn
Python
56
star
12

labelU-Kit

Data annotation component library --provided as NPM packages
TypeScript
53
star
13

H2RSVLM

H2RSVLM: Towards Helpful and Honest Remote Sensing Large Vision Language Model
43
star
14

dsdl-docs

Data Set Description Language Specification (新一代人工智能数据集描述语言DSDL)
HTML
43
star
15

MLLM-DataEngine

MLLM-DataEngine: An Iterative Refinement Approach for MLLM
Python
34
star
16

image-downloader

Python
24
star
17

dsdl-sdk

Jupyter Notebook
13
star
18

labelU-frontend

LabelU front-end library
TypeScript
7
star
19

allz

A universal command line tool for compression and decompression
Python
4
star
20

laion5b-downloader

Python
3
star
21

HA-DPO

Beyond Hallucinations: Enhancing LVLMs through Hallucination-Aware Direct Preference Optimization
Python
2
star
22

MLS-BRN

[CVPR 2024] 3D Building Reconstruction from Monocular Remote Sensing Images with Multi-level Supervisions
1
star
23

Miner-PDF-Benchmark

MPB (Miner-PDF-Benchmark) is an end-to-end PDF document comprehension evaluation suite designed for large-scale model data scenarios.
Python
1
star
24

labelU-ML

Python
1
star
25

s3_browser

基于Streamlit开发,可在线查看S3存储内容的工具。
Python
1
star