Label Studio

Data annotation and labeling platform for building high-quality ML training datasets.

Overview

Label Studio is a multi-type data annotation platform for creating labeled datasets that train and evaluate machine learning models. Annotate text, images, audio, video, and time series — all in one tool, with ML-assisted pre-labeling to speed up the process. Projects, tasks, and labels are all managed through a clean web interface, and export to standard ML dataset formats.

Key Features

Multi-Type Annotation — Text, images, audio, video, HTML, and time series in one platform
Labeling Templates — Pre-built templates for classification, NER, object detection, sentiment, and more
ML-Assisted Pre-labeling — Connect a model backend to auto-label tasks for human review
Project Management — Organize annotation work into projects with task queues
Team Collaboration — Multiple annotators work on the same project with agreement tracking
Export Formats — Export to JSON, CSV, COCO, YOLO, Pascal VOC, and more

Getting Started

From the Hub, click Label Studio to launch
Click Create Project and give it a name
Upload your data (images, text files, audio, etc.) or connect a cloud storage source
Choose a labeling template or configure your own label schema
Start annotating — click through tasks and apply labels
Export your completed annotations when ready

Labeling Task Types

Text

Named entity recognition (NER) — highlight and tag spans of text
Text classification — assign categories to documents or sentences
Relation extraction — draw links between entities

Images

Bounding box detection — draw rectangles around objects
Polygon segmentation — precise object outlines
Keypoint annotation — mark specific points (e.g., body pose)
Image classification — label the whole image

Audio

Audio classification — label audio clips
Transcription — type what you hear
Speaker diarization — segment by speaker

Video

Video classification
Timeline segmentation

Labeling Templates

Label Studio includes templates for common ML tasks. When creating a project, choose from:

Object Detection (YOLO/COCO format)
Image Segmentation
Text Classification (single/multi-label)
Named Entity Recognition
Sentiment Analysis
Audio Transcription
Time Series Annotation

Or write your own labeling config in XML to define any annotation interface.

ML-Assisted Pre-labeling

Connect an ML backend to automatically pre-label tasks:

Go to Settings → Machine Learning in your project
Add your model backend URL
Enable Auto-Annotation
Label Studio calls your model for each new task and populates predictions
Annotators review and correct predictions instead of labeling from scratch

Exporting Annotations

Export from the project Export tab:

Format	Use Case
JSON	General purpose, full annotation data
CSV	Tabular data, classification tasks
COCO	Object detection (bounding boxes, segmentation)
YOLO	Object detection (YOLO format)
Pascal VOC	Object detection (XML format)

When to Use Label Studio

Task	Tool
Creating training datasets for ML models	Label Studio
Annotating images for object detection	Label Studio
NER annotation for NLP models	Label Studio
Running ML experiments with labeled data	AI Notebook Lab + MLflow
Tracking model performance across versions	MLflow