Label Studio
Label Studio
Data annotation and labeling platform for building high-quality ML training datasets.
Overview
Label Studio is a multi-type data annotation platform for creating labeled datasets that train and evaluate machine learning models. Annotate text, images, audio, video, and time series — all in one tool, with ML-assisted pre-labeling to speed up the process. Projects, tasks, and labels are all managed through a clean web interface, and export to standard ML dataset formats.
Key Features
- Multi-Type Annotation — Text, images, audio, video, HTML, and time series in one platform
- Labeling Templates — Pre-built templates for classification, NER, object detection, sentiment, and more
- ML-Assisted Pre-labeling — Connect a model backend to auto-label tasks for human review
- Project Management — Organize annotation work into projects with task queues
- Team Collaboration — Multiple annotators work on the same project with agreement tracking
- Export Formats — Export to JSON, CSV, COCO, YOLO, Pascal VOC, and more
Getting Started
- From the Hub, click Label Studio to launch
- Click Create Project and give it a name
- Upload your data (images, text files, audio, etc.) or connect a cloud storage source
- Choose a labeling template or configure your own label schema
- Start annotating — click through tasks and apply labels
- Export your completed annotations when ready
Labeling Task Types
Text
- Named entity recognition (NER) — highlight and tag spans of text
- Text classification — assign categories to documents or sentences
- Relation extraction — draw links between entities
Images
- Bounding box detection — draw rectangles around objects
- Polygon segmentation — precise object outlines
- Keypoint annotation — mark specific points (e.g., body pose)
- Image classification — label the whole image
Audio
- Audio classification — label audio clips
- Transcription — type what you hear
- Speaker diarization — segment by speaker
Video
- Video classification
- Timeline segmentation
Labeling Templates
Label Studio includes templates for common ML tasks. When creating a project, choose from:
- Object Detection (YOLO/COCO format)
- Image Segmentation
- Text Classification (single/multi-label)
- Named Entity Recognition
- Sentiment Analysis
- Audio Transcription
- Time Series Annotation
Or write your own labeling config in XML to define any annotation interface.
ML-Assisted Pre-labeling
Connect an ML backend to automatically pre-label tasks:
- Go to Settings → Machine Learning in your project
- Add your model backend URL
- Enable Auto-Annotation
- Label Studio calls your model for each new task and populates predictions
- Annotators review and correct predictions instead of labeling from scratch
Exporting Annotations
Export from the project Export tab:
| Format | Use Case |
|---|---|
| JSON | General purpose, full annotation data |
| CSV | Tabular data, classification tasks |
| COCO | Object detection (bounding boxes, segmentation) |
| YOLO | Object detection (YOLO format) |
| Pascal VOC | Object detection (XML format) |
When to Use Label Studio
| Task | Tool |
|---|---|
| Creating training datasets for ML models | Label Studio |
| Annotating images for object detection | Label Studio |
| NER annotation for NLP models | Label Studio |
| Running ML experiments with labeled data | AI Notebook Lab + MLflow |
| Tracking model performance across versions | MLflow |