Sourcify
Effortlessly find and manage open-source dependencies for your projects.

The industry-standard open-source platform for professional data labeling and computer vision management.

Computer Vision Annotation Tool (CVAT) is a web-based platform for professional data annotation for computer vision models. Originally developed by Intel and now maintained by CVAT.ai, it has grown into a broader data management suite as of 2026, supporting 2D images, video with keyframe interpolation, and 3D point cloud (LiDAR) data. Its architecture pairs a Django backend with a React frontend and is optimized for high-throughput labeling tasks. CVAT integrates with automated annotation models such as Segment Anything (SAM) and YOLO through Nuclio, so teams can use AI-assisted pre-labeling, which can reduce manual effort by up to 80% in high-density scenarios.

In the 2026 market, CVAT positions itself between open-source flexibility and enterprise-grade SaaS reliability, with deployment options ranging from local Docker containers to fully managed cloud environments. It is used in MLOps pipelines for industries from autonomous driving to precision agriculture, and it provides granular quality control, role-based access, and versioning capabilities.
3D cuboid labeling
Image annotation
Semantic segmentation
Video frame interpolation
Integration with Segment Anything (SAM) and custom models via Nuclio serverless framework.
Support for LiDAR data (PCD, BIN) with specialized viewports for Top/Side/Front perspectives.
Linear and non-linear interpolation of object positions between keyframes in video.
Directly mount S3, Azure, or GCS buckets as data sources without moving files to CVAT servers.
Configurable keypoint structures with parent-child relationships for human pose estimation.
Full programmatic access to UI functions through the CVAT REST API and the CVAT Python SDK (cvat-sdk).
Specific user roles for 'Annotator' and 'Reviewer' with status tracking for every 'Job'.
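
To illustrate the keyframe interpolation feature listed above, here is a minimal sketch of linear interpolation of a bounding box between two keyframes. It is not CVAT's implementation: the `Keyframe` structure and the `interpolate_box` function are hypothetical names chosen for this example, and the corner field names (`xtl`, `ytl`, `xbr`, `ybr`) are only an assumed convention for top-left and bottom-right coordinates.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Keyframe:
    """A hand-annotated box on one video frame (hypothetical structure)."""
    frame: int
    xtl: float  # x of top-left corner
    ytl: float  # y of top-left corner
    xbr: float  # x of bottom-right corner
    ybr: float  # y of bottom-right corner


def interpolate_box(start: Keyframe, end: Keyframe, frame: int) -> Keyframe:
    """Linearly interpolate the box corners for a frame between two keyframes."""
    if not start.frame <= frame <= end.frame:
        raise ValueError("frame must lie between the two keyframes")
    if start.frame == end.frame:
        return start

    # Fraction of the way from the first keyframe to the second.
    t = (frame - start.frame) / (end.frame - start.frame)

    def lerp(a: float, b: float) -> float:
        return a + t * (b - a)

    return Keyframe(
        frame=frame,
        xtl=lerp(start.xtl, end.xtl),
        ytl=lerp(start.ytl, end.ytl),
        xbr=lerp(start.xbr, end.xbr),
        ybr=lerp(start.ybr, end.ybr),
    )


# The annotator draws boxes on frames 0 and 10; frame 5 is filled in.
first = Keyframe(frame=0, xtl=10.0, ytl=20.0, xbr=50.0, ybr=60.0)
last = Keyframe(frame=10, xtl=30.0, ytl=20.0, xbr=70.0, ybr=80.0)
halfway = interpolate_box(first, last, 5)
print(halfway)
```

The annotator only draws the boxes on the keyframes; every frame in between is computed, which is where the savings in video labeling come from.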
Create an account on app.cvat.ai or pull the Docker image for self-hosting.
Initialize a 'Project' to define high-level labeling parameters and labels.
Define 'Labels' and 'Attributes' using the visual constructor or JSON raw configuration.
Create a 'Task' and upload source files (images, videos, or point cloud files).
Configure 'Jobs' by splitting data into subsets for different annotators.
Connect external cloud storage (S3, Azure Blob, GCS) if datasets are too large for direct upload.
Deploy serverless functions via Nuclio for AI-assisted automatic annotation.
Execute annotation using tools like Polygon, Brush, or 3D Cuboids with frame interpolation.
Utilize the 'Review' workflow to accept or reject annotations and provide feedback.
Export the dataset in one of the 20+ supported industry-standard formats.
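
Two of the steps above, defining labels as raw JSON and splitting a task into jobs, can be sketched in plain Python. This is an illustration under assumptions, not CVAT's code: the attribute field names (`input_type`, `mutable`, `default_value`, `values`) are modeled on the raw label configuration and should be checked against the deployed version, and `build_labels` and `split_into_jobs` are hypothetical helpers that ignore any overlap between jobs.

```python
import json


def build_labels() -> list:
    """Return a label specification as a list of dicts (assumed schema)."""
    return [
        {
            "name": "car",
            "attributes": [
                {
                    "name": "occluded",
                    "input_type": "checkbox",
                    "mutable": True,
                    "default_value": "false",
                    "values": ["false"],
                }
            ],
        },
        {"name": "pedestrian", "attributes": []},
    ]


def split_into_jobs(total_frames: int, segment_size: int) -> list:
    """Split frames 0..total_frames-1 into consecutive (first, last) ranges."""
    if total_frames < 0 or segment_size <= 0:
        raise ValueError("total_frames must be >= 0 and segment_size > 0")
    return [
        (start, min(start + segment_size, total_frames) - 1)
        for start in range(0, total_frames, segment_size)
    ]


# JSON text that could be pasted into a raw label editor.
labels_json = json.dumps(build_labels(), indent=2)
print(labels_json)

# A 250-frame task split into jobs of at most 100 frames each.
jobs = split_into_jobs(total_frames=250, segment_size=100)
print(jobs)
```

Each resulting range would go to one annotator as a job, and the last job simply takes whatever frames remain.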
Verified feedback from other users.
"Highly regarded for its robust feature set and open-source flexibility, though users frequently note a steep learning curve for advanced configurations."