BoT-SORT
Robust Associations Multi-Pedestrian Tracking using motion and appearance information with camera-motion compensation.

A simple, fast, and strong multi-object tracker that associates every detection box.
ByteTrack is a multi-object tracking (MOT) method that associates every detection box, including those with low scores, to recover true objects and filter out background detections. This approach addresses the problem of missing objects and fragmented trajectories caused by discarding low-score detection boxes in traditional MOT methods. ByteTrack demonstrates significant improvements in IDF1 scores when applied to various state-of-the-art trackers and achieves high MOTA, IDF1, and HOTA scores on the MOT17 test set. Implemented using YOLOX for detection, it provides demo links for Google Colab and Huggingface Spaces. The tracker can be installed and used on a host machine or via Docker, with detailed instructions provided for data preparation and model training, supporting datasets like MOT17, MOT20, CrowdHuman, Cityperson, and ETHZ.
ByteTrack is a multi-object tracking (MOT) method that associates every detection box, including those with low scores, to recover true objects and filter out background detections.
Explore all tools that specialize in multi-object tracking. This domain focus ensures ByteTrack delivers optimized results for this specific requirement.
Explore all tools that specialize in object detection. This domain focus ensures ByteTrack delivers optimized results for this specific requirement.
Explore all tools that specialize in trajectory analysis. This domain focus ensures ByteTrack delivers optimized results for this specific requirement.
Open side-by-side comparison first, then move to deeper alternatives guidance.
Verified feedback from other users.
No reviews yet. Be the first to rate this tool.
Robust Associations Multi-Pedestrian Tracking using motion and appearance information with camera-motion compensation.
Pluggable SOTA multi-object tracking modules for segmentation, object detection, and pose estimation models.

A large-scale street fashion dataset with polygon annotations for computer vision research.

A pure ConvNet model constructed entirely from standard ConvNet modules, designed for the 2020s.

A suite of libraries, tools, and APIs for applying AI and ML techniques across multiple platforms and modalities.
Integrate powerful vision detection features into applications for image analysis, document understanding, and video intelligence.