BoxMOT
Pluggable SOTA multi-object tracking modules for segmentation, object detection, and pose estimation models.
Robust Associations Multi-Pedestrian Tracking using motion and appearance information with camera-motion compensation.
BoT-SORT is a state-of-the-art multi-object tracker designed for detecting and tracking objects in a scene, maintaining unique identifiers for each. It combines motion and appearance information with camera-motion compensation and an accurate Kalman filter. BoT-SORT and BoT-SORT-ReID trackers excel in MOTChallenge datasets, achieving top ranks in MOTA, IDF1, and HOTA metrics. It leverages YOLOX and YOLOv7 for object detection and supports multi-class tracking. The architecture facilitates camera motion compensation using OpenCV's VideoStab Global Motion Estimation. Installation involves setting up a Conda environment, installing PyTorch, and using pip to install necessary packages and dependencies like ByteTrack and FastReID.
BoT-SORT is a state-of-the-art multi-object tracker designed for detecting and tracking objects in a scene, maintaining unique identifiers for each.
Explore all tools that specialize in multi-object tracking. This domain focus ensures BoT-SORT delivers optimized results for this specific requirement.
Explore all tools that specialize in pedestrian tracking. This domain focus ensures BoT-SORT delivers optimized results for this specific requirement.
Explore all tools that specialize in object detection. This domain focus ensures BoT-SORT delivers optimized results for this specific requirement.
Explore all tools that specialize in camera motion compensation. This domain focus ensures BoT-SORT delivers optimized results for this specific requirement.
Open side-by-side comparison first, then move to deeper alternatives guidance.
Verified feedback from other users.
No reviews yet. Be the first to rate this tool.
Pluggable SOTA multi-object tracking modules for segmentation, object detection, and pose estimation models.

A simple, fast, and strong multi-object tracker that associates every detection box.

A large-scale street fashion dataset with polygon annotations for computer vision research.

A pure ConvNet model constructed entirely from standard ConvNet modules, designed for the 2020s.

A suite of libraries, tools, and APIs for applying AI and ML techniques across multiple platforms and modalities.
Integrate powerful vision detection features into applications for image analysis, document understanding, and video intelligence.