BoxMOT
Pluggable SOTA multi-object tracking modules for segmentation, object detection, and pose estimation models.
Efficient and lightweight CNN architecture for mobile and edge devices.
MobileNetV3 is a series of convolutional neural network architectures designed for mobile and edge devices, focusing on maximizing accuracy while minimizing computational cost and model size. Based on the 'Searching for MobileNetV3' paper, these models leverage a combination of hardware-aware network architecture search (NAS) and network design. The architecture includes inverted residual blocks with linear bottlenecks and squeeze-and-excitation modules for efficient feature extraction. MobileNetV3 comes in two variants: Large and Small, catering to different resource constraints. These models are implemented within PyTorch's torchvision library, offering pre-trained weights for easy integration. They facilitate tasks such as image classification, object detection, and semantic segmentation, with optimized performance on resource-constrained devices. The builders instantiate a MobileNetV3 model from torchvision.models.mobilenetv3.MobileNetV3 base class.
MobileNetV3 is a series of convolutional neural network architectures designed for mobile and edge devices, focusing on maximizing accuracy while minimizing computational cost and model size.
Explore all tools that specialize in image classification. This domain focus ensures MobileNetV3 delivers optimized results for this specific requirement.
Explore all tools that specialize in object detection. This domain focus ensures MobileNetV3 delivers optimized results for this specific requirement.
Explore all tools that specialize in semantic segmentation. This domain focus ensures MobileNetV3 delivers optimized results for this specific requirement.
Open side-by-side comparison first, then move to deeper alternatives guidance.
Verified feedback from other users.
No reviews yet. Be the first to rate this tool.
Pluggable SOTA multi-object tracking modules for segmentation, object detection, and pose estimation models.

A simple, fast, and strong multi-object tracker that associates every detection box.

Labeled subsets of the 80 million tiny images dataset for machine learning research.

Real-time semantic segmentation for efficient scene understanding.

A large-scale street fashion dataset with polygon annotations for computer vision research.

A pure ConvNet model constructed entirely from standard ConvNet modules, designed for the 2020s.