Sourcify
Effortlessly find and manage open-source dependencies for your projects.

The industry-standard open-source object detection toolbox for academic research and industrial deployment.

MMDetection is a part of the OpenMMLab project and stands as the most comprehensive open-source object detection toolbox built on PyTorch. As of 2026, it has matured into a hyper-modular architecture leveraging MMEngine and MMCV, allowing researchers and engineers to decompose complex detection pipelines into individual components: backbones, necks, dense heads, and ROI heads. Its technical excellence lies in its implementation of over 300+ algorithms and its support for a wide variety of tasks including 2D/3D object detection, instance segmentation, and panoptic segmentation. The framework's design philosophy facilitates rapid prototyping and benchmark reproducibility, which has made it the de facto choice for COCO and Cityscapes competition entries. For 2026 enterprise applications, MMDetection integrates seamlessly with MMDeploy for cross-platform model export (TensorRT, ONNX, OpenVINO) and supports advanced training techniques such as mixed-precision training, multi-node distributed training, and automated hyper-parameter tuning via its robust configuration system.
MMDetection is a part of the OpenMMLab project and stands as the most comprehensive open-source object detection toolbox built on PyTorch.
Explore all tools that specialize in train deep learning models. This domain focus ensures MMDetection delivers optimized results for this specific requirement.
Explore all tools that specialize in region proposal. This domain focus ensures MMDetection delivers optimized results for this specific requirement.
Uses a hierarchical inheritance config system that allows users to override specific components of a model without rewriting the entire pipeline.
Seamless integration with MMDet3D for point cloud and multi-view 3D detection.
Native support for torch.cuda.amp to accelerate training on Tensor Core GPUs while reducing VRAM usage.
Highly optimized Feature Pyramid Networks (FPN) and Path Aggregation Networks (PAN) for multi-scale object detection.
Dedicated deployment toolkit for converting PyTorch models to ONNX, TensorRT, and ncnn.
Includes state-of-the-art implementations like DINO, Mask2Former, and Grounding DINO.
A foundational library for training deep learning models that provides a unified distributed execution environment.
Install Python 3.8+ and PyTorch 2.0+ in a clean virtual environment.
Install MMEngine and MMCV using openmim (pip install -U openmim; mim install mmcv).
Clone the MMDetection repository from GitHub and install in editable mode (pip install -v -e .).
Verify installation by running the demo inference script on a sample image.
Prepare your dataset in COCO or Pascal VOC format.
Select a model configuration file from the 'configs/' directory (e.g., Faster R-CNN, Mask R-CNN, or YOLO series).
Customize the config file to point to your local dataset paths and define the number of classes.
Download pre-trained weights to initialize your backbone for faster convergence.
Launch training using the 'tools/train.py' script, optionally enabling distributed training for multiple GPUs.
Evaluate model performance using the 'tools/test.py' script to generate mAP and AR metrics.
All Set
Ready to go
Verified feedback from other users.
"Highly praised for its extensibility and scientific accuracy, though noted for a steep learning curve for beginners."
Post questions, share tips, and help other users.
Effortlessly find and manage open-source dependencies for your projects.

End-to-end typesafe APIs made easy.

Page speed monitoring with Lighthouse, focusing on user experience metrics and data visualization.

Topcoder is a pioneer in crowdsourcing, connecting businesses with a global talent network to solve technical challenges.

Explore millions of Discord Bots and Discord Apps.

Build internal tools 10x faster with an open-source low-code platform.

Open-source RAG evaluation tool for assessing accuracy, context quality, and latency of RAG systems.

AI-powered synthetic data generation for software and AI development, ensuring compliance and accelerating engineering velocity.