Sourcify
Effortlessly find and manage open-source dependencies for your projects.

The open-source toolkit for deep learning-based document image analysis and structured data extraction.

Layout Parser is a comprehensive Python-based framework designed to streamline the pipeline of document image analysis. As of 2026, it remains a critical infrastructure component for developers building high-accuracy OCR and document understanding applications. The tool provides a unified interface for state-of-the-art deep learning models, allowing for the detection of complex layouts—including tables, figures, headers, and multi-column text. It effectively bridges the gap between raw document images (scanned PDFs, photographs) and structured digital formats. By integrating with major backends like Detectron2 and PaddleDetection, it offers a plug-and-play architecture for loading pre-trained weights from the 'Layout Bank.' Its versatility extends to OCR orchestration, supporting engines such as Tesseract and Google Cloud Vision. In the 2026 market, Layout Parser is positioned as the go-to open-source alternative to proprietary solutions like Amazon Textract, favored for its flexibility in self-hosting and fine-tuning models on niche datasets. Its modularity allows enterprises to build custom parsing pipelines that maintain data privacy and reduce recurring API costs associated with commercial SaaS offerings.
Layout Parser is a comprehensive Python-based framework designed to streamline the pipeline of document image analysis.
Explore all tools that specialize in extract structured data. This domain focus ensures Layout Parser delivers optimized results for this specific requirement.
Explore all tools that specialize in text segmentation. This domain focus ensures Layout Parser delivers optimized results for this specific requirement.
A repository of pre-trained deep learning models specialized for various document types like academic papers and newspapers.
A wrapper that provides a consistent interface for different OCR engines including Tesseract, PyTesseract, and Google Cloud Vision.
Stores layout information in a nested structure allowing for parent-child relationship tracking between blocks.
Matplotlib-based module for overlaying detected bounding boxes and segmentation masks on images.
Dual-backend support for two of the most popular computer vision frameworks.
Allows users to load their own PyTorch or PaddlePaddle model weights directly into the layout detection pipeline.
Tools to handle coordinate scaling and rotation corrections during the detection phase.
Install Layout Parser library via pip install layoutparser.
Install deep learning backend (Detectron2 or PaddleDetection) based on hardware availability.
Import the layoutparser module into your Python environment.
Select and download a pre-trained model from the Layout Bank (e.g., HJDataset for historical documents).
Load the document image into the framework using the built-in image loader.
Initialize the layout model with specific configuration parameters.
Perform inference to detect layout elements (text blocks, tables, images).
(Optional) Initialize and run an OCR engine (Tesseract/Google Vision) on detected text regions.
Use the visualization module to verify bounding boxes and labels.
Export structured data to JSON or CSV for downstream processing.
All Set
Ready to go
Verified feedback from other users.
"Highly praised for its modularity and the quality of pre-trained models. Developers value its cost-saving potential over SaaS OCR."
Post questions, share tips, and help other users.
Effortlessly find and manage open-source dependencies for your projects.

End-to-end typesafe APIs made easy.

Page speed monitoring with Lighthouse, focusing on user experience metrics and data visualization.

Topcoder is a pioneer in crowdsourcing, connecting businesses with a global talent network to solve technical challenges.

Explore millions of Discord Bots and Discord Apps.

Build internal tools 10x faster with an open-source low-code platform.

Open-source RAG evaluation tool for assessing accuracy, context quality, and latency of RAG systems.

AI-powered synthetic data generation for software and AI development, ensuring compliance and accelerating engineering velocity.