Home
Since its initial release, PaddleOCR has gained widespread acclaim across academia, industry, and research communities, thanks to its cutting-edge algorithms and proven performance in real-world applications. Itโs already powering popular open-source projects like Umi-OCR, OmniParser, MinerU, and RAGFlow, making it the go-to OCR toolkit for developers worldwide.
On January 29, 2026, PaddleOCR open-sourced the advanced and efficient document parsing model PaddleOCR-VL-1.5. PaddleOCR-VL-1.5 is a new iterative version of the PaddleOCR-VL series. Based on comprehensive optimization of the core capabilities of version 1.0, the model achieves 94.5% accuracy on the authoritative document parsing benchmark OmniDocBench v1.5, surpassing top global general-purpose large models and document parsingโspecific models.
PaddleOCR-VL-1.5 innovatively supports irregular-shaped bounding box localization of document elements, enabling excellent performance in real-world application scenarios such as scanning, skew, warping, screen-photography, and complex illumination, achieving comprehensive SOTA performance. In addition, the model further integrates seal recognition and spotting tasks, with key metrics continuing to lead mainstream models.
You can use it online on the PaddleOCR official website or call the model API.
Major Features in PaddleOCR 3.x:
-
PaddleOCR-VL - Multilingual Document Parsing via a 0.9B VLM
The SOTA and resource-efficient model tailored for document parsing, that supports 109 languages and excels in recognizing complex elements (e.g., text, tables, formulas, and charts), while maintaining minimal resource consumption. -
PP-OCRv5 โ Universal Scene Text Recognition
Single model supports five text types (Simplified Chinese, Traditional Chinese, English, Japanese, and Pinyin) with 13% accuracy improvement. Solves multilingual mixed document recognition challenges. -
PP-StructureV3 โ Complex Document Parsing
Intelligently converts complex PDFs and document images into Markdown and JSON files that preserve original structure. Outperforms numerous commercial solutions in public benchmarks. Perfectly maintains document layout and hierarchical structure. -
PP-ChatOCRv4 โ Intelligent Information Extraction
Natively integrates ERNIE 4.5 to precisely extract key information from massive documents, with 15% accuracy improvement over previous generation. Makes documents "understand" your questions and provide accurate answers.
๐ก Tips
PaddleOCR's free API now supports up to 20,000 pages of document parsing per day, enabling large-scale PDF file processing, along with MCP and Skills services. For more details, please visit PaddleOCR Official Website.
In addition to its strong model library, PaddleOCR 3.0 also provides easy-to-use tools covering model training, inference, and serving, helping developers bring AI applications into production more efficiently.
In addition, PaddleOCR provides official Agent Skills for invoking text recognition, document parsing, and related capabilities in Skills-enabled AI apps.
You can Quick Start directly, find comprehensive documentation in the PaddleOCR Docs, get support via Github Issues, and explore our OCR courses on OCR courses on AIStudio.
Special Note: PaddleOCR 3.x introduces several significant interface changes. Old code written based on PaddleOCR 2.x is likely incompatible with PaddleOCR 3.x. Please ensure that the documentation you are reading matches the version of PaddleOCR you are using. This document explains the reasons for the upgrade and the major changes from PaddleOCR 2.x to 3.x.
๐ Quick Overview of Execution Results¶
PP-OCRv5¶
PP-StructureV3¶
PaddleOCR-VL¶
๐ฉโ๐ฉโ๐งโ๐ฆ PaddleOCR OCEAN Ecosystem Alliance¶
The lead in single-point technology is just the beginning โ the prosperity of the ecosystem is where long-term value truly lies. To better serve global developers and industry scenarios with OCR and document intelligence technologies, we are officially launching the PaddleOCR OCEAN Ecosystem Alliance.
The alliance name OCEAN embodies five core pillars:
- Open Source โ Open source as the foundation
- Community โ Community-driven
- Ecosystem โ Shared ecosystem success
- Application โ Real-world application
- Network โ Networked collaboration
Positioning: An ecosystem alliance centered on open-source co-building, open to global upstream and downstream partners in OCR and document intelligence. The alliance involves no commercial exclusivity and does not interfere with partners' independent business decisions. It focuses on technical collaboration, community engagement, and mutual influence expansion. Guided by the core principles of openness, symbiosis, and shared success, it brings together developers, platform providers, and application builders to jointly advance the full-chain application of OCR technology and ecosystem prosperity. The alliance is committed to achieving dual growth in full-chain application scale and the number of derivative projects, enabling developers and users worldwide to share in the dividends of OCR technology advancement.
Join Us: Walk into the Deep End with Like-Minded Partners
The PaddleOCR OCEAN Ecosystem Alliance is open to global partners across the OCR and document intelligence value chain. We firmly believe: the value of an ecosystem lies not in quantity, but in quality.
We look forward to welcoming partners who:
- Genuinely embrace the open-source spirit and are willing to co-build and share with an open mindset
- Have the willingness and capability to contribute consistently โ whether through code, use case scenarios, or platform integration
- Are committed to growing together with the alliance โ not chasing short-term traffic, but cultivating long-term value
The alliance is not a hall of fame โ it is a rallying call for those who take action.
We will carefully evaluate every application, giving priority to partners who have already taken action within the PaddleOCR ecosystem or have a clear co-building plan in place. We do not pursue being "large and all-encompassing." Instead, we seek to work hand in hand with truly like-minded organizations and individuals, diving deep into the frontiers of OCR together.
If you resonate with the above vision, we welcome you to reach out through the following channel:
- Send an email to paddleocr@baidu.com with a brief introduction of your collaboration with PaddleOCR or your co-building plan.
