Home
comments: true hide: - navigation - toc
Since its initial release, PaddleOCR has gained widespread acclaim across academia, industry, and research communities, thanks to its cutting-edge algorithms and proven performance in real-world applications. Itโs already powering popular open-source projects like Umi-OCR, OmniParser, MinerU, and RAGFlow, making it the go-to OCR toolkit for developers worldwide.
On May 20, 2025, the PaddlePaddle team unveiled PaddleOCR 3.0, fully compatible with the official release of the PaddlePaddle 3.0 framework. This update further boosts text-recognition accuracy, adds support for multiple text-type recognition and handwriting recognition, and meets the growing demand from large-model applications for high-precision parsing of complex documents. When combined with the ERNIE 4.5, it significantly enhances key-information extraction accuracy. PaddleOCR 3.0 also introduces support for domestic hardware platforms such as KUNLUNXIN and Ascend.
Major Features in PaddleOCR 3.x:
-
PaddleOCR-VL - Multilingual Document Parsing via a 0.9B VLM
The SOTA and resource-efficient model tailored for document parsing, that supports 109 languages and excels in recognizing complex elements (e.g., text, tables, formulas, and charts), while maintaining minimal resource consumption. -
PP-OCRv5 โ Universal Scene Text Recognition
Single model supports five text types (Simplified Chinese, Traditional Chinese, English, Japanese, and Pinyin) with 13% accuracy improvement. Solves multilingual mixed document recognition challenges. -
PP-StructureV3 โ Complex Document Parsing
Intelligently converts complex PDFs and document images into Markdown and JSON files that preserve original structure. Outperforms numerous commercial solutions in public benchmarks. Perfectly maintains document layout and hierarchical structure. -
PP-ChatOCRv4 โ Intelligent Information Extraction
Natively integrates ERNIE 4.5 to precisely extract key information from massive documents, with 15% accuracy improvement over previous generation. Makes documents "understand" your questions and provide accurate answers.
[!TIP]
On October 24, 2025, the PaddleOCR official website Beta version was launched, offering a more convenient online experience and large-scale PDF file parsing, as well as free API and MCP services. For more details, please visit the PaddleOCR official website.
In addition to providing an outstanding model library, PaddleOCR 3.0 also offers user-friendly tools covering model training, inference, and service deployment, so developers can rapidly bring AI applications to production.
You can Quick Start directly, find comprehensive documentation in the PaddleOCR Docs, get support via Github Issues, and explore our OCR courses on OCR courses on AIStudio.
Special Note: PaddleOCR 3.x introduces several significant interface changes. Old code written based on PaddleOCR 2.x is likely incompatible with PaddleOCR 3.x. Please ensure that the documentation you are reading matches the version of PaddleOCR you are using. This document explains the reasons for the upgrade and the major changes from PaddleOCR 2.x to 3.x.
๐ Quick Overview of Execution Results¶
PP-OCRv5¶
PP-StructureV3¶
PaddleOCR-VL¶
๐ฉโ๐ฉโ๐งโ๐ฆ Community¶
- The PaddleOCR Best Practice Projects call for submissions is now open! ๐ August 5, 2025 โ October 30, 2025. Share your scenario-based PaddleOCR applications and shine in the global developer community!
- ๐ซ Join the PaddlePaddle Community, where you can engage with paddlepaddle developers, researchers, and enthusiasts from around the world.
- ๐ Learn from experts through workshops, tutorials, and Q&A sessions hosted by the AI Studio.
- ๐ Participate in hackathons, challenges, and competitions to showcase your skills and win exciting prizes.
- ๐ฃ Stay updated with the latest news, announcements, and events by following our Twitter and WeChat).

