PP-OCRV1

Lightweight Architecture: Optimized for fast inference on CPUs and edge devices.
English & Chinese Support: Primarily focused on these two languages.
Single-Step Detection & Recognition: Uses a simple CNN + LSTM-based approach.
Moderate Accuracy: Works well on clean documents but struggles with complex layouts or noisy backgrounds.
Open-Source Availability: Released to encourage community adoption and improvements.

General Availability Yes (Open-source)

Supported Data Types for Input Image (PNG, JPG)

Supported Data Types for Output Text

Supported # Tokens for Input Single image (max 1024x1024 px)

Supported # Tokens for Output 1k (basic text extraction)

Knowledge Cutoff December 2021

Tool Use None (standalone OCR)