Release pdfOCR 4.0.1
Release date: Feb 14th, 2025
pdfOCR is our add-on for iText Core to perform OCR on documents and images.
This version improves memory usage when using the Tesseract 4 engine for OCR text extraction.
Downloads
| | GitHub | Maven | NuGet | Artifactory |
|---|---|---|---|---|
| iText pdfOCR – 4.0.1 (Java) | link | link | N/A | link |
| iText pdfOCR – 4.0.1 (.NET) | link | N/A | link | link |
Changelog
Bug fixes
- Improved memory usage when using the Tesseract 4 engine for OCR text extraction
Installation Instructions
Examples (latest ones)
FAQ (latest ones)
- Which languages are supported in pdfOCR?
- What does TextPositioning in pdfOCR do?
- Could not find a glyph corresponding to Unicode character
- pdfOCR: If your scanned document has a mixture of sections with paragraphs and tables, what is a recommended strategy here?
- pdfOCR: Is handwriting recognition supported?