iText Core
Search for "pdfocr" returned 7 results.
-
pdfOCR: Is handwriting recognition supported?
strong suit, so handwritten recognition is not supported at this stage.
-
pdfOCR: Who provided TESS_DATA_DIRECTORY?
downloading the training models, you just need to point to them with pdfOCR (https://itextpdf.com/en/products/itext-7/pdfocr)
-
Which languages are supported in pdfOCR?
Since pdfOCR (https://itextpdf.com/en/products/itext-7/pdfocr) relies on Tesseract (https://github.com/tesseract-ocr/tesseract)
-
What does TextPositioning in pdfOCR do?
pdfOCR (https://itextpdf.com/en/products/itext-7/pdfocr) allows you to define the way text is retrieved in the Tesseract
-
pdfOCR: If your scanned document has a mixture of sections with paragraphs and tables, what is a recommended strategy here?
into paragraphs without losing the words' boundaries. In addition, since pdfOCR 1.0.1, you can also use BY_WORDS_AND_LINES
-
Could not find a glyph corresponding to Unicode character
We ship pdfOCR (https://itextpdf.com/en/products/itext-7/pdfocr) with Liberation (https://en.wikipedia.org/wiki/Liberation_fonts)
-
How do I create a separate OCR layer?
By default, pdfOCR (https://itextpdf.com/en/products/itext-7/pdfocr) merges the recognized text into the image that just got