

You need to download the tessdata directory yourself, because several repositories offer different trained models. A good place to start is Tesseract's own GitHub repositories, where you can find the tessdata (standard model) and tessdata_best (slower, but more accurate) data files.

After downloading the trained models, point pdfOCR to the directory that contains them, using the appropriate method for your platform (Java or .NET).
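Before handing the directory to pdfOCR, it can help to confirm that it actually contains the model file for the language you plan to use. The following is a minimal Java sketch using only the standard library. The class name `TessDataCheck`, the helper `hasLanguageModel`, the default `tessdata` location and the `eng` language code are illustrative assumptions, not part of pdfOCR. The pdfOCR call mentioned in the closing comment is likewise an assumption about the Java API and should be checked against the documentation for your version.

```java
import java.nio.file.Files;
import java.nio.file.Path;
import java.nio.file.Paths;

public class TessDataCheck {

    // Returns true if tessDataDir is a directory that contains
    // "<language>.traineddata", the file naming used by Tesseract models.
    static boolean hasLanguageModel(Path tessDataDir, String language) {
        return Files.isDirectory(tessDataDir)
                && Files.isRegularFile(tessDataDir.resolve(language + ".traineddata"));
    }

    public static void main(String[] args) {
        // Hypothetical defaults: a "tessdata" folder in the working directory, English model.
        Path tessData = Paths.get(args.length > 0 ? args[0] : "tessdata");
        String language = args.length > 1 ? args[1] : "eng";

        if (!hasLanguageModel(tessData, language)) {
            System.err.println("Missing " + language + ".traineddata in "
                    + tessData.toAbsolutePath());
            return;
        }
        System.out.println("Found " + language + " model in " + tessData.toAbsolutePath());

        // Assumption: with pdfOCR on the classpath, this directory would then be
        // passed to the engine properties, for example via
        // Tesseract4OcrEngineProperties.setPathToTessData(tessData.toFile()).
    }
}
```

Running the check first turns a missing or misplaced model file into an explicit message rather than a failure later during OCR.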
