Release date: July 5, 2021
This is the first release of the pdfOCR add-on this year.
It brings more advanced image type detection. pdfOCR no longer relies on the file extension to determine the image type; instead, it inspects the file's content, which prevents errors in OCR processes.
This allows you to use files with unknown or incorrect extensions as input, provided their content is structured correctly according to the relevant image format specification.
Image type detection based on file content
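The general idea behind content-based detection can be illustrated with a minimal Python sketch. This is not pdfOCR's implementation or API: the function names detect_image_type and detect_image_type_from_file are hypothetical, and the sketch only compares the leading bytes of the data against the well-known signatures of a few common image formats.

```python
from typing import Optional

# Leading byte signatures ("magic numbers") of a few common image formats.
# Illustrative subset only; not the list of formats pdfOCR supports.
_SIGNATURES = [
    (b"\x89PNG\r\n\x1a\n", "png"),
    (b"\xff\xd8\xff", "jpeg"),
    (b"GIF87a", "gif"),
    (b"GIF89a", "gif"),
    (b"BM", "bmp"),
    (b"II*\x00", "tiff"),  # little-endian TIFF
    (b"MM\x00*", "tiff"),  # big-endian TIFF
]


def detect_image_type(data: bytes) -> Optional[str]:
    """Return the image type implied by the leading bytes, or None if unknown."""
    for signature, image_type in _SIGNATURES:
        if data.startswith(signature):
            return image_type
    return None


def detect_image_type_from_file(path: str) -> Optional[str]:
    """Hypothetical helper: read the file header and ignore the extension."""
    with open(path, "rb") as f:
        return detect_image_type(f.read(16))
```

With this approach, a PNG image saved as scan.dat or scan.jpg is still reported as "png", while a file whose header matches none of the signatures yields None and can be rejected before OCR starts.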
Examples (latest ones)
FAQ (latest ones)