Release pdf2Data 2.1.11
Release date: April 14, 2021
In this release, we've further improved the table data extraction capabilities. From now on, pdf2Data is able to recognize tables which use ASCII horizontal line symbols as separators.
Surprisingly, such ASCII-made tables are still quite common even today, so if you'd like to learn more about this feature (or you don`t know your ASCII from your elbow) please refer to the example section below. Otherwise, just update pdf2Data to get this functionality.
Additionally, you'll get the possibility to extract additional metadata (fontname, fontstyle, fontsize, and fontcolor) as data field recognition properties.
Downloads:
GitHub | Maven | NuGet | Artifactory | |
---|---|---|---|---|
iText pdf2Data – 2.1.11 (Java) | N/A | N/A | N/A | link |
iText pdf2Data – 2.1.11 (.NET) | N/A | N/A | link | link |
Changelog:
New Features
- recognition of tables using ASCII horizontal line separators
Improvements
- fontname, fontstyle, fontsize, and fontcolor metadata have been added to recognition properties