Release date: April 14, 2021

In this release, we've further improved the table data extraction capabilities. From now on, pdf2Data is able to recognize tables which use ASCII horizontal line symbols as separators.
Surprisingly, such ASCII-made tables are still quite common even today, so if you'd like to learn more about this feature (or you don`t know your ASCII from your elbow) please refer to the example section below. Otherwise, just update pdf2Data to get this functionality.

Additionally, you'll get the possibility to extract additional metadata (fontname, fontstyle, fontsize, and fontcolor) as data field recognition properties.

Release Related Examples



GitHubMavenNuGetArtifactory
iText pdf2Data – 2.1.11 (Java)N/AN/AN/Alink
iText pdf2Data – 2.1.11 (.NET)N/AN/Alinklink

New Features

  • recognition of tables using ASCII horizontal line separators

Improvements

  • fontname, fontstyle, fontsize, and fontcolor metadata have been added to recognition properties

Installation Instructions

There is no content with the specified labels

Video tutorials