iText

pdf2Data: Add support for tables using ASCII line symbol separators

In previous versions pdf2Data would fail to recognize tables that use horizontal lines that are drawn using ASCII graphics. However, in pdf2Data version 2.1.11, the correct and full recognition of ASCII horizontal lines is supported.

The screenshot below shows a table drawn using horizontal ASCII lines:

image2021-4-7_16-55-50.png

The resulting template shows the following:

image2021-4-7_16-54-13.png

Likewise, the output xml shows the following:


XML
<elements>
  <data name="DataField1">
    <table>
      <row>
        <cell>1</cell>
        <cell>2</cell>
        <cell>3</cell>
      </row>
      <row>
        <cell>A</cell>
        <cell>b</cell>
        <cell>c</cell>
      </row>
      <row>
        <cell>A</cell>
        <cell>b</cell>
        <cell>c</cell>
      </row>
      <row>
        <cell>A</cell>
        <cell>b</cell>
        <cell></cell>
      </row>
    </table>
  </data>
</elements>