Uploaded image for project: 'PDFBox'
  1. PDFBox
  2. PDFBOX-4764

When a PDF has table with blank entries in the column the stripper just ignores the column and moves to next field in the coulmn

    XMLWordPrintableJSON

    Details

    • Type: Bug
    • Status: Closed
    • Priority: Major
    • Resolution: Not A Bug
    • Affects Version/s: 2.0.8
    • Fix Version/s: None
    • Component/s: Text extraction
    • Labels:
      None

      Description

      When a PDF has tables with columns with empty values,the stripper ignores the field and moves to next column which has records(if its blank it should capture)

       

      PDFTextStripperByArea stripper = new PDFTextStripperByArea();
      stripper.setSortByPosition(true);

      PDFTextStripper tStripper = new PDFTextStripper();

      String pdfFileInText = tStripper.getText(document);

        Attachments

          Activity

            People

            • Assignee:
              Unassigned
              Reporter:
              madhube2003@gmail.com karthik guns
            • Votes:
              0 Vote for this issue
              Watchers:
              3 Start watching this issue

              Dates

              • Created:
                Updated:
                Resolved: