Uploaded image for project: 'PDFBox'
  1. PDFBox
  2. PDFBOX-118

Text extraction fails for pages in landscape format

    Details

    • Type: Bug
    • Status: Closed
    • Resolution: Duplicate
    • Affects Version/s: None
    • Fix Version/s: None
    • Component/s: Text extraction
    • Labels:
      None

      Description

      [imported from SourceForge]
      http://sourceforge.net/tracker/index.php?group_id=78314&atid=552832&aid=1410876
      Originally submitted by lenuweit on 2006-01-20 07:22.

      Text extraction fails for some PDFs (see attached one
      generated by PS printer/Ghostscript) under the
      following circumstances:

      • page is in landscape format
      • setSortByPosition is true

      Extraction works fine if page is in portrait format.

      [attachment on SourceForge]
      http://sourceforge.net/tracker/download.php?group_id=78314&atid=552832&aid=1410876&file_id=164169
      testpdfbox.pdf (application/pdf), 5442 bytes
      sample PDF (1 page in landscape)

        Attachments

          Issue Links

            Activity

              People

              • Assignee:
                Unassigned
                Reporter:
                Anonymous
              • Votes:
                0 Vote for this issue
                Watchers:
                0 Start watching this issue

                Dates

                • Created:
                  Updated:
                  Resolved: