Uploaded image for project: 'PDFBox'
  1. PDFBox
  2. PDFBOX-118

Text extraction fails for pages in landscape format

    XMLWordPrintableJSON

Details

    • Bug
    • Status: Closed
    • Resolution: Duplicate
    • None
    • None
    • Text extraction
    • None

    Description

      [imported from SourceForge]
      http://sourceforge.net/tracker/index.php?group_id=78314&atid=552832&aid=1410876
      Originally submitted by lenuweit on 2006-01-20 07:22.

      Text extraction fails for some PDFs (see attached one
      generated by PS printer/Ghostscript) under the
      following circumstances:

      • page is in landscape format
      • setSortByPosition is true

      Extraction works fine if page is in portrait format.

      [attachment on SourceForge]
      http://sourceforge.net/tracker/download.php?group_id=78314&atid=552832&aid=1410876&file_id=164169
      testpdfbox.pdf (application/pdf), 5442 bytes
      sample PDF (1 page in landscape)

      Attachments

        Issue Links

          Activity

            People

              Unassigned Unassigned
              Anonymous Anonymous
              Votes:
              0 Vote for this issue
              Watchers:
              0 Start watching this issue

              Dates

                Created:
                Updated:
                Resolved: