Uploaded image for project: 'PDFBox'
  1. PDFBox
  2. PDFBOX-80

Does not convert spacing. gourps words

    XMLWordPrintableJSON

Details

    • Bug
    • Status: Closed
    • Blocker
    • Resolution: Fixed
    • None
    • 0.8.0-incubator
    • Text extraction
    • None

    Description

      [imported from SourceForge]
      http://sourceforge.net/tracker/index.php?group_id=78314&atid=552832&aid=1258807
      Originally submitted by gohan222 on 2005-08-13 22:47.

      The PDFTextStripper misses some spacing in between
      words. It crunches sentences together on occasions.
      After running extract look for string "demandsofGoogle's".

      [attachment on SourceForge]
      http://sourceforge.net/tracker/download.php?group_id=78314&atid=552832&aid=1258807&file_id=145590
      p125-ghemawat.zip (application/zip), 252359 bytes
      Removes spacing

      Attachments

        Issue Links

          Activity

            People

              Unassigned Unassigned
              jukkaz Jukka Zitting
              Votes:
              1 Vote for this issue
              Watchers:
              1 Start watching this issue

              Dates

                Created:
                Updated:
                Resolved: