PDFBox / PDFBOX-3054

Getting Unicode mapping error, file was OK in 1.8

    Details

    • Type: Bug
    • Status: Closed
    • Priority: Major
    • Resolution: Duplicate
    • Affects Version/s: 2.0.0
    • Fix Version/s: None
    • Component/s: Text extraction
    • Labels:

      Description

      Text extraction on the attached file produces many warnings like:

      WARNING: No Unicode mapping for c (131) in font C0HR11_T1GI0361

      and then returns gibberish for all but the first 4 strings.

      In 1.8 all the text characters were extracted correctly. The file displays fine in Acrobat, and the text can be copied and pasted from there as well.

      The file uses Type 3 fonts.

      Tested against trunk build 20151024.140757-1624.
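
To make it easier to see which fonts and character codes are affected, the warnings can be tallied from a captured log. This is only a rough sketch using the standard library, not part of PDFBox: the class name `UnmappedGlyphTally` and the method `tally` are made up for illustration, and the regular expression assumes every warning has exactly the shape of the line quoted above.

```java
import java.util.Arrays;
import java.util.List;
import java.util.Map;
import java.util.SortedSet;
import java.util.TreeMap;
import java.util.TreeSet;
import java.util.regex.Matcher;
import java.util.regex.Pattern;

// Hypothetical helper: groups "No Unicode mapping" warnings by font name.
public class UnmappedGlyphTally {

    // Assumed warning shape: "No Unicode mapping for <name> (<code>) in font <font>"
    private static final Pattern WARNING = Pattern.compile(
            "No Unicode mapping for (\\S+) \\((\\d+)\\) in font (\\S+)");

    /** Returns font name -> sorted set of character codes reported as unmapped. */
    public static Map<String, SortedSet<Integer>> tally(List<String> logLines) {
        Map<String, SortedSet<Integer>> byFont = new TreeMap<>();
        for (String line : logLines) {
            Matcher m = WARNING.matcher(line);
            if (m.find()) {
                // group(3) is the font name, group(2) the numeric character code
                byFont.computeIfAbsent(m.group(3), k -> new TreeSet<>())
                      .add(Integer.parseInt(m.group(2)));
            }
            // Lines that do not match the assumed shape are ignored.
        }
        return byFont;
    }

    public static void main(String[] args) {
        List<String> sample = Arrays.asList(
                "WARNING: No Unicode mapping for c (131) in font C0HR11_T1GI0361");
        System.out.println(tally(sample)); // prints {C0HR11_T1GI0361=[131]}
    }
}
```

If all reported codes fall in one or two fonts, that would suggest the problem is specific to how those fonts are encoded rather than to the extraction as a whole.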

              People

              • Assignee: Tilman Hausherr (tilman)
              • Reporter: Fred Andrews (fred_andrews)
              • Votes: 0
              • Watchers: 2
