PDFBox / PDFBOX-1323

PDF with subset CIDFontType2 (TTF) paints wrong characters

Details

    • Type: Bug
    • Status: Closed
    • Priority: Major
    • Resolution: Fixed
    • Affects Version/s: 1.7.0
    • Fix Version/s: 1.7.1
    • Component/s: Swing GUI
    • Labels: None

    Description

      A PDF created with Apache FOP that contains a subset CID TrueType font (CIDFontType2) shows wrong characters when painted. PDFBox apparently cannot currently load the embedded subset TTF, so the font is substituted, but that information is not propagated from the descendant font to the parent font. As a consequence, the code points are used instead of the Unicode text, which results in the wrong glyphs being painted.

      Log output:

      26.05.2012 18:32:11 org.apache.pdfbox.pdmodel.font.PDCIDFont determineEncoding
      SEVERE: Error: Could not parse predefined CMAP file for 'Adobe-Identity-UCS'
      26.05.2012 18:32:11 org.apache.pdfbox.pdmodel.font.PDCIDFontType2Font getawtFont
      INFO: Can't read the embedded font EAAEHE+ArialMT
      26.05.2012 18:32:13 org.apache.pdfbox.pdmodel.font.PDCIDFontType2Font getawtFont
      INFO: Using font Arial instead
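
The failure mode described above can be illustrated with a small, self-contained sketch. It is not PDFBox code: `DescendantFont`, `ParentFont`, `textToDraw`, and the `propagateSubstitution` flag are hypothetical names invented for this illustration. The sketch assumes that a substituted system font has to be addressed with Unicode text, while a successfully loaded embedded subset can be addressed with the raw codes from the content stream.

```java
import java.util.HashMap;
import java.util.Map;

// Illustrative only: none of these types are PDFBox classes.
class SubstitutionSketch {

    /** Hypothetical descendant (CID) font that may fail to load its embedded subset. */
    static class DescendantFont {
        private final boolean embeddedLoaded;

        DescendantFont(boolean embeddedLoaded) {
            this.embeddedLoaded = embeddedLoaded;
        }

        boolean isSubstituted() {
            return !embeddedLoaded;
        }
    }

    /** Hypothetical parent (Type 0) font that decides which string gets painted. */
    static class ParentFont {
        private final DescendantFont descendant;
        private final Map<Integer, String> toUnicode;

        ParentFont(DescendantFont descendant, Map<Integer, String> toUnicode) {
            this.descendant = descendant;
            this.toUnicode = toUnicode;
        }

        String textToDraw(int[] codes, boolean propagateSubstitution) {
            // If the parent never learns about the substitution, it keeps
            // painting the raw codes, which the substitute font maps to the
            // wrong glyphs.
            boolean useUnicode = propagateSubstitution && descendant.isSubstituted();
            StringBuilder out = new StringBuilder();
            for (int code : codes) {
                if (useUnicode) {
                    String mapped = toUnicode.get(code);
                    out.append(mapped != null ? mapped : "?");
                } else {
                    out.append((char) code);
                }
            }
            return out.toString();
        }
    }

    /** Builds a font whose embedded subset failed to load and decodes codes 1 and 2. */
    static String demo(boolean propagateSubstitution) {
        Map<Integer, String> toUnicode = new HashMap<>();
        toUnicode.put(1, "H");
        toUnicode.put(2, "i");
        ParentFont font = new ParentFont(new DescendantFont(false), toUnicode);
        return font.textToDraw(new int[] {1, 2}, propagateSubstitution);
    }

    public static void main(String[] args) {
        System.out.println("propagated: " + demo(true));
        String raw = demo(false);
        System.out.println("not propagated, code values: "
                + (int) raw.charAt(0) + " " + (int) raw.charAt(1));
    }
}
```

With the flag set, the parent maps each code to Unicode before painting. Without it, the string handed to the substitute font consists of the raw code values 1 and 2, which is the kind of mismatch the report describes.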

      Attachments

        1. hello-from-fop.pdf (17 kB, Jeremias Maerki)

          People

            Jeremias Maerki (jeremias@apache.org)
            Votes: 0
            Watchers: 1
