PDFBOX-3092: Format 4 TTF cmap table is parsed incorrectly

Details

    • Type: Bug
    • Status: Closed
    • Priority: Major
    • Resolution: Invalid
    • Affects Version: 2.0.0

    Description

      Certain large Format 4 cmap tables are not parsed correctly by CmapSubtable#processSubtype4(). One example is the table in the font "ArialUnicodeMS".

      This results in missing glyphs when rendering the file from PDFBOX-2950 with "ArialUnicodeMS" as the substitute font. To force that substitution, change the following line of PDCIDFontType2:

      // find font or substitute
      CIDFontMapping mapping = FontMappers.instance()
                                          .getCIDFont(getBaseFont(), getFontDescriptor(),
                                                      getCIDSystemInfo());
      

      Replace getBaseFont() with the string literal "ArialUnicodeMS".
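
For reference when checking the parser's output, here is a minimal sketch of a Format 4 lookup as the OpenType specification describes it. It is not the code in CmapSubtable#processSubtype4(). The class name Format4CmapSketch, the example() helper, and the sample segment data are hypothetical, and a linear scan stands in for the binary search the specification suggests.

```java
/** Hypothetical sketch of an OpenType cmap Format 4 lookup; not PDFBox code. */
final class Format4CmapSketch {
    private final int[] endCode;        // last character code of each segment
    private final int[] startCode;      // first character code of each segment
    private final int[] idDelta;        // signed delta, applied modulo 65536
    private final int[] idRangeOffset;  // byte offset into glyphIdArray, or 0
    private final int[] glyphIdArray;   // glyph indices used by offset segments

    Format4CmapSketch(int[] endCode, int[] startCode, int[] idDelta,
                      int[] idRangeOffset, int[] glyphIdArray) {
        this.endCode = endCode;
        this.startCode = startCode;
        this.idDelta = idDelta;
        this.idRangeOffset = idRangeOffset;
        this.glyphIdArray = glyphIdArray;
    }

    /** Returns the glyph index for a BMP code point, or 0 (missing glyph). */
    int glyphId(int codePoint) {
        int segCount = endCode.length;
        for (int i = 0; i < segCount; i++) {
            if (endCode[i] < codePoint) {
                continue; // segments are sorted by endCode; keep searching
            }
            if (startCode[i] > codePoint) {
                return 0; // code point falls in a gap between segments
            }
            if (idRangeOffset[i] == 0) {
                // Direct mapping: add the delta modulo 65536.
                return (codePoint + idDelta[i]) & 0xFFFF;
            }
            // idRangeOffset is measured in bytes from its own position in the
            // file. The glyphIdArray starts (segCount - i) 16-bit words later.
            int index = idRangeOffset[i] / 2 + (codePoint - startCode[i])
                    - (segCount - i);
            if (index < 0 || index >= glyphIdArray.length) {
                return 0; // malformed offset; treat as missing glyph
            }
            int glyph = glyphIdArray[index];
            return glyph == 0 ? 0 : (glyph + idDelta[i]) & 0xFFFF;
        }
        return 0;
    }

    /** Sample data: 'A'..'C' by delta, 'a'..'b' by offset, then the 0xFFFF terminator. */
    static Format4CmapSketch example() {
        return new Format4CmapSketch(
                new int[] {0x43, 0x62, 0xFFFF},   // endCode
                new int[] {0x41, 0x61, 0xFFFF},   // startCode
                new int[] {-64, 0, 1},            // idDelta
                new int[] {0, 4, 0},              // idRangeOffset
                new int[] {10, 0});               // glyphIdArray
    }
}
```

With the sample data, 'A' through 'C' map by delta alone, while 'a' and 'b' go through glyphIdArray. Comparing a parser's results against a lookup like this for the code points that render as missing glyphs is one way to tell a parsing fault from a font that simply lacks the glyph.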

            People

              Assignee: Unassigned
              Reporter: John Hewson (jahewson)