Uploaded image for project: 'PDFBox'
  1. PDFBox
  2. PDFBOX-5905

Many ZapfDingbats symbols do not appear when page is rendered.

    XMLWordPrintableJSON

Details

    • Bug
    • Status: Resolved
    • Major
    • Resolution: Fixed
    • 2.0.32, 3.0.3 PDFBox
    • 2.0.33, 3.0.4 PDFBox, 4.0.0
    • Rendering
    • None
    • Ubuntu 22.04

    Description

      We've encountered some PDFs that have characters written using the ZapfDingbats symbol font. On Windows PdfBox recreates the symbols appropriately, but when testing on Ubuntu 22.04 we've noticed that many symbols, such as the heavy checkmark (U+2714) are missing. Instead an empty character is produced. 

      Upon debugging it appears that the PdfBox toolkit substitutes the ZapfDingbats font for some other font instead. In our observation, the code attempts to fallback to "Helvetica," but because this font is not present the last-resort "LiberationSans" is selected instead. This font is not capable of displaying the heavy checkmark symbol (a20/U+2714) and hence it is missing when the PDF page is rendered to an image. The following appears in the Debug console:

      WARNING: Using fallback font LiberationSans for ZapfDingbats

      While debugging, if we inject the font name "DejaVuSans" into the variable for "fallbackName" in the FontMapperImpl::getFontBoxFont(String, PDFontDescriptor) method, then DejaVuSans is instead resolved as the base font and the heavy checkmark symbol (and others) is drawn correctly.

      It's not clear to us why a more appropriate font is not chosen in this instance. It results in many ZapfDingbats symbols missing when the page is rendered.

      This PDF file demonstrates the problem.

      Steps to Reproduce:

      1. Create a Java project and import PdfBox v2.0.32
      2. Execute the below code snippet, substituting the placeholder file paths for real ones on your system, on Ubuntu 22.04
      3. Observe that the image file that is produced is missing many symbols.
      String inputPath = "zapf-dingbats.pdf";
      File pdfFile = new File(inputPath);
      try (PDDocument pdfDoc = PDDocument.load(pdfFile)) {
          PDFRenderer renderer = new PDFRenderer(pdfDoc);
          BufferedImage image = renderer.renderImage(0);
      
          String outputPath = "zapf-dingbats.ubuntu.png";
          try (OutputStream outFile = new FileOutputStream(outputPath)) {
              ImageIOUtil.writeImage(image, "PNG", outFile);
          }
      }
      catch (IOException e) {
          e.printStackTrace();
      }
      

      Attachments

        1. missing_symbols.png
          21 kB
          Robert Amidon
        2. image-2024-12-17-13-57-21-509.png
          48 kB
          Robert Amidon

        Issue Links

          Activity

            People

              tilman Tilman Hausherr
              rpa242 Robert Amidon
              Votes:
              0 Vote for this issue
              Watchers:
              3 Start watching this issue

              Dates

                Created:
                Updated:
                Resolved: