Uploaded image for project: 'PDFBox'
  1. PDFBox
  2. PDFBOX-4785

No Unicode mapping with MS-Mincho

    XMLWordPrintableJSON

Details

    • Bug
    • Status: Open
    • Major
    • Resolution: Unresolved
    • 2.0.18, 2.0.19
    • None
    • FontBox
    • None

    Description

      ExtractText from attached pdf fails after v2.0.18 while v2.0.17 succeed.
      Error message is as follows, and can't extract character "最"(CID+7025).

      FEB 26, 2020 10:32:29 AM org.apache.pdfbox.pdmodel.font.PDType0Font toUnicode
      WARNING: No Unicode mapping for CID+7025 (7025) in font NAEGKL+MS-Mincho

      This maybe related to PDFBOX-4661?

      Attachments

        1. E02779_convocation_notice_p14.pdf
          316 kB
          Ryosuke Fujita

        Activity

          People

            Unassigned Unassigned
            fukkun Ryosuke Fujita
            Votes:
            0 Vote for this issue
            Watchers:
            2 Start watching this issue

            Dates

              Created:
              Updated: