TIKA-2768

Bold character formatting is lost when PDF documents are parsed with PDFParser

Details

    • Type: Bug
    • Status: Open
    • Priority: Major
    • Resolution: Unresolved
    • Affects Version/s: 1.19.1
    • Fix Version/s: 1.19.1
    • Component/s: parser
    • Labels: None

    Description

      When PDF documents are parsed with PDFParser (as selected by AutoDetectParser), bold formatting is lost because the method

      writeString(String text, List<TextPosition> textPositions) 

      ignores its textPositions parameter. That parameter carries the font information for each character, which could be used to determine whether the text is bold.
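
      To illustrate the idea, here is a minimal sketch of how bold runs might be derived from per-character font data. It is not Tika's implementation. Glyph is a hypothetical stand-in for PDFBox's TextPosition so that the example runs without dependencies; in PDFBox 2.x the corresponding values would presumably come from TextPosition.getFont(), PDFont.getName() and PDFontDescriptor.getFontWeight(). The weight threshold of 700 and the "bold" name check are assumed heuristics, and the <b> string output is only for demonstration.

```java
import java.util.List;
import java.util.Locale;

/** Sketch only: groups consecutive bold characters into <b>...</b> runs. */
final class BoldRunSketch {

    /** Hypothetical stand-in for the font data a TextPosition exposes. */
    static final class Glyph {
        final String unicode;   // the character(s) this position represents
        final String fontName;  // e.g. "ABCDEF+Helvetica-Bold"; may be null
        final float fontWeight; // font descriptor weight; 0 if unknown

        Glyph(String unicode, String fontName, float fontWeight) {
            this.unicode = unicode;
            this.fontName = fontName;
            this.fontWeight = fontWeight;
        }
    }

    /** Assumed heuristic: weight of 700 or more, or "bold" in the font name. */
    static boolean isBold(Glyph g) {
        if (g.fontWeight >= 700f) {
            return true;
        }
        return g.fontName != null
                && g.fontName.toLowerCase(Locale.ROOT).contains("bold");
    }

    /** Opens a <b> tag when a bold run starts and closes it when the run ends. */
    static String render(List<Glyph> glyphs) {
        StringBuilder out = new StringBuilder();
        boolean inBold = false;
        for (Glyph g : glyphs) {
            boolean bold = isBold(g);
            if (bold && !inBold) {
                out.append("<b>");
            } else if (!bold && inBold) {
                out.append("</b>");
            }
            out.append(g.unicode);
            inBold = bold;
        }
        if (inBold) {
            out.append("</b>"); // close a run that reaches the end of the text
        }
        return out.toString();
    }
}
```

      A real fix would more likely emit start and end elements through the parser's content handler than build strings, and the heuristic would miss bold text that a PDF simulates by other means, such as stroking the glyph outlines.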

          People

            Assignee: Unassigned
            Reporter: Phanindra Ramesh (phanindraramesh)