PDFBox / PDFBOX-3074

Mark transparency groups

    Details

    • Type: New Feature
    • Status: Closed
    • Priority: Minor
    • Resolution: Won't Fix
    • Affects Version/s: 2.0.0
    • Fix Version/s: 2.0.0
    • Component/s: Text extraction
    • Labels:

      Description

      We extract text from PDF files, but some files contain extra text that is never shown when the page is viewed. These segments are usually placed in transparency groups, so a function that flags marked content as a transparency group would be quite useful for us.

      If there is already a way to do this, please tell me. If there is a better way to remove text that is not presented or drawn when the PDF is viewed, I'm all ears.
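
      To make the request concrete, here is a minimal sketch of the kind of filtering such a flag would allow. It is not PDFBox code and does not reflect the attached patch. It assumes a heavily simplified model in which a content stream is already parsed into operator/operand pairs, and it uses a hypothetical marked-content tag named "TransparencyGroup" to stand in for the flag. Text shown with Tj is dropped while the stream is inside a BMC/BDC ... EMC section carrying that tag, including nested sections.

```java
import java.util.ArrayDeque;
import java.util.ArrayList;
import java.util.Deque;
import java.util.List;

// Simplified illustration only: real content streams need a proper parser,
// and "TransparencyGroup" is a hypothetical tag, not a PDFBox or PDF name.
final class MarkedContentFilter {

    /** One simplified content-stream operation: an operator and a single operand. */
    static final class Op {
        final String operator;
        final String operand;

        Op(String operator, String operand) {
            this.operator = operator;
            this.operand = operand;
        }
    }

    /** Returns the Tj strings that are not inside a section tagged with hiddenTag. */
    static List<String> visibleText(List<Op> ops, String hiddenTag) {
        List<String> out = new ArrayList<>();
        // One entry per open marked-content section: true if it carries hiddenTag.
        Deque<Boolean> open = new ArrayDeque<>();
        int hiddenDepth = 0; // number of open sections that carry hiddenTag

        for (Op op : ops) {
            switch (op.operator) {
                case "BMC":
                case "BDC": {
                    boolean hidden = hiddenTag.equals(op.operand);
                    open.push(hidden);
                    if (hidden) {
                        hiddenDepth++;
                    }
                    break;
                }
                case "EMC": {
                    // Ignore an unbalanced EMC instead of failing.
                    if (!open.isEmpty() && open.pop()) {
                        hiddenDepth--;
                    }
                    break;
                }
                case "Tj": {
                    if (hiddenDepth == 0) {
                        out.add(op.operand);
                    }
                    break;
                }
                default:
                    break; // all other operators are irrelevant to this sketch
            }
        }
        return out;
    }
}
```

      A caller would build the list of Op values from its own parser and pass the tag it wants to suppress, for example visibleText(ops, "TransparencyGroup"). The depth counter is there so that an untagged section nested inside a tagged one stays hidden until the outer section ends.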

        Attachments

        1. mark_transparency_groups.patch
          5 kB
          Daniel Persson

            People

            • Assignee: Unassigned
            • Reporter: kalaspuffar Daniel Persson
            • Votes: 0
            • Watchers: 1

              Dates

              • Created:
                Updated:
                Resolved: