Tika / TIKA-2390

Extract images embedded in HTML

Details

    • Type: Improvement
    • Status: Resolved
    • Priority: Minor
    • Resolution: Duplicate
    • Affects Version/s: 1.15
    • Fix Version/s: 1.18, 2.0.0
    • Component/s: parser
    • Labels: None

    Description

      We should handle images embedded in HTML the same way we do for other formats: as attachments. Are there encodings other than base64 in use for embedding images in HTML?
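
On the encoding question: per RFC 2397, a `data:` URI carries its payload base64-encoded when the `;base64` marker is present and percent-encoded otherwise, so those appear to be the two cases worth handling. Below is a minimal sketch of pulling such images out of `img` tags. It assumes a hypothetical helper class `DataUriImages` that is not part of Tika, and the regex scan is for illustration only; a real parser would more likely read the `src` attribute from the HTML parser's events and pass the decoded bytes on as an attachment.

```java
import java.io.ByteArrayOutputStream;
import java.util.ArrayList;
import java.util.Base64;
import java.util.List;
import java.util.Locale;
import java.util.regex.Matcher;
import java.util.regex.Pattern;

// Hypothetical helper (not a Tika class): finds data: URIs in <img src="..."> and decodes them.
final class DataUriImages {

    /** One decoded image: its declared media type and its raw bytes. */
    static final class EmbeddedImage {
        final String mediaType;
        final byte[] bytes;

        EmbeddedImage(String mediaType, byte[] bytes) {
            this.mediaType = mediaType;
            this.bytes = bytes;
        }
    }

    // Matches src="data:<media type and parameters>,<payload>" in single or double quotes.
    private static final Pattern IMG_DATA_URI = Pattern.compile(
            "<img\\b[^>]*?\\bsrc\\s*=\\s*([\"'])data:([^,\"']*),(.*?)\\1",
            Pattern.CASE_INSENSITIVE | Pattern.DOTALL);

    static List<EmbeddedImage> extract(String html) {
        List<EmbeddedImage> images = new ArrayList<>();
        Matcher m = IMG_DATA_URI.matcher(html);
        while (m.find()) {
            String header = m.group(2);   // e.g. "image/png;base64"
            String payload = m.group(3);  // encoded image data
            boolean base64 = header.toLowerCase(Locale.ROOT).endsWith(";base64");
            String mediaType = header.split(";", 2)[0];
            if (mediaType.isEmpty()) {
                mediaType = "text/plain"; // default media type in RFC 2397
            }
            byte[] bytes = base64
                    // The MIME decoder skips line breaks and other non-alphabet characters.
                    ? Base64.getMimeDecoder().decode(payload)
                    : percentDecode(payload);
            images.add(new EmbeddedImage(mediaType, bytes));
        }
        return images;
    }

    // Decodes %XX escapes straight to bytes; other characters are assumed to be ASCII.
    static byte[] percentDecode(String s) {
        ByteArrayOutputStream out = new ByteArrayOutputStream();
        for (int i = 0; i < s.length(); i++) {
            char c = s.charAt(i);
            if (c == '%' && i + 2 < s.length()) {
                int hi = Character.digit(s.charAt(i + 1), 16);
                int lo = Character.digit(s.charAt(i + 2), 16);
                if (hi >= 0 && lo >= 0) {
                    out.write((hi << 4) | lo);
                    i += 2;
                    continue;
                }
            }
            out.write(c); // malformed escapes and plain characters pass through unchanged
        }
        return out.toByteArray();
    }
}
```

Decoding to bytes by hand rather than through `java.net.URLDecoder` is deliberate in this sketch: `URLDecoder` returns a `String` and turns `+` into a space, either of which could corrupt binary image data.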

            People

              Assignee: Unassigned
              Reporter: Luís Filipe Nassif (lfcnassif)
              Votes: 0
              Watchers: 1