  1. Tika
  2. TIKA-3695

LimitingMetadataFilter

Details

    • Type: New Feature
    • Status: Resolved
    • Priority: Major
    • Resolution: Fixed
    • Affects Version/s: 1.28.1, 2.3.0
    • Fix Version/s: 2.4.0
    • Component/s: metadata
    • Labels: None

    Description

      Some files contain abnormally large metadata (several MB), whether in individual metadata values, in metadata names, or in the total amount of metadata. This can cause excessive memory consumption.

      It would be useful to add a new LimitingMetadataFilter that filters out metadata according to separate byte limits: one on metadata names, one on metadata values, and one on the total amount of metadata.
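
      To make the three limits concrete, here is a minimal sketch in plain Java. It is not Tika's implementation and does not use Tika's Metadata or MetadataFilter classes: the class name MetadataLimiter, the method limit, and its parameters are hypothetical, metadata is modeled as a simple map from names to value lists, and sizes are assumed to be measured in UTF-8 bytes. The choice to drop oversized entries rather than truncate them is also an assumption.

```java
import java.nio.charset.StandardCharsets;
import java.util.ArrayList;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;

// Hypothetical helper, not part of Tika: illustrates per-name, per-value
// and total byte limits on a plain map of metadata.
class MetadataLimiter {

    // Size of a string in bytes once encoded as UTF-8 (assumed unit).
    static int utf8Length(String s) {
        return s.getBytes(StandardCharsets.UTF_8).length;
    }

    static Map<String, List<String>> limit(Map<String, List<String>> metadata,
                                           int maxNameBytes,
                                           int maxValueBytes,
                                           int maxTotalBytes) {
        Map<String, List<String>> kept = new LinkedHashMap<>();
        int total = 0; // bytes of names and values kept so far

        for (Map.Entry<String, List<String>> entry : metadata.entrySet()) {
            String name = entry.getKey();
            int nameBytes = utf8Length(name);
            if (nameBytes > maxNameBytes) {
                continue; // drop entries whose name is too large
            }

            List<String> keptValues = new ArrayList<>();
            int entryBytes = nameBytes; // the name is counted once per entry
            for (String value : entry.getValue()) {
                int valueBytes = utf8Length(value);
                if (valueBytes > maxValueBytes) {
                    continue; // drop individual values that are too large
                }
                if (total + entryBytes + valueBytes > maxTotalBytes) {
                    continue; // drop values that would exceed the global budget
                }
                keptValues.add(value);
                entryBytes += valueBytes;
            }

            // Only keep the name if at least one of its values survived.
            if (!keptValues.isEmpty()) {
                kept.put(name, keptValues);
                total += entryBytes;
            }
        }
        return kept;
    }
}
```

      With limits of 20 bytes per name, 10 bytes per value and 20 bytes in total, a map holding a short title, a 50-byte title value, a 40-byte name and a short author would keep only the short title and the author. A real filter would probably also need a policy for truncating rather than dropping, and for which fields to prioritize when the total budget runs out.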

       

      Attachments

        1. huge-title.docx
          4 kB
          Tim Allison
        2. tika-config.xml
          0.4 kB
          Tim Allison

            People

              Assignee: Unassigned
              Reporter: Julien Massiera (julienFL)
              Votes: 0
              Watchers: 4