Uploaded image for project: 'Tika'
  1. Tika
  2. TIKA-1508

Add uniformity to parser parameter configuration

Agile BoardAttach filesAttach ScreenshotAdd voteVotersStop watchingWatchersCreate sub-taskLinkCloneUpdate Comment AuthorReplace String in CommentUpdate Comment VisibilityDelete Comments
    XMLWordPrintableJSON

Details

    • Improvement
    • Status: Reopened
    • Major
    • Resolution: Unresolved
    • None
    • 1.14
    • None
    • None

    Description

      We can currently configure parsers by the following means:
      1) programmatically by direct calls to the parsers or their config objects
      2) sending in a config object through the ParseContext
      3) modifying .properties files for specific parsers (e.g. PDFParser)

      Rather than scattering the landscape with .properties files for each parser, it would be great if we could specify parser parameters in the main config file, something along the lines of this:

          <parser class="org.apache.tika.parser.audio.AudioParser">
            <params>
              <int name="someparam1">2</int>
              <str name="someOtherParam2">something or other</str>
            </params>
            <mime>audio/basic</mime>
            <mime>audio/x-aiff</mime>
            <mime>audio/x-wav</mime>
          </parser>
      

      Attachments

        Issue Links

        Activity

          This comment will be Viewable by All Users Viewable by All Users
          Cancel

          People

            chrismattmann Chris A. Mattmann
            tallison Tim Allison

            Dates

              Created:
              Updated:

              Slack

                Issue deployment