Tika / TIKA-4243

Tika configuration overhaul

Details

    • Type: New Feature
    • Status: Resolved
    • Priority: Major
    • Resolution: Fixed
    • Affects Version/s: 3.0.0
    • Fix Version/s: 3.0.0
    • Component/s: config
    • Labels: None

    Description

      For Tika 3.0.0, it would greatly help to have a typed configuration schema.

      In 3.x, can we remove the old way of doing configs and replace it with JSON Schema?

      JSON Schema can be converted to POJOs using a Maven plugin: https://github.com/joelittlejohn/jsonschema2pojo

      This automatically generates a Java POJO model that we can use for the configs.

      This would allow the legacy tika-config XML to be read and converted to the new POJOs using an XML mapper, so that users do not have to adopt JSON configurations yet if they do not want to.
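
      A minimal sketch of the legacy-XML-to-POJO idea, using only the JDK's DOM parser rather than a real XML mapper. The class names LegacyConfigReader and ParserConfig are hypothetical stand-ins (in the proposal, the POJOs would be generated from the JSON Schema), and the element layout assumes the familiar properties/parsers/parser/params/param structure of tika-config.xml.

```java
import java.io.ByteArrayInputStream;
import java.nio.charset.StandardCharsets;
import java.util.ArrayList;
import java.util.Collections;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;
import javax.xml.parsers.DocumentBuilderFactory;
import org.w3c.dom.Document;
import org.w3c.dom.Element;
import org.w3c.dom.NodeList;

// Hypothetical reader: not part of Tika, shown only to illustrate the mapping.
final class LegacyConfigReader {

    /** Hypothetical typed model; a schema-generated POJO would take its place. */
    static final class ParserConfig {
        private final String className;
        private final Map<String, String> params;

        ParserConfig(String className, Map<String, String> params) {
            this.className = className;
            this.params = Collections.unmodifiableMap(params);
        }

        String getClassName() { return className; }
        Map<String, String> getParams() { return params; }
    }

    /** Reads every <parser> element and its <param> children into typed objects. */
    static List<ParserConfig> read(String xml) {
        try {
            DocumentBuilderFactory factory = DocumentBuilderFactory.newInstance();
            // Config files do not need DTDs; refusing them avoids XXE issues.
            factory.setFeature("http://apache.org/xml/features/disallow-doctype-decl", true);
            Document doc = factory.newDocumentBuilder()
                    .parse(new ByteArrayInputStream(xml.getBytes(StandardCharsets.UTF_8)));

            List<ParserConfig> result = new ArrayList<>();
            NodeList parsers = doc.getElementsByTagName("parser");
            for (int i = 0; i < parsers.getLength(); i++) {
                Element parser = (Element) parsers.item(i);
                Map<String, String> params = new LinkedHashMap<>();
                NodeList paramNodes = parser.getElementsByTagName("param");
                for (int j = 0; j < paramNodes.getLength(); j++) {
                    Element param = (Element) paramNodes.item(j);
                    params.put(param.getAttribute("name"), param.getTextContent().trim());
                }
                result.add(new ParserConfig(parser.getAttribute("class"), params));
            }
            return result;
        } catch (Exception e) {
            throw new IllegalArgumentException("Could not read legacy config XML", e);
        }
    }
}
```

      With a real XML mapper the loop would collapse into a single mapping call; the point here is only that existing XML can feed the same typed model that JSON or YAML would.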

      Once this is complete, configurations could be written in XML, JSON, or YAML:

      tika-config.xml

      tika-config.json

      tika-config.yaml
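
      One way the three file names above could be told apart is by extension. The sketch below is only an illustration: ConfigFormats and ConfigFormat are hypothetical names, and accepting ".yml" alongside ".yaml" is an assumption that the issue does not state.

```java
import java.util.Locale;

// Hypothetical helper: not part of Tika.
final class ConfigFormats {

    enum ConfigFormat { XML, JSON, YAML }

    /** Picks a format from the file extension, ignoring case. */
    static ConfigFormat fromFileName(String fileName) {
        String lower = fileName.toLowerCase(Locale.ROOT);
        if (lower.endsWith(".xml")) {
            return ConfigFormat.XML;
        }
        if (lower.endsWith(".json")) {
            return ConfigFormat.JSON;
        }
        // ".yml" is accepted here as an assumption, not something the issue specifies.
        if (lower.endsWith(".yaml") || lower.endsWith(".yml")) {
            return ConfigFormat.YAML;
        }
        throw new IllegalArgumentException("Unsupported config file: " + fileName);
    }
}
```

      Each format would then be handed to a matching mapper that produces the same POJO model, so the rest of the code never needs to know which syntax the user chose.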

      Replace all uses of the Tika config annotations that rely on the old syntax with the POJO model deserialized from the XML/JSON/YAML.

            People

              Assignee: Unassigned
              Reporter: Nicholas DiPiazza (ndipiazza)
              Votes: 0
              Watchers: 3
