Tika / TIKA-625

Easier XML parser extensibility


Details

    • Type: Improvement
    • Status: Closed
    • Priority: Minor
    • Resolution: Fixed
    • Affects Version/s: None
    • Fix Version/s: 0.10
    • Component/s: parser
    • Labels: None

    Description

      The DcXMLParser class uses our streaming XPath mechanism to locate Dublin Core elements in a stream of SAX events. While powerful, that mechanism is cumbersome for simple cases where you just want to map the contents of a specific XML element or attribute into a metadata field. To make this simpler (and to remove the XPath processing overhead), I'd like to add new AttributeMetadataHandler and ElementMetadataHandler utility classes that focus on this specific use case.
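
      To illustrate the idea, here is a minimal sketch using only the JDK's SAX API. It is not the Tika implementation: the class names `ElementToField`, `AttributeToField`, and `Tee` are hypothetical, a plain `Map` stands in for Tika's `Metadata` object, and the field names `title` and `language` are chosen only for the example. Each handler watches for one element or attribute by namespace URI and local name and copies its value into a field, with no XPath matching involved.

```java
import java.io.StringReader;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;
import javax.xml.parsers.SAXParserFactory;
import org.xml.sax.Attributes;
import org.xml.sax.InputSource;
import org.xml.sax.SAXException;
import org.xml.sax.helpers.DefaultHandler;

public class MetadataHandlerSketch {

    /** Hypothetical handler: copies the text content of one element into a field. */
    static class ElementToField extends DefaultHandler {
        private final String uri, localName, field;
        private final Map<String, String> metadata; // stand-in for Tika's Metadata
        private final StringBuilder buffer = new StringBuilder();
        private int depth = 0; // nesting level of the target element

        ElementToField(String uri, String localName, Map<String, String> metadata, String field) {
            this.uri = uri; this.localName = localName;
            this.metadata = metadata; this.field = field;
        }
        private boolean matches(String u, String l) {
            return uri.equals(u) && localName.equals(l);
        }
        @Override public void startElement(String u, String l, String q, Attributes a) {
            if (matches(u, l) && depth++ == 0) buffer.setLength(0); // start collecting
        }
        @Override public void characters(char[] ch, int start, int length) {
            if (depth > 0) buffer.append(ch, start, length);
        }
        @Override public void endElement(String u, String l, String q) {
            if (matches(u, l) && --depth == 0) metadata.put(field, buffer.toString().trim());
        }
    }

    /** Hypothetical handler: copies one attribute's value into a field. */
    static class AttributeToField extends DefaultHandler {
        private final String uri, localName, field;
        private final Map<String, String> metadata;

        AttributeToField(String uri, String localName, Map<String, String> metadata, String field) {
            this.uri = uri; this.localName = localName;
            this.metadata = metadata; this.field = field;
        }
        @Override public void startElement(String u, String l, String q, Attributes a) {
            String value = a.getValue(uri, localName); // null if the attribute is absent
            if (value != null) metadata.put(field, value);
        }
    }

    /** Forwards each SAX event to several handlers so one parse feeds them all. */
    static class Tee extends DefaultHandler {
        private final List<DefaultHandler> handlers;
        Tee(List<DefaultHandler> handlers) { this.handlers = handlers; }
        @Override public void startElement(String u, String l, String q, Attributes a) throws SAXException {
            for (DefaultHandler h : handlers) h.startElement(u, l, q, a);
        }
        @Override public void characters(char[] ch, int s, int n) throws SAXException {
            for (DefaultHandler h : handlers) h.characters(ch, s, n);
        }
        @Override public void endElement(String u, String l, String q) throws SAXException {
            for (DefaultHandler h : handlers) h.endElement(u, l, q);
        }
    }

    /** Parses the XML once and returns the extracted fields. */
    static Map<String, String> extract(String xml) throws Exception {
        Map<String, String> metadata = new LinkedHashMap<>();
        DefaultHandler handler = new Tee(List.of(
            new ElementToField("http://purl.org/dc/elements/1.1/", "title", metadata, "title"),
            new AttributeToField("", "lang", metadata, "language")));
        SAXParserFactory factory = SAXParserFactory.newInstance();
        factory.setNamespaceAware(true); // needed to match on URI + local name
        factory.newSAXParser().parse(new InputSource(new StringReader(xml)), handler);
        return metadata;
    }

    public static void main(String[] args) throws Exception {
        String xml = "<doc xmlns:dc=\"http://purl.org/dc/elements/1.1/\" lang=\"en\">"
                   + "<dc:title> Hello Tika </dc:title></doc>";
        System.out.println(extract(xml));
    }
}
```

      The design choice worth noting is that each handler holds only a namespace URI, a local name, and a target field, so adding another mapping means adding one more handler to the list rather than writing another XPath expression.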


          People

            Assignee: Jukka Zitting (jukkaz)
            Reporter: Jukka Zitting (jukkaz)
            Votes: 0
            Watchers: 0
