Uploaded image for project: 'Nutch'
  1. Nutch
  2. NUTCH-789

Improvements to Tika parser

VotersWatch issueWatchersLinkCloneUpdate Comment AuthorReplace String in CommentUpdate Comment VisibilityDelete Comments
    XMLWordPrintableJSON

Details

    • Improvement
    • Status: Closed
    • Minor
    • Resolution: Won't Fix
    • None
    • 1.7, 2.2
    • parser
    • None
    • reported by Sami, in NUTCH-766

    Description

      As reported by Sami in NUTCH-766, Sami has a few improvements he made to the Tika parser. We'll track that progress here.

      Attachments

        1. NutchTikaConfig.java
          4 kB
          Chris A. Mattmann
        2. TikaParser.java
          8 kB
          Chris A. Mattmann

        Issue Links

        Activity

          This comment will be Viewable by All Users Viewable by All Users
          Cancel

          People

            chrismattmann Chris A. Mattmann
            chrismattmann Chris A. Mattmann
            Votes:
            0 Vote for this issue
            Watchers:
            1 Start watching this issue

            Dates

              Created:
              Updated:
              Resolved:

              Slack

                Issue deployment