Uploaded image for project: 'Nutch'
  1. Nutch
  2. NUTCH-2172

index-more: document format of contenttype-mapping.txt

Attach filesAttach ScreenshotVotersWatch issueWatchersCreate sub-taskLinkCloneUpdate Comment AuthorReplace String in CommentUpdate Comment VisibilityDelete Comments
    XMLWordPrintableJSON

    Details

    • Type: Improvement
    • Status: Resolved
    • Priority: Minor
    • Resolution: Fixed
    • Affects Version/s: 1.10
    • Fix Version/s: 1.12
    • Component/s: indexer, metadata
    • Labels:
    • Environment:

      Macosx, Java 8

      Description

      The index-more plugin uses the conf/contenttype-mapping.txt file to build up the mimeMap hash table (in the readConfiguration() method).
      The line splitting is performed around "\t", so it silently skip lines separated by simple spaces or more than one tab (see line 325).
      Changing the single-char string "\t" with the regex "
      s+" should do the magic.

        Attachments

          Activity

            People

            • Assignee:
              snagel Sebastian Nagel
              Reporter:
              nicola.tonellotto Nicola Tonellotto

              Dates

              • Created:
                Updated:
                Resolved:

                Time Tracking

                Estimated:
                Original Estimate - 0.5h
                0.5h
                Remaining:
                Remaining Estimate - 0.5h
                0.5h
                Logged:
                Time Spent - Not Specified
                Not Specified

                  Issue deployment