Uploaded image for project: 'Nutch'
  1. Nutch
  2. NUTCH-2172

index-more: document format of contenttype-mapping.txt

    XMLWordPrintableJSON

Details

    • Improvement
    • Status: Closed
    • Minor
    • Resolution: Fixed
    • 1.10
    • 1.12
    • indexer, metadata
    • Macosx, Java 8

    Description

      The index-more plugin uses the conf/contenttype-mapping.txt file to build up the mimeMap hash table (in the readConfiguration() method).
      The line splitting is performed around "\t", so it silently skip lines separated by simple spaces or more than one tab (see line 325).
      Changing the single-char string "\t" with the regex "
      s+" should do the magic.

      Attachments

        1. NUTCH-2172-1.patch
          1.0 kB
          Sebastian Nagel
        2. NUTCH-2172-2.patch
          2 kB
          Sebastian Nagel

        Activity

          People

            snagel Sebastian Nagel
            nicola.tonellotto Nicola Tonellotto
            Votes:
            0 Vote for this issue
            Watchers:
            5 Start watching this issue

            Dates

              Created:
              Updated:
              Resolved:

              Time Tracking

                Estimated:
                Original Estimate - 0.5h
                0.5h
                Remaining:
                Remaining Estimate - 0.5h
                0.5h
                Logged:
                Time Spent - Not Specified
                Not Specified