Uploaded image for project: 'Nutch'
  1. Nutch
  2. NUTCH-732

Subcollection plugin not working on Nutch-1.0

    Details

    • Type: Bug
    • Status: Closed
    • Priority: Critical
    • Resolution: Fixed
    • Affects Version/s: 1.0.0
    • Fix Version/s: 1.1
    • Component/s: indexer
    • Labels:
      None
    • Environment:

      Mac OS X 10.5 intel

      Description

      I am trying to get subcollections working, using Nutch-1.0 !
      I configured subcolections.xml then I added the plugin on nutch-site.xml.
      When the index finishes, I opened lucene luke to check if the database was working properly.
      The field subcollection is populated as it should, but searching for any subcollection, on the search tab of luke, returns no results.
      If I do a search on the url field, I can see that every record has a subcollection associated, yet i can't search for using the subcollection field.
      search examples on luke:
      subcollection:sub1 -> no results
      url:sub1 -> results with field subcollection populated -> sub1

      Same results using:
      ./bin/nutch org.apache.nutch.searcher.NutchBean "subcollection:sub1 sub"

      If i use the "explain", subcollection field is there with the correct word.

      It makes no sense so i beleive it's a bug.

        Attachments

        1. sub.patch
          2 kB
          Andrzej Bialecki

          Activity

            People

            • Assignee:
              ab Andrzej Bialecki
              Reporter:
              fantunes Filipe Antunes
            • Votes:
              1 Vote for this issue
              Watchers:
              0 Start watching this issue

              Dates

              • Created:
                Updated:
                Resolved: