NUTCH-732

Subcollection plugin not working on Nutch-1.0

Details

    • Type: Bug
    • Status: Closed
    • Priority: Critical
    • Resolution: Fixed
    • Affects Version: 1.0.0
    • Fix Version: 1.1
    • Component: indexer
    • Labels: None
    • Environment: Mac OS X 10.5 intel

    Description

      I am trying to get subcollections working with Nutch-1.0.
      I configured subcollections.xml and then added the plugin in nutch-site.xml.
      When indexing finished, I opened the index in Luke (the Lucene index browser) to check that it was working properly.
      The subcollection field is populated as it should be, but searching for any subcollection on Luke's Search tab returns no results.
      If I search on the url field, I can see that every record has a subcollection associated with it, yet I can't search on the subcollection field itself.
      Search examples in Luke:
      subcollection:sub1 -> no results
      url:sub1 -> results with field subcollection populated -> sub1

      Same results using:
      ./bin/nutch org.apache.nutch.searcher.NutchBean "subcollection:sub1 sub"

      If I use "explain", the subcollection field is there with the correct word.

      This makes no sense, so I believe it's a bug.
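
      For reference, a setup like the one described above might look roughly like the sketch below. The collection name sub1 comes from the search examples; the whitelist URL and the exact plugin.includes value are assumptions for illustration and are not taken from the reporter's actual configuration.

      ```xml
      <!-- conf/subcollections.xml: one collection named sub1.
           The whitelist URL is a hypothetical placeholder. -->
      <subcollections>
        <subcollection>
          <name>sub1</name>
          <id>sub1</id>
          <whitelist>http://www.example.com/sub1/</whitelist>
          <blacklist />
        </subcollection>
      </subcollections>

      <!-- conf/nutch-site.xml: enable the plugin by adding "subcollection"
           to plugin.includes. The rest of the value is assumed to be close
           to the stock Nutch 1.0 default; adjust it to your own install. -->
      <property>
        <name>plugin.includes</name>
        <value>protocol-http|urlfilter-regex|parse-(text|html|js)|index-(basic|anchor)|query-(basic|site|url)|response-(json|xml)|summary-basic|scoring-opic|urlnormalizer-(pass|regex|basic)|subcollection</value>
      </property>
      ```

      With a configuration along these lines, each indexed document whose URL matches the whitelist would be expected to carry subcollection = sub1, which is the field the queries above fail to match.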

      Attachments

        1. sub.patch
          2 kB
          Andrzej Bialecki

          People

            Assignee: ab Andrzej Bialecki
            Reporter: fantunes Filipe Antunes