Uploaded image for project: 'Nutch'
  1. Nutch
  2. NUTCH-760

Allow field mapping from nutch to solr index

    XMLWordPrintableJSON

Details

    • Improvement
    • Status: Closed
    • Major
    • Resolution: Fixed
    • None
    • 1.1
    • indexer
    • None
    • Patch Available

    Description

      I am using nutch to crawl sites and have combined it
      with solr pushing the nutch index using the solrindex command. I have
      set it up as specified on the wiki using the copyField url to id in the
      schema. Whilst this works fine it is stuff's up my inputs from other
      sources in solr (e.g. using the solr data import handler) as they have
      both id's and url's. I have patch that implements a nutch xml schema
      defining what basic nutch fields map to in your solr push.

      Attachments

        1. solrindex_schema.patch
          4 kB
          David Stuart
        2. solrindex_schema.patch
          5 kB
          David Stuart
        3. solrindex_schema.patch
          12 kB
          David Stuart
        4. solrindex_schema.patch
          12 kB
          David Stuart

        Activity

          People

            ab Andrzej Bialecki
            dstuart David Stuart
            Votes:
            0 Vote for this issue
            Watchers:
            2 Start watching this issue

            Dates

              Created:
              Updated:
              Resolved: