Nutch
  1. Nutch
  2. NUTCH-760

Allow field mapping from nutch to solr index

    Details

    • Type: Improvement Improvement
    • Status: Closed
    • Priority: Major Major
    • Resolution: Fixed
    • Affects Version/s: None
    • Fix Version/s: 1.1
    • Component/s: indexer
    • Labels:
      None
    • Patch Info:
      Patch Available

      Description

      I am using nutch to crawl sites and have combined it
      with solr pushing the nutch index using the solrindex command. I have
      set it up as specified on the wiki using the copyField url to id in the
      schema. Whilst this works fine it is stuff's up my inputs from other
      sources in solr (e.g. using the solr data import handler) as they have
      both id's and url's. I have patch that implements a nutch xml schema
      defining what basic nutch fields map to in your solr push.

      1. solrindex_schema.patch
        12 kB
        David Stuart
      2. solrindex_schema.patch
        12 kB
        David Stuart
      3. solrindex_schema.patch
        5 kB
        David Stuart
      4. solrindex_schema.patch
        4 kB
        David Stuart

        Activity

        No work has yet been logged on this issue.

          People

          • Assignee:
            Andrzej Bialecki
            Reporter:
            David Stuart
          • Votes:
            0 Vote for this issue
            Watchers:
            2 Start watching this issue

            Dates

            • Created:
              Updated:
              Resolved:

              Development