Nutch / NUTCH-2130

copyField rawcontent creates error within schema.xml

Details

    • Type: Bug
    • Status: Closed
    • Priority: Major
    • Resolution: Fixed
    • Affects Version/s: 2.3.1
    • Fix Version/s: 2.3.1
    • Component/s: None
    • Labels: None

    Description

      The rawcontent copyField in the Nutch Solr schema.xml causes errors when users attempt to index NutchDocuments into Solr.
      The rawcontent field is produced by the index-html plugin. However, in committing this feature we forgot to add the field definition to schema.xml before applying the copyField instruction.

      There are two ways to resolve this:

      • remove rawcontent from copyField, or
      • add rawcontent as a field prior to its copyField definition.

      I propose to do the latter and will submit a patch ASAP unless someone else is able to do so.
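
      As a rough sketch of the second option, the schema.xml fragment below declares rawcontent as a field so that the copyField instruction refers to a field Solr knows about. The field type, the stored/indexed attributes, and the copyField destination ("text") are assumptions made for illustration; they are not taken from the attached patch.

      ```xml
      <!-- Assumed field definition; type and attributes are illustrative only -->
      <field name="rawcontent" type="string" stored="true" indexed="false"/>

      <!-- copyField referring to the field declared above;
           the destination "text" is a hypothetical example -->
      <copyField source="rawcontent" dest="text"/>
      ```

      With the first option, the copyField line would simply be deleted and no field definition would be needed.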

      This was explained on this thread

      Attachments

        1. NUTCH-2130.patch
          1 kB
          Sebastian Nagel

          People

            Assignee: Lewis John McGibbney (lewismc)
            Reporter: Lewis John McGibbney (lewismc)
            Votes: 0
            Watchers: 3