Uploaded image for project: 'Solr'
  1. Solr
  2. SOLR-6259

Performance issue with large number of fields and values when using copyFields

    XMLWordPrintableJSON

Details

    • Bug
    • Status: Resolved
    • Critical
    • Resolution: Fixed
    • 4.8.1
    • 4.10, 6.0
    • None
    • None

    Description

      When you have schema with a large enough number of fields (in my case around 250 fields) and you use copyFields to populate a number of fields (very few in my case 3-4) you see a severe degradation in the performance of ingestion.

      Tracking this down using a profiler found that in the lucene Document.getField() was using 87% of all CPU time. As it turns out getField() does an iteration over the list of fields in the Document returning the field if the name matches.. Anyway in the case of copyFields with lots of values getField() gets called alot...

      Attachments

        1. SOLR-6259.patch
          5 kB
          Steven Bower

        Issue Links

          Activity

            People

              shalin Shalin Shekhar Mangar
              sbower Steven Bower
              Votes:
              0 Vote for this issue
              Watchers:
              4 Start watching this issue

              Dates

                Created:
                Updated:
                Resolved: