Uploaded image for project: 'Solr'
  1. Solr
  2. SOLR-7128

Two phase distributed search is fetching extra fields in GET_TOP_IDS phase

    XMLWordPrintableJSON

Details

    • Bug
    • Status: Closed
    • Critical
    • Resolution: Fixed
    • 4.10.2, 4.10.3
    • 4.10.4, 5.1, 6.0
    • search
    • None

    Description

      pqueixalos reported this to me privately so I am creating this issue on his behalf.

      We found an issue in versions 4.10.+ (4.10.2 and 4.10.3 for sure).

      When processing a two phase distributed query with an explicit fl parameter, the two phases are well processed, but the GET_TOP_IDS retrieves the matching documents fields, even if a GET_FIELDS shard request is getting executed just after.

      /solr/someCollectionCore?collection=someOtherCollection&q=:&debug=true&fl=id,title
      => id is retrieved during GET_TOP_IDS phase that's ok:: it's our uniqueKeyField
      => title is also retrieved during GET_TOP_IDS phase, that's not ok.

      I'm able to reproduce this. This is pretty bad performance bug that was introduced in SOLR-5768 or it's subsequent related issues. I plan to fix this bug and add substantial tests to assert such things.

      Attachments

        1. SOLR-7128.patch
          19 kB
          Shalin Shekhar Mangar
        2. SOLR-7128.patch
          19 kB
          Shalin Shekhar Mangar
        3. SOLR-7128.patch
          18 kB
          Shalin Shekhar Mangar
        4. SOLR-7128.patch
          17 kB
          Shalin Shekhar Mangar
        5. SOLR-7128-addendum.patch
          7 kB
          Shalin Shekhar Mangar

        Issue Links

          Activity

            People

              shalin Shalin Shekhar Mangar
              shalin Shalin Shekhar Mangar
              Votes:
              0 Vote for this issue
              Watchers:
              6 Start watching this issue

              Dates

                Created:
                Updated:
                Resolved: