Solr
  1. Solr
  2. SOLR-3473

Distributed deduplication broken

    Details

    • Type: Bug Bug
    • Status: Open
    • Priority: Major Major
    • Resolution: Unresolved
    • Affects Version/s: 4.0-ALPHA
    • Fix Version/s: 4.9, Trunk
    • Component/s: SolrCloud, update
    • Labels:
      None

      Description

      Solr's deduplication via the SignatureUpdateProcessor is broken for distributed updates on SolrCloud.

      Mark Miller:

      Looking again at the SignatureUpdateProcessor code, I think that indeed this won't currently work with distrib updates. Could you file a JIRA issue for that? The problem is that we convert update commands into solr documents - and that can cause a loss of info if an update proc modifies the update command.

      I think the reason that you see a multiple values error when you try the other order is because of the lack of a document clone (the other issue I mentioned a few emails back). Addressing that won't solve your issue though - we have to come up with a way to propagate the currently lost info on the update command.

      Please see the ML thread for the full discussion: http://lucene.472066.n3.nabble.com/SolrCloud-deduplication-td3984657.html

      1. SOLR-3473.patch
        3 kB
        Hoss Man
      2. SOLR-3473.patch
        9 kB
        Hoss Man
      3. SOLR-3473-trunk-2.patch
        9 kB
        Markus Jelsma

        Issue Links

          Activity

            People

            • Assignee:
              Unassigned
              Reporter:
              Markus Jelsma
            • Votes:
              2 Vote for this issue
              Watchers:
              8 Start watching this issue

              Dates

              • Created:
                Updated:

                Development