  1. Ignite
  2. IGNITE-10771

Print troubleshooting hint when exchange latch gets stuck


Details

    • Type: Improvement
    • Status: Resolved
    • Priority: Minor
    • Resolution: Fixed
    • Affects Version/s: 2.5
    • Fix Version/s: 2.8
    • Component/s: cache

    Description

      Sometimes users face a problem where the exchange latch cannot be completed:

      2018-12-12 07:07:57:563 [exchange-worker-#42] WARN 
      o.a.i.i.p.c.d.d.p.GridDhtPartitionsExchangeFuture:488 - Unable to await 
      partitions release latch within timeout: ClientLatch 
      [coordinator=ZookeeperClusterNode [id=6b9fc6e4-5b6a-4a98-be4d-6bc1aa5c014c, 
      addrs=[172.17.0.1, 10.0.230.117, 0:0:0:0:0:0:0:1%lo, 127.0.0.1], order=3, 
      loc=false, client=false], ackSent=true, super=CompletableLatch [id=exchange, 
      topVer=AffinityTopologyVersion [topVer=45, minorTopVer=1]]] 
      

      This may indicate that some node in the cluster cannot finish partitions release (that is, finish all ongoing operations started on the previous topology version), or that there is a silent network problem.
      We should print a hint to the log on how to troubleshoot this, to reduce the number of questions about the problem.
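
      A minimal sketch of the proposed behavior, assuming a plain java.util.concurrent.CountDownLatch in place of Ignite's exchange latch. The class LatchTimeoutHint, its methods, and the hint wording are hypothetical illustrations; they are not taken from the Ignite code base or from the actual fix.

```java
import java.util.concurrent.CountDownLatch;
import java.util.concurrent.TimeUnit;

class LatchTimeoutHint {
    /** Builds the troubleshooting hint. The wording is hypothetical. */
    static String hint(String latchId, long waitedMs) {
        return "Unable to await latch '" + latchId + "' within " + waitedMs + " ms. "
            + "Possible causes: a node has not finished operations started on the previous "
            + "topology version, or there is a silent network problem. "
            + "Consider taking thread dumps on all nodes and checking connectivity to the coordinator.";
    }

    /**
     * Waits for the latch in bounded steps. After every timed-out step a hint is
     * appended to the given log buffer. Returns true if the latch completed.
     */
    static boolean awaitWithHint(CountDownLatch latch, String latchId,
                                 long timeoutMs, int maxAttempts, StringBuilder log) {
        long waitedMs = 0;
        try {
            for (int i = 0; i < maxAttempts; i++) {
                if (latch.await(timeoutMs, TimeUnit.MILLISECONDS))
                    return true;
                waitedMs += timeoutMs;
                log.append(hint(latchId, waitedMs)).append('\n');
            }
        } catch (InterruptedException e) {
            // Restore the interrupt flag and report the latch as not completed.
            Thread.currentThread().interrupt();
        }
        return false;
    }

    public static void main(String[] args) {
        CountDownLatch stuck = new CountDownLatch(1); // never counted down
        StringBuilder log = new StringBuilder();
        boolean completed = awaitWithHint(stuck, "exchange", 100, 2, log);
        System.out.println("completed=" + completed);
        System.out.print(log);
    }
}
```

      The hint is repeated on every timed-out step, so a latch that stays stuck keeps surfacing the advice in the log rather than printing it only once.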

            People

              Assignee: Pavel Kovalenko (jokser)
              Reporter: Pavel Kovalenko (jokser)
              Votes: 0
              Watchers: 2

              Dates

                Created:
                Updated:
                Resolved:

                Time Tracking

                  Estimated: Not Specified
                  Remaining: 0h
                  Logged: 20m