Uploaded image for project: 'Accumulo'
  1. Accumulo
  2. ACCUMULO-2645

tablet stuck unloading, and problem is hard to diagnose

    Details

    • Type: Bug
    • Status: Resolved
    • Priority: Major
    • Resolution: Fixed
    • Affects Version/s: 1.4.4
    • Fix Version/s: 1.6.1, 1.7.0
    • Component/s: tserver
    • Labels:
    • Environment:

      very large production cluster, CDH3u5

      Description

      • master failed to balance
      • custom balancer refused to balance while migrations were in place
      • tablet server was not unloading the tablet
      • tablet server was otherwise serving tablets, providing status
      • memory dump determined that there were 21K UnloadTabletHandler objects
      • jstack showed UnloadTabletHandler in Tablet.completeClose, line 2674
      • the last print of the debug "completeClose(safeState=true, completeClose=true) occured 9 days ago
      • there was a query that had been running for 9 days

        Attachments

          Issue Links

            Activity

              People

              • Assignee:
                ecn Eric Newton
                Reporter:
                ecn Eric Newton
              • Votes:
                0 Vote for this issue
                Watchers:
                3 Start watching this issue

                Dates

                • Created:
                  Updated:
                  Resolved:

                  Time Tracking

                  Estimated:
                  Original Estimate - Not Specified
                  Not Specified
                  Remaining:
                  Remaining Estimate - 0h
                  0h
                  Logged:
                  Time Spent - 50m
                  50m