  1. Accumulo
  2. ACCUMULO-4615

ThreadPool timeout when checking tserver stats is confusing

    Details

    • Type: Bug
    • Status: Resolved
    • Priority: Minor
    • Resolution: Not A Problem
    • Affects Version/s: 1.8.1
    • Fix Version/s: None
    • Component/s: master

      Description

      If gathering status from all the tablet servers takes longer than the configured timeout, the thread pool is stopped and processing continues with whatever has been collected so far. The code is at https://github.com/apache/accumulo/blob/1.8/server/master/src/main/java/org/apache/accumulo/master/Master.java#L1120; the default timeout is 6s. This does not appear to be an issue prior to 1.8.

      Best case, this was really confusing. The monitor page would show 30 tservers, then 5 tservers. I didn't see any other negative effects; neither migrations nor balancing appeared to be affected. Worst case, though, I missed something and the master is making decisions based on incomplete information.
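      To make the behavior described above concrete, here is a minimal, self-contained sketch of the pattern: submit one status fetch per server to a thread pool, wait up to a timeout, then carry on with whatever was collected. This is not the code from Master.java. The class `StatusGatherSketch`, the `StatusSource` interface, and the `gather` and `missing` helpers are hypothetical names made up for illustration, and the timeout values are arbitrary. Reporting which servers are missing is only one possible way to make a partial result less confusing.

```java
import java.util.HashMap;
import java.util.LinkedHashMap;
import java.util.Map;
import java.util.Set;
import java.util.TreeSet;
import java.util.concurrent.ConcurrentHashMap;
import java.util.concurrent.ExecutorService;
import java.util.concurrent.Executors;
import java.util.concurrent.TimeUnit;

public class StatusGatherSketch {

  /** Hypothetical stand-in for the status RPC to a single tablet server. */
  public interface StatusSource {
    String fetchStatus() throws Exception;
  }

  /** Fetches status from every server, giving up on stragglers after timeoutMs. */
  public static Map<String, String> gather(Map<String, StatusSource> servers, int threads,
      long timeoutMs) throws InterruptedException {
    Map<String, String> collected = new ConcurrentHashMap<>();
    ExecutorService pool = Executors.newFixedThreadPool(Math.max(threads, 1));
    for (Map.Entry<String, StatusSource> entry : servers.entrySet()) {
      pool.execute(() -> {
        try {
          collected.put(entry.getKey(), entry.getValue().fetchStatus());
        } catch (Exception ex) {
          // Failed or interrupted fetch: the server is simply absent from the result.
        }
      });
    }
    pool.shutdown(); // accept no new tasks, let submitted ones run
    if (!pool.awaitTermination(timeoutMs, TimeUnit.MILLISECONDS)) {
      pool.shutdownNow(); // timeout hit: interrupt running fetches, drop queued ones
    }
    return new HashMap<>(collected); // snapshot, possibly incomplete
  }

  /** Names of servers that were asked for status but did not report in time. */
  public static Set<String> missing(Map<String, StatusSource> servers,
      Map<String, String> collected) {
    Set<String> absent = new TreeSet<>(servers.keySet());
    absent.removeAll(collected.keySet());
    return absent;
  }

  public static void main(String[] args) throws InterruptedException {
    Map<String, StatusSource> servers = new LinkedHashMap<>();
    servers.put("tserver1", () -> "ok");
    servers.put("tserver2", () -> "ok");
    servers.put("tserver3", () -> {
      Thread.sleep(5000); // simulated slow server
      return "late";
    });

    Map<String, String> collected = gather(servers, 4, 300);
    System.out.println("collected " + collected.size() + " of " + servers.size());
    System.out.println("missing: " + missing(servers, collected));
  }
}
```

      In this sketch the caller can tell a partial result from a complete one by checking `missing(...)`, so a drop from 30 tservers to 5 could be logged as a timeout rather than looking like servers disappearing.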

      Dave Marion please add more info if needed.

              People

              • Assignee:
                jschmidt10 Jeff Schmidt
                Reporter:
                mjwall Michael Wall
              • Votes:
                1
                Watchers:
                5

                Dates

                • Created:
                  Updated:
                  Resolved:

                  Time Tracking

                  Estimated:
                  Not Specified
                  Remaining:
                  0h
                  Logged:
                  7h 20m