Uploaded image for project: 'Hadoop HDFS'
  1. Hadoop HDFS
  2. HDFS-10743

MiniDFSCluster test runtimes can be drastically reduce

    XMLWordPrintableJSON

Details

    • Improvement
    • Status: Patch Available
    • Major
    • Resolution: Unresolved
    • 2.0.0-alpha
    • None
    • hdfs
    • None

    Description

      MiniDFSCluster tests have excessive runtimes. The main problem appears to be the heartbeat interval. The NN may have to wait up to 3s (default value) for all DNs to heartbeat, triggering registration, so NN can go active. Tests that repeatedly restart the NN are severely affected.

      Example for varying heartbeat intervals for TestFSImageWithAcl:

      • 3s = ~70s – (disgusting, why I investigated)
      • 1s = ~27s
      • 500ms = ~17s – (had to hack DNConf for millisecond precision)

      That a 4x improvement in runtime.

      17s is still excessively long for what the test does. Further areas to explore when running tests:

      • Reduce numerous sleeps intervals in DN's BPServiceActor.
      • Ensure heartbeats and initial BR are sent immediately upon (re)registration.

      Attachments

        1. HDFS-10743.001.patch
          3 kB
          Yiqun Lin
        2. HDFS-10743.002.patch
          3 kB
          Kuhu Shukla
        3. HDFS-10743.003.patch
          18 kB
          Yiqun Lin

        Activity

          People

            kshukla Kuhu Shukla
            daryn Daryn Sharp
            Votes:
            1 Vote for this issue
            Watchers:
            17 Start watching this issue

            Dates

              Created:
              Updated: