Uploaded image for project: 'Accumulo'
  1. Accumulo
  2. ACCUMULO-1083

add concurrency to HDFS write-ahead log

    XMLWordPrintableJSON

Details

    • Improvement
    • Status: Resolved
    • Major
    • Resolution: Won't Fix
    • None
    • None
    • tserver
    • None

    Description

      When running tablet servers on beefy nodes (lots of disks), the write-ahead log can be a serious bottleneck. Today we ran a continuous ingest test of 1.5-SNAPSHOT on an 8-node (plus a master node) cluster in which the nodes had 32 cores and 15 drives each. Running with write-ahead log off resulted in a >4x performance improvement sustained over a long period.

      I believe the culprit is that the WAL is only using one file at a time per tablet server, which means HDFS is only appending to one drive (plus replicas). If we increase the number of concurrent WAL files supported on a tablet server we could probably drastically improve the performance on systems with many disks. As it stands, I believe Accumulo is significantly more optimized for a larger number of smaller nodes (3-4 drives).

      Attachments

        1. walog-replication-factor-performance.jpg
          213 kB
          Luke Brassard
        2. walog-performance.jpg
          199 kB
          Luke Brassard

        Issue Links

          Activity

            People

              Unassigned Unassigned
              afuchs Adam Fuchs
              Votes:
              0 Vote for this issue
              Watchers:
              10 Start watching this issue

              Dates

                Created:
                Updated:
                Resolved: