Uploaded image for project: 'Accumulo'
  1. Accumulo
  2. ACCUMULO-551

Experiment with multi-node batch writer

    XMLWordPrintableJSON

Details

    • Task
    • Status: Resolved
    • Major
    • Resolution: Abandoned
    • None
    • None
    • None
    • None

    Description

      Accumulo has a batch writer that batches mutations by tablet server for writes. This works well until there are alot of tablet servers being written to at which point only a small amount of data is being sent to each tablet server. Would it be better for the client to batch writes for multiple tablet servers and send them to one server which writes directly to the tablet servers?

      One possible way to do this is to :

      • batch mutations by rack on the client
      • send all of those mutations to one random tablet server on the rack
      • have the random tablet server write to the other servers on the rack

      This cuts down on the number of direct connections the client has to make. Could have the following benefits.

      • Tablet servers can keep connections open to other tablet servers.
      • A write pipeline

      Would be interesting to run some test and see how well this works.

      Attachments

        Issue Links

          Activity

            People

              kturner Keith Turner
              kturner Keith Turner
              Votes:
              0 Vote for this issue
              Watchers:
              2 Start watching this issue

              Dates

                Created:
                Updated:
                Resolved:

                Time Tracking

                  Estimated:
                  Original Estimate - Not Specified
                  Not Specified
                  Remaining:
                  Remaining Estimate - 0h
                  0h
                  Logged:
                  Time Spent - 40m
                  40m