HBase / HBASE-17849

PE tool random read is not totally random

Details

    • Type: New Feature
    • Status: Resolved
    • Priority: Major
    • Resolution: Fixed
    • Affects Version/s: 2.0.0
    • Fix Version/s: 2.0.0
    • Component/s: Performance, test
    • Labels: None
    • Hadoop Flags: Reviewed
    • Release Note: When randomRead and randomSeekScan are used with the PE tool, both --size and --rows can now be specified together. --size specifies the total size of the data (the range) over which the reads are performed, and --rows specifies the number of rows each client reads within that range.

    Description

      We recently used the PE tool for some bucket cache performance tests and noticed that its random read is not totally random.

      Suppose we load 200G of data using the --size param and then run randomRead with --rows=500000. We assumed the tool would pick 500000 row keys at random from across the whole 200G of data.
      Instead, the PE tool generates random row keys only from among the first 500000 rows.
      This became evident when we tried HBASE-15314 in our testing. With a 200G bucket cache split into 2 files of 100G each, randomRead with --rows=500000 always lands in the first file and never in the second. It would be better to make the PE tool's random reads truly random across the loaded data.
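
      The difference between the two behaviours can be sketched in a few lines of Java. This is only an illustration of the idea, not the PE tool's actual code: the class and method names are hypothetical, and the total row count of 200,000,000 is an assumed stand-in for "the rows loaded with --size". The sketch draws 500000 random row indexes twice, once bounded by the per-client row count (the behaviour described above) and once bounded by the total row count (the desired behaviour), and reports the highest index touched in each case.

```java
import java.util.Random;

public class RandomReadRangeSketch {

    // Draws one row index from [0, bound). Uses floorMod on a random long,
    // so there is a negligible modulo bias; fine for an illustration.
    static long nextRowIndex(Random rand, long bound) {
        return Math.floorMod(rand.nextLong(), bound);
    }

    // Performs `reads` draws from [0, keyRange) and returns the largest index seen.
    static long maxIndexTouched(Random rand, long reads, long keyRange) {
        long max = -1;
        for (long i = 0; i < reads; i++) {
            max = Math.max(max, nextRowIndex(rand, keyRange));
        }
        return max;
    }

    public static void main(String[] args) {
        long totalRows = 200_000_000L;  // assumption: rows in the loaded table
        long rowsPerClient = 500_000L;  // corresponds to --rows=500000

        // Reported behaviour: the key range equals the number of reads,
        // so every read falls within the first 500000 rows.
        long narrow = maxIndexTouched(new Random(42), rowsPerClient, rowsPerClient);

        // Desired behaviour: the same number of reads, spread over all rows.
        long wide = maxIndexTouched(new Random(42), rowsPerClient, totalRows);

        System.out.println("max index, range = rows per client: " + narrow);
        System.out.println("max index, range = total rows:      " + wide);
    }
}
```

      With the narrow range, no read can ever reach rows stored in the second half of the data, which matches the observation that the second bucket cache file is never hit.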

      Attachments

        1. HBASE-17849_2.patch
          7 kB
          ramkrishna.s.vasudevan
        2. HBASE-17849_1.patch
          6 kB
          ramkrishna.s.vasudevan
        3. HBASE-17849.patch
          6 kB
          ramkrishna.s.vasudevan
        4. HBASE-17849.patch
          6 kB
          ramkrishna.s.vasudevan

            People

              Assignee: ram_krish ramkrishna.s.vasudevan
              Reporter: ram_krish ramkrishna.s.vasudevan