  1. HBase
  2. HBASE-2302

Optimize M-R by bulk-excluding regions: fewer InputSplits, to avoid traffic on region servers when performing M-R on a subset of the table


Details

    • Type: Improvement
    • Status: Closed
    • Priority: Minor
    • Resolution: Fixed
    • Affects Version/s: 0.20.3
    • Fix Version/s: 0.90.0
    • Component/s: None
    • Labels: None
    • Hadoop Flags: Reviewed

    Description

      TableInputFormatBase creates one InputSplit per region. Because the keys are sorted, it is sometimes necessary to perform M-R on only a subset of the keyset (regions). A provision to filter the regions when generating InputSplits would be useful.

      Exclusion is at per-region granularity. A RowFilter might still be needed during a Scan on an individual region, but that is a separate issue altogether.

      Methodology: Add a way to prune the keyset before generating InputSplits. By default, every region is included; this can be overridden as necessary, depending on the higher-level logic.
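
The methodology above can be sketched roughly as follows. This is a minimal, self-contained illustration and not the attached patch: it does not use the HBase API, and every name in it (RegionRange, SplitPlanner, KeyRangeSplitPlanner, includeRegionInSplit) is a hypothetical stand-in chosen for this example. It assumes a region is a half-open key range [startKey, endKey), with an empty endKey meaning "end of table".

```java
import java.nio.charset.StandardCharsets;
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;

// Hypothetical stand-in for a region: [startKey, endKey); empty endKey = end of table.
final class RegionRange {
    final byte[] startKey;
    final byte[] endKey;

    RegionRange(String start, String end) {
        this.startKey = start.getBytes(StandardCharsets.UTF_8);
        this.endKey = end.getBytes(StandardCharsets.UTF_8);
    }
}

// Hypothetical planner: one split per region, minus regions rejected by the hook.
class SplitPlanner {
    // Overridable hook. The default includes every region.
    protected boolean includeRegionInSplit(byte[] startKey, byte[] endKey) {
        return true;
    }

    final List<RegionRange> planSplits(List<RegionRange> regions) {
        List<RegionRange> splits = new ArrayList<>();
        for (RegionRange region : regions) {
            if (includeRegionInSplit(region.startKey, region.endKey)) {
                splits.add(region);
            }
        }
        return splits;
    }

    // Lexicographic comparison treating bytes as unsigned values.
    static int compareKeys(byte[] a, byte[] b) {
        int n = Math.min(a.length, b.length);
        for (int i = 0; i < n; i++) {
            int diff = (a[i] & 0xff) - (b[i] & 0xff);
            if (diff != 0) {
                return diff;
            }
        }
        return a.length - b.length;
    }
}

// Example override: keep only regions that overlap [lo, hi). Both bounds must be non-empty.
class KeyRangeSplitPlanner extends SplitPlanner {
    private final byte[] lo;
    private final byte[] hi;

    KeyRangeSplitPlanner(String lo, String hi) {
        this.lo = lo.getBytes(StandardCharsets.UTF_8);
        this.hi = hi.getBytes(StandardCharsets.UTF_8);
    }

    @Override
    protected boolean includeRegionInSplit(byte[] startKey, byte[] endKey) {
        boolean startsBeforeHi = compareKeys(startKey, hi) < 0;
        boolean endsAfterLo = endKey.length == 0 || compareKeys(endKey, lo) > 0;
        return startsBeforeHi && endsAfterLo;
    }
}

class SplitPlannerDemo {
    public static void main(String[] args) {
        List<RegionRange> regions = Arrays.asList(
            new RegionRange("", "g"), new RegionRange("g", "n"),
            new RegionRange("n", "t"), new RegionRange("t", ""));
        System.out.println(new SplitPlanner().planSplits(regions).size());
        System.out.println(new KeyRangeSplitPlanner("h", "p").planSplits(regions).size());
    }
}
```

Because the hook only sees region boundaries, a region that partially overlaps the wanted range is still included whole, which is why a row-level filter on the Scan may still be needed.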

      Attachments

        1. HBASE-2302.patch
          3 kB
          Karthik K
        2. HBASE-2302.patch
          2 kB
          Karthik K


          People

            Assignee: Unassigned
            Reporter: Karthik K (kaykay.unique)
