Uploaded image for project: 'HBase'
  1. HBase
  2. HBASE-1963

Output to multiple tables from Hadoop MR without use of HTable

    XMLWordPrintableJSON

Details

    • New Feature
    • Status: Closed
    • Minor
    • Resolution: Fixed
    • 0.20.1
    • 0.90.0
    • mapreduce
    • None
    • Reviewed
    • MultiTableOutputFormat allows output from a map/reduce job to be written to multiple tables. An example illustrates use for creating secondary indexes from an existing table.

    Description

      o.a.h.h.mapreduce.TableOutputFormat allows writing to a single table as output from a map/reduce job in the natural way. It requires that the user specify the table name ahead of time and can only write to one table. I had a need to write to multiple tables from the same job (write my data to one table, and also write to index tables), and I wanted to have a consistent API whether writing to one or many tables.

      Attached MultiTableOutputFormat takes the table name as the key and the Put or Delete as the value. Also included is an example demonstrating the usage.

      Attachments

        1. HBASE-1963.patch
          16 kB
          Kevin Peterson
        2. HBASE-1963-revised.patch
          16 kB
          Kevin Peterson

        Activity

          People

            kevinpet Kevin Peterson
            kevinpet Kevin Peterson
            Votes:
            0 Vote for this issue
            Watchers:
            3 Start watching this issue

            Dates

              Created:
              Updated:
              Resolved: