HBase / HBASE-1963

Output to multiple tables from Hadoop MR without use of HTable


Details

    • Type: New Feature
    • Status: Closed
    • Priority: Minor
    • Resolution: Fixed
    • Affects Version/s: 0.20.1
    • Fix Version/s: 0.90.0
    • Component/s: mapreduce
    • Labels: None
    • Hadoop Flags: Reviewed
    • Release Note: MultiTableOutputFormat allows output from a map/reduce job to be written to multiple tables. An example illustrates its use for creating secondary indexes from an existing table.

    Description

      org.apache.hadoop.hbase.mapreduce.TableOutputFormat writes the output of a map/reduce job to a single table. The user must specify the table name ahead of time, and the job can write to that one table only. I needed to write to multiple tables from the same job (my data to one table, plus entries to index tables), and I wanted a consistent API whether writing to one table or many.

      The attached MultiTableOutputFormat takes the table name as the key and the Put or Delete as the value. An example demonstrating its usage is also included.
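      The key/value contract described above can be illustrated with a small stand-alone model. This is only a sketch under stated assumptions: it uses no HBase classes, and every name in it (MultiTableRoutingSketch, Mutation, MultiTableWriter, the tables "people" and "people_by_name") is hypothetical. It shows the routing idea only: the key of each output record names the target table, and the value is the put or delete to apply there.

```java
import java.util.ArrayList;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;

/** Stand-in model, NOT the HBase API: routes each write to the table named by its key. */
class MultiTableRoutingSketch {

    /** Hypothetical stand-in for an HBase Put or Delete. */
    static final class Mutation {
        enum Kind { PUT, DELETE }

        final Kind kind;
        final String row;

        Mutation(Kind kind, String row) {
            this.kind = kind;
            this.row = row;
        }

        @Override
        public String toString() {
            return kind + "(" + row + ")";
        }
    }

    /** Hypothetical writer: keeps one buffer per table, created on first use. */
    static final class MultiTableWriter {
        private final Map<String, List<Mutation>> tables = new LinkedHashMap<>();

        /** The key selects the table; the value is the mutation to apply. */
        void write(String tableName, Mutation mutation) {
            tables.computeIfAbsent(tableName, k -> new ArrayList<>()).add(mutation);
        }

        Map<String, List<Mutation>> tables() {
            return tables;
        }
    }

    /** Emits writes to a data table and an index table from the same "job". */
    static MultiTableWriter demo() {
        MultiTableWriter out = new MultiTableWriter();
        out.write("people", new Mutation(Mutation.Kind.PUT, "row1"));
        out.write("people_by_name", new Mutation(Mutation.Kind.PUT, "smith"));
        out.write("people", new Mutation(Mutation.Kind.DELETE, "row2"));
        return out;
    }

    public static void main(String[] args) {
        // Print each table followed by the mutations routed to it.
        demo().tables().forEach((table, mutations) ->
                System.out.println(table + " -> " + mutations));
    }
}
```

      Because the table is chosen per record rather than per job, the same write call serves both the one-table and the many-table case, which is the consistency the description asks for.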

          People

            Assignee: Kevin Peterson (kevinpet)
            Reporter: Kevin Peterson (kevinpet)
            Votes: 0
            Watchers: 3
