  1. HBase
  2. HBASE-10937

How to run CsvBulkLoadTool of Phoenix 4.0

Details

    • Type: Bug
    • Status: Closed
    • Priority: Major
    • Resolution: Invalid
    • Affects Version/s: None
    • Fix Version/s: None
    • Component/s: documentation
    • Labels: None

    Description

      There is a known issue when running MapReduce jobs against HBase 0.96 and later. See the section “Notice to Mapreduce users of HBase 0.96.1 and above” in the HBase book: https://hbase.apache.org/book.html

      Basically, hbase-protocol*.jar must appear on the classpath before Hadoop loads the protobuf-java jar. I updated our documentation at http://phoenix.incubator.apache.org/bulk_dataload.html to show how to run CsvBulkLoadTool for Phoenix 4.0, as follows:

      HADOOP_CLASSPATH=$(hbase mapredcp):/path/to/hbase/conf hadoop jar phoenix-4.0.0-incubating-client.jar org.apache.phoenix.mapreduce.CsvBulkLoadTool --table EXAMPLE --input /data/example.csv
      

      OR

      HADOOP_CLASSPATH=/path/to/hbase-protocol.jar:/path/to/hbase/conf hadoop jar phoenix-4.0.0-incubating-client.jar org.apache.phoenix.mapreduce.CsvBulkLoadTool --table EXAMPLE --input /data/example.csv
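
      Either command depends on a well-formed HADOOP_CLASSPATH that actually contains the hbase-protocol jar. As a rough sketch only, the two shell helpers below show one way to assemble and sanity-check that value before launching the job. The names join_classpath and has_hbase_protocol are hypothetical and are not part of HBase, Hadoop, or Phoenix; the paths are the same placeholders used above.

```shell
#!/bin/sh
# Hypothetical helpers for illustration; not shipped with HBase, Hadoop or Phoenix.

# join_classpath: join all non-empty arguments with ':'.
# Empty arguments are skipped, so a missing piece never produces a stray '::'.
# (It does not clean up '::' that is already inside a single argument.)
join_classpath() {
    result=""
    for entry in "$@"; do
        [ -n "$entry" ] || continue
        if [ -z "$result" ]; then
            result="$entry"
        else
            result="$result:$entry"
        fi
    done
    printf '%s\n' "$result"
}

# has_hbase_protocol: succeed if the classpath string mentions an
# hbase-protocol jar (simple substring match, assumed naming convention).
has_hbase_protocol() {
    case "$1" in
        *hbase-protocol*.jar*) return 0 ;;
        *) return 1 ;;
    esac
}

# Example usage, commented out because it needs a working HBase install:
# cp=$(join_classpath "$(hbase mapredcp)" /path/to/hbase/conf)
# has_hbase_protocol "$cp" || echo "warning: hbase-protocol jar not on classpath" >&2
# HADOOP_CLASSPATH="$cp" hadoop jar phoenix-4.0.0-incubating-client.jar \
#     org.apache.phoenix.mapreduce.CsvBulkLoadTool --table EXAMPLE --input /data/example.csv
```

      The check is only a substring match on the jar name, so it confirms presence and says nothing about the order in which Hadoop ends up loading the jars.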
      

      Thanks.

          People

            Assignee: Unassigned
            Reporter: Jeffrey Zhong (jeffreyz)