Pig / PIG-2753

In distributed MapReduce mode, Pig cannot get the correct HBase configuration


Details

    • Type: Bug
    • Status: Resolved
    • Priority: Major
    • Resolution: Duplicate
    • Affects Version/s: 0.9.1
    • Fix Version/s: 0.10.0
    • Component/s: piggybank, site
    • Labels: None
    • Environment: OS: Red Hat Enterprise Linux Server release 5.5 (Tikanga)

    Description

      Hadoop/HBase/ZooKeeper/Pig node distribution:

      Hadoop nodes:

      {node1=[namenode, secondarynamenode, jobtracker], node2=[datanode, tasktracker]}

      HBase nodes:

      {node1=[master, regionserver]}

      Pig nodes:

      {node1, node2}

      ZooKeeper nodes:

      {node1}

      Operate on an HBase table from the Pig shell on node1, for example:

      -- Load the row key and column d:sWords from the HBase table.
      test = LOAD 'hbase://table' USING org.apache.pig.backend.hadoop.hbase.HBaseStorage('d:sWords', '-loadKey true') AS (ID:bytearray, Words:chararray);
      -- Apply the UDF com.pig.test to each Words value.
      result = FOREACH test GENERATE ID, com.pig.test(Words);
      --result = FOREACH AA GENERATE com.pig.test(Words), ID;
      --dump result;

      -- Write the result back to HBase; the first field (ID) is used as the row key.
      store result into 'table' using org.apache.pig.backend.hadoop.hbase.HBaseStorage('d:drools_cat');
      --store result into 'AA_10_categs' using org.apache.pig.backend.hadoop.hbase.HBaseStorage('d:cat');

      On the TaskTracker node (node2), Pig cannot read the HBase configuration from job.xml.
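
One way to check whether any HBase settings reached job.xml is to list the properties whose names start with `hbase.` or `zookeeper.`. The following is only a diagnostic sketch and is not part of the attached patch. It assumes job.xml uses the standard Hadoop `<configuration>/<property>/<name>/<value>` layout; the helper name `hbase_properties`, the sample XML, and the port in it are hypothetical.

```python
import xml.etree.ElementTree as ET


def hbase_properties(job_xml_text, prefixes=("hbase.", "zookeeper.")):
    """Return {name: value} for job.xml properties whose name starts
    with one of the given prefixes."""
    root = ET.fromstring(job_xml_text)
    found = {}
    for prop in root.iter("property"):
        name = prop.findtext("name", default="").strip()
        if name.startswith(prefixes):
            found[name] = prop.findtext("value", default="").strip()
    return found


# Hypothetical job.xml content, for illustration only.
SAMPLE_JOB_XML = """<?xml version="1.0"?>
<configuration>
  <property><name>mapred.job.tracker</name><value>node1:9001</value></property>
  <property><name>hbase.zookeeper.quorum</name><value>node1</value></property>
</configuration>
"""

if __name__ == "__main__":
    props = hbase_properties(SAMPLE_JOB_XML)
    if props:
        for name, value in sorted(props.items()):
            print(f"{name} = {value}")
    else:
        print("no hbase.* or zookeeper.* properties found in job.xml")
```

To inspect a real job, read the job.xml file from the TaskTracker's local job directory and pass its text to `hbase_properties`. An empty result would be consistent with the behaviour described above.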

      Attachments

        1. 2753.patch (3 kB, fang fang chen)


          People

            Assignee: fang fang chen
            Reporter: fang fang chen
            Votes: 0
            Watchers: 3

            Dates

              Created:
              Updated:
              Resolved: