Details
Description
Hadoop/Hbase/Zookeeper/pig node distribution:
hadoop nodes:
hbase nodes:
{node1=[master, regionserver]}pig nodes:
{node1, node2}zookeeper nodes:
{node1}Operate hbase table in node1 pig shell like:
test = LOAD 'hbase://table' USING org.apache.pig.backend.hadoop.hbase.HBaseStorage( 'd:sWords','-loadKey true') AS (ID: bytearray , Words:chararray );
result = FOREACH test GENERATE ID, com.pig.test(Words);
--result = FOREACH AA GENERATE com.pig.test(Words), ID;
--dump result;
store result into 'table' using org.apache.pig.backend.hadoop.hbase.HBaseStorage('d:drools_cat');
--store result into 'AA_10_categs' using org.apache.pig.backend.hadoop.hbase.HBaseStorage('d:cat');
In tasktracker node, pig can not read hbase configuration in job.xml.