Uploaded image for project: 'Pig'
  1. Pig
  2. PIG-3817

HDFS globing feature is not working for HiveColumnarLoader

    XMLWordPrintableJSON

Details

    • New Feature
    • Status: Open
    • Major
    • Resolution: Unresolved
    • 0.10.0
    • None
    • piggybank
    • None
    • HiveColumnarLoader

    Description

      HDFS contains the following RC files:
      /data/content_1.dat
      /data/content_2.dat
      /data/lala.txt

      a = load '/data/*.dat' USING org.apache.pig.piggybank.storage.HiveColumnarLoader('device_id string,oui string,product_class string,serial_number string,parameter_name string,parameter_value string,parameter_type string,collection_time bigint,custom_attr1 string,custom_attr2 string,custom_attr3 string,custom_attr4 string');

      Pig Stack Trace
      ---------------
      ERROR 1200: java.io.FileNotFoundException: File /data/*.dat does not exist.

      org.apache.pig.impl.logicalLayer.FrontendException: ERROR 1000: Error during parsing. java.io.FileNotFoundException: File /data/*.dat does not exist.
      at org.apache.pig.PigServer$Graph.parseQuery(PigServer.java:1608)
      at org.apache.pig.PigServer$Graph.registerQuery(PigServer.java:1547)
      at org.apache.pig.PigServer.registerQuery(PigServer.java:518)
      at org.apache.pig.tools.grunt.GruntParser.processPig(GruntParser.java:991)
      at org.apache.pig.tools.pigscript.parser.PigScriptParser.parse(PigScriptParser.java:412)
      at org.apache.pig.tools.grunt.GruntParser.parseStopOnError(GruntParser.java:194)
      at org.apache.pig.tools.grunt.GruntParser.parseStopOnError(GruntParser.java:170)
      at org.apache.pig.tools.grunt.Grunt.exec(Grunt.java:84)
      at org.apache.pig.Main.run(Main.java:475)
      at org.apache.pig.PigRunner.run(PigRunner.java:49)
      at org.apache.oozie.action.hadoop.PigMain.runPigJob(PigMain.java:283)
      at org.apache.oozie.action.hadoop.PigMain.run(PigMain.java:223)
      at org.apache.oozie.action.hadoop.LauncherMain.run(LauncherMain.java:37)
      at org.apache.oozie.action.hadoop.PigMain.main(PigMain.java:76)
      at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
      at sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:57)
      at sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43)
      at java.lang.reflect.Method.invoke(Method.java:616)
      at org.apache.oozie.action.hadoop.LauncherMapper.map(LauncherMapper.java:495)
      at org.apache.hadoop.mapred.MapRunner.run(MapRunner.java:50)
      at org.apache.hadoop.mapred.MapTask.runOldMapper(MapTask.java:417)
      at org.apache.hadoop.mapred.MapTask.run(MapTask.java:332)
      at org.apache.hadoop.mapred.Child$4.run(Child.java:268)
      at java.security.AccessController.doPrivileged(Native Method)
      at javax.security.auth.Subject.doAs(Subject.java:416)
      at org.apache.hadoop.security.UserGroupInformation.doAs(UserGroupInformation.java:1408)
      at org.apache.hadoop.mapred.Child.main(Child.java:262)
      Caused by: Failed to parse: java.io.FileNotFoundException: File /data/*.dat does not exist.
      at org.apache.pig.parser.QueryParserDriver.parse(QueryParserDriver.java:193)
      at org.apache.pig.PigServer$Graph.parseQuery(PigServer.java:1600)
      ... 26 more
      Caused by: java.lang.RuntimeException: java.io.FileNotFoundException: File /data/*.dat does not exist.
      at org.apache.hadoop.hdfs.DistributedFileSystem.listStatus(DistributedFileSystem.java:410)
      at org.apache.pig.piggybank.storage.partition.PathPartitioner.getPartitionKeys(PathPartitioner.java:105)
      at org.apache.pig.piggybank.storage.partition.PathPartitionHelper.getPartitionKeys(PathPartitionHelper.java:101)
      at org.apache.pig.piggybank.storage.HiveColumnarLoader.getPartitionColumns(HiveColumnarLoader.java:577)
      at org.apache.pig.piggybank.storage.HiveColumnarLoader.getSchema(HiveColumnarLoader.java:647)
      at org.apache.pig.newplan.logical.relational.LOLoad.getSchemaFromMetaData(LOLoad.java:174)
      at org.apache.pig.newplan.logical.relational.LOLoad.<init>(LOLoad.java:88)
      at org.apache.pig.parser.LogicalPlanBuilder.buildLoadOp(LogicalPlanBuilder.java:839)
      at org.apache.pig.parser.LogicalPlanGenerator.load_clause(LogicalPlanGenerator.java:3236)
      at org.apache.pig.parser.LogicalPlanGenerator.op_clause(LogicalPlanGenerator.java:1315)
      at org.apache.pig.parser.LogicalPlanGenerator.general_statement(LogicalPlanGenerator.java:799)
      at org.apache.pig.parser.LogicalPlanGenerator.statement(LogicalPlanGenerator.java:517)
      at org.apache.pig.parser.LogicalPlanGenerator.query(LogicalPlanGenerator.java:392)
      at org.apache.pig.parser.QueryParserDriver.parse(QueryParserDriver.java:184)
      ... 27 more
      Caused by: java.io.FileNotFoundException: File /data/*.dat does not exist.
      ... 41 more

      Attachments

        Activity

          People

            Unassigned Unassigned
            joaosal Joao Salcedo
            Votes:
            0 Vote for this issue
            Watchers:
            1 Start watching this issue

            Dates

              Created:
              Updated: