Pig / PIG-1071

Support comma separated file/directory names in load statements

Details

    • Type: New Feature
    • Status: Closed
    • Priority: Major
    • Resolution: Fixed
    • Affects Version/s: None
    • Fix Version/s: 0.6.0
    • Component/s: None
    • Labels: None
    • Hadoop Flags: Reviewed

    Description

      Currently Pig Latin supports the following LOAD syntax:

      LOAD 'data' [USING loader function] [AS schema];      
      

      where data is the name of the file or directory, including files specified with Hadoop-supported globbing syntax. This name is passed to the loader function.

      This feature supports loaders that can load multiple files from different directories, and it allows users to pass the file names in a comma-separated string.

      For example, these will be valid load statements:

      LOAD '/usr/pig/test1/a,/usr/pig/test2/b' USING someloader();
      

      and

      LOAD '/usr/pig/test1/{a,c},/usr/pig/test2/b' USING someloader();
      

      This comma-separated string is passed to the loader.
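
      Because a glob such as {a,c} also contains a comma, a loader that receives the whole string cannot simply split on every comma. The following is a minimal sketch, not the code from the attached patch, of one way a loader might separate the locations while leaving commas inside braces alone. The class name LoadPathSplitter and the method splitPaths are hypothetical, and the sketch assumes braces are not escaped.

```java
import java.util.ArrayList;
import java.util.List;

// Hypothetical helper for illustration only; not part of Pig or of PIG-1071.patch.
final class LoadPathSplitter {

    // Splits a LOAD location string on commas, but keeps commas that appear
    // inside {...} glob alternatives as part of the same path.
    // Assumption: braces are never escaped in the input.
    static List<String> splitPaths(String location) {
        List<String> paths = new ArrayList<>();
        StringBuilder current = new StringBuilder();
        int braceDepth = 0;
        for (int i = 0; i < location.length(); i++) {
            char c = location.charAt(i);
            if (c == '{') {
                braceDepth++;
            } else if (c == '}' && braceDepth > 0) {
                braceDepth--;
            }
            if (c == ',' && braceDepth == 0) {
                // A top-level comma ends the current path.
                paths.add(current.toString());
                current.setLength(0);
            } else {
                current.append(c);
            }
        }
        paths.add(current.toString());
        return paths;
    }

    public static void main(String[] args) {
        // prints [/usr/pig/test1/{a,c}, /usr/pig/test2/b]
        System.out.println(splitPaths("/usr/pig/test1/{a,c},/usr/pig/test2/b"));
    }
}
```

      Each resulting entry can then be treated as an ordinary file, directory, or glob pattern by the loader.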

      Attachments

        1. PIG-1071.patch
          10 kB
          Richard Ding
        2. PIG-1071.patch
          10 kB
          Richard Ding

          People

            Assignee: Richard Ding
            Reporter: Richard Ding