Uploaded image for project: 'Pig'
  1. Pig
  2. PIG-769

COUNT fails on local mode but executes correctly on grid mode for the same data

    XMLWordPrintableJSON

Details

    • Bug
    • Status: Resolved
    • Major
    • Resolution: Fixed
    • 0.2.0
    • 0.7.0
    • None
    • None

    Description

      The following script run on the grid executes correctly. It prints (4L) for '/user/gmavr/k_sample_preprocessed_withj_sample'

      On local mode (invoked with -x local) and the same data in the local filesystem, it failes with:
      -2009-04-11 03:23:15,155 [main] ERROR org.apache.pig.backend.hadoop.executionengine.physicalLayer.relationalOperators.POStore - Received error from storer function: org.apache.pig.backend.executionengine.ExecException: ERROR 2106: Error while computing count in COUNT

      %declare k_sample_preprocessed_withj '/user/gmavr/k_sample_preprocessed_withj_sample';
      – %declare k_sample_preprocessed_withj '/homes/gmavr/mlrSite/k_sample_preprocessed_withj_sample';

      webdataFiltered = LOAD '$k_sample_preprocessed_withj' USING BinStorage() AS (url:chararray, pg:bytearray);

      X1 = GROUP webdataFiltered ALL;
      Y1 = FOREACH X1 GENERATE COUNT;
      DUMP Y1;

      Attachments

        Activity

          People

            Unassigned Unassigned
            gmavromatis George Mavromatis
            Votes:
            0 Vote for this issue
            Watchers:
            0 Start watching this issue

            Dates

              Created:
              Updated:
              Resolved: