Uploaded image for project: 'Pig'
  1. Pig
  2. PIG-2266

bug with input file joining optimization in Pig

    Details

    • Type: Bug
    • Status: Closed
    • Priority: Major
    • Resolution: Fixed
    • Affects Version/s: 0.9.0, 0.10.0
    • Fix Version/s: 0.12.0
    • Component/s: impl
    • Labels:
      None

      Description

      In src/org/apache/pig/backend/hadoop/executionengine/mapReduceLayer/MRCompiler.java, the function hasTooManyInputFiles instantiated a LoadFunc instance, then calls setLocation before calling setUDFContextSignature. This is inconsistent with the documentation for the LoadFunc interface (see http://pig.apache.org/docs/r0.9.0/api/org/apache/pig/LoadFunc.html#setUDFContextSignature(java.lang.String)). (We've written UDFs that assume that setUDFContextSignature is called first.)

      I think you can fix this by adding

      loader.setUDFContextSignature(ld.getSignature());

      Before

      loader.setLocation(location, job);

        Attachments

        1. PIG-2266.patch
          0.9 kB
          Cheolsoo Park

          Issue Links

            Activity

              People

              • Assignee:
                jadler Joseph Adler
                Reporter:
                jadler Joseph Adler
              • Votes:
                1 Vote for this issue
                Watchers:
                5 Start watching this issue

                Dates

                • Created:
                  Updated:
                  Resolved: