Uploaded image for project: 'Pig'
  1. Pig
  2. PIG-2266

bug with input file joining optimization in Pig

    XMLWordPrintableJSON

Details

    • Bug
    • Status: Closed
    • Major
    • Resolution: Fixed
    • 0.9.0, 0.10.0
    • 0.12.0
    • impl
    • None

    Description

      In src/org/apache/pig/backend/hadoop/executionengine/mapReduceLayer/MRCompiler.java, the function hasTooManyInputFiles instantiated a LoadFunc instance, then calls setLocation before calling setUDFContextSignature. This is inconsistent with the documentation for the LoadFunc interface (see http://pig.apache.org/docs/r0.9.0/api/org/apache/pig/LoadFunc.html#setUDFContextSignature(java.lang.String)). (We've written UDFs that assume that setUDFContextSignature is called first.)

      I think you can fix this by adding

      loader.setUDFContextSignature(ld.getSignature());

      Before

      loader.setLocation(location, job);

      Attachments

        1. PIG-2266.patch
          0.9 kB
          Cheolsoo Park

        Issue Links

          Activity

            People

              jadler Joseph Adler
              jadler Joseph Adler
              Votes:
              1 Vote for this issue
              Watchers:
              5 Start watching this issue

              Dates

                Created:
                Updated:
                Resolved: