  1. Pig
  2. PIG-4932

Cache files not loaded when using 'limit' operator

Details

    • Type: Bug
    • Status: Open
    • Priority: Major
    • Resolution: Unresolved
    • Affects Version/s: 0.14.0
    • Fix Version/s: 0.18.0
    • Component/s: None
    • Labels: None

    Description

      A Pig UDF throws an error when its input is passed through the LIMIT operator before reaching the UDF. With LIMIT in the pipeline, the UDF cannot find its cache file:

      org.apache.pig.backend.executionengine.ExecException: ERROR 2078: Caught error from UDF: org.test.hadoop.pig.BagProcess [Caught exception: File './names_cache' does not exist]

      When the LIMIT is removed and the input is fed directly to the UDF, the script runs fine. The LIMIT operator appears not to load the cache files, which causes the failure. I was able to reproduce this on two different clusters running version 0.14.0 of Hive.
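
For readers unfamiliar with this failure mode, the following is a minimal sketch of the kind of lookup a UDF such as BagProcess presumably performs. It is not the reporter's code: the class name `CacheFileLookup`, the method `loadNames`, and the one-name-per-line file layout are hypothetical. It assumes the cache file is expected under a relative name (here `./names_cache`) in the task's working directory, which is where a distributed-cache file would normally be linked. If the file was never shipped to the task, the lookup fails with the same kind of message as in the report.

```java
import java.io.BufferedReader;
import java.io.IOException;
import java.nio.charset.StandardCharsets;
import java.nio.file.Files;
import java.nio.file.Path;
import java.nio.file.Paths;
import java.util.HashSet;
import java.util.Set;

// Hypothetical stand-in for the cache lookup inside a UDF; plain Java, no Pig dependency.
class CacheFileLookup {

    // Loads one name per line from a file expected in the current working directory.
    static Set<String> loadNames(String fileName) throws IOException {
        Path cacheFile = Paths.get(fileName);
        if (!Files.exists(cacheFile)) {
            // Mirrors the wording in the report; the real UDF's check is unknown.
            throw new IOException("File '" + fileName + "' does not exist");
        }
        Set<String> names = new HashSet<>();
        try (BufferedReader reader =
                 Files.newBufferedReader(cacheFile, StandardCharsets.UTF_8)) {
            String line;
            while ((line = reader.readLine()) != null) {
                String trimmed = line.trim();
                if (!trimmed.isEmpty()) {
                    names.add(trimmed);
                }
            }
        }
        return names;
    }

    public static void main(String[] args) {
        try {
            Set<String> names = loadNames("./names_cache");
            System.out.println("Loaded " + names.size() + " names");
        } catch (IOException e) {
            System.out.println("Lookup failed: " + e.getMessage());
        }
    }
}
```

Under this assumption, the lookup only succeeds in tasks where the cache file has been placed in the working directory, so a job stage that runs without the cache files configured would fail in the way described above.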

          People

            Assignee: Daniel Dai (daijy)
            Reporter: hemanth meka (hem1891)
            Votes: 1
            Watchers: 2

            Dates

              Created:
              Updated: