I have a Pig script that I am using as the workflow for my Falcon process. The Pig script uses HCatStorer to write to an HCatalog URI, which is the output feed defined in my Falcon process entity. The Pig action in the Oozie workflow generated by Falcon fails with the attached stack trace. The root cause is a missing class definition for org/apache/hadoop/hive/shims/ShimLoader.
Running the script manually using pig -x tez -useHCatalog <all the -params passed by Oozie> <path to pig script> results in a successful execution. The missing class definition error appears only when the script runs as a Pig action in the Falcon-generated Oozie workflow.
After some investigation, I found that the Oozie workflow.xml is missing a required sharelib declaration.
From the workflow.xml generated by Falcon:
If I modify the value to include the hive sharelib, the Pig action succeeds and no longer throws a missing class definition error.
Modified workflow.xml property (works):
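As a rough sketch of the kind of property involved, assuming the standard Oozie per-action sharelib override oozie.action.sharelib.for.pig is the one in question (the sharelib names in the value are illustrative and not copied from the generated workflow):

```xml
<!-- Assumption: property set inside the Pig action's <configuration> block. -->
<!-- The value is a comma-separated list of sharelib directories to add to   -->
<!-- the action's classpath; "hive" is the entry that supplies ShimLoader.   -->
<property>
    <name>oozie.action.sharelib.for.pig</name>
    <value>pig,hcatalog,hive</value>
</property>
```

Without the hive entry in that list, the Hive shims jar would not be on the Pig action's classpath, which is consistent with the missing ShimLoader class described above.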