Uploaded image for project: 'Apache Hudi'
  1. Apache Hudi
  2. HUDI-376

AWS Glue dependency issue for EMR 5.28.0

    XMLWordPrintableJSON

Details

    Description

      Hi hudi team, it's really encouraging that Hudi is finally officially supported application on AWS EMR. Great job!

      I found a ClassNotFound exception when using:

      /usr/lib/hudi/bin/run_sync_tool.sh
      

      in emr master.

      And I think is due to demand of aws glue data sdk dependency. (I used aws glue as hive meta data)

      So I added a line to run_sync_tool.sh to get a quick fix for this:

      HIVE_JARS=$HIVE_JARS:/usr/lib/hive/auxlib/aws-glue-datacatalog-hive2-client.jar:/usr/share/aws/emr/emr-metrics-collector/lib/aws-java-sdk-glue-1.11.475.jar

      not sure if any more jars needed, but these two jar fixed my problem.

       

      I think it would be great if take glue in consideration for emr scripts.

      Attachments

        Issue Links

          Activity

            People

              Unassigned Unassigned
              XingXPan Xing Pan
              Votes:
              0 Vote for this issue
              Watchers:
              4 Start watching this issue

              Dates

                Created:
                Updated:
                Resolved:

                Time Tracking

                  Estimated:
                  Original Estimate - Not Specified
                  Not Specified
                  Remaining:
                  Remaining Estimate - 0h
                  0h
                  Logged:
                  Time Spent - 20m
                  20m