Currently, if a user wants to deploy the hello-samza project to the YARN, he has to change the pom file in samza-job-package by adding the HDFS dependency. see tutorial
I think we should provide the hdfs jar by default. Because
1. the user can use the same package compiled in Hello Samza page when he first time uses the Samza for the YARN deploy
2. the user only needs to upload the package and change the factory and file path in the job properties file, which makes the whole process smoother.