YARN applications like MapReduce and Tez rely on the cluster's Hadoop jars instead of distributing their own.
This has a couple advantages
- Avoids sending a bunch of bits to every node for each app
- Only a single version of Hadoop can be running on a cluster at one time, simplifying debugging
- Easier to upgrade and apply patched versions of Hadoop