Dataload has several locations where it does a long string of HDFS commands similar to this:
Most hdfs shell commands can take multiple arguments. In particular, "mkdir" can make multiple directories in one command. "put" can copy multiple files into a single destination. This can save on hdfs commandline invocations, which are often expensive due to JVM startup and other costs. For example, the above is equivalent to:
Dataload should make these types of optimizations wherever possible.