Currently, external table replication copies data at the directory level. So, if the target directory already exists, DistCp should compare the data files in it and update or skip them, instead of creating a new directory inside the pre-existing target directory.
This can be achieved using the DistCp -update option.
Also, the -delete option is needed to delete files that are missing from the source directory but still present in the target.
Hive should always apply these DistCp options, even if the user passes other options.
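
As a rough illustration of the last point, the merge of mandatory and user-supplied options could look something like the sketch below. This is not Hive's actual code: the helper name build_distcp_options and the constant MANDATORY_DISTCP_OPTIONS are hypothetical, and the user options are assumed to arrive as a flat list of command-line tokens. Only the -update and -delete flags come from the description above.

```python
# Hypothetical sketch, not Hive's implementation.
# Options that must always be passed to DistCp for external table replication.
MANDATORY_DISTCP_OPTIONS = ("-update", "-delete")


def build_distcp_options(user_options):
    """Return the DistCp option tokens with the mandatory ones always present.

    user_options is assumed to be a flat list of tokens, e.g. ["-m", "20"].
    Mandatory flags the user already supplied are not repeated; every other
    token (including option values) is kept in its original order.
    """
    merged = list(MANDATORY_DISTCP_OPTIONS)
    merged.extend(
        opt for opt in user_options if opt not in MANDATORY_DISTCP_OPTIONS
    )
    return merged


if __name__ == "__main__":
    # The user only asked for 20 mappers; -update and -delete are still added.
    print(build_distcp_options(["-m", "20"]))
```

Putting the mandatory flags first and filtering only those exact tokens out of the user's list keeps option values such as the "20" after -m untouched, while guaranteeing that -update and -delete appear exactly once.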