Description
the same sql, running on spark and mr engine, will generate different size of shuffle data.
i think it is because of hive on mr just serialize part of HiveKey, but hive on spark which using kryo will serialize full of Hivekey object.
what is your opionion?
Attachments
Attachments
Issue Links
- links to