Details
Description
set hive.exec.parallel=true; will cause the Yarn application instance to linger
forever. set hive.exec.parallel=false, the application goes away as soon as hive query is complete. The underlying table is an ORC store_sales table compressed with SNAPPY.
hive.exec.parallel=true;
select * from store_sales where ss_ticket_number=5741230 and ss_item_sk=4825
The query will run under Tez and finish << 30 seconds.
After 30-40 of these jobs the cluster gets to a point where no jobs will finish.