Uploaded image for project: 'Hive'
  1. Hive
  2. HIVE-14586

Hive : with multi insert query: FAILED: SemanticException Should not happened

    XMLWordPrintableJSON

Details

    • Bug
    • Status: Open
    • Major
    • Resolution: Unresolved
    • None
    • None
    • None
    • None

    Description

      I am using multi insert query for optimization purpose, surely it helps me a lot but with each day run, I can find 3 to 4 id's (having count more than 10 Million) taking too much time at reducer. to fix this I have implemented skewjoin optimization properties but it's throwing

      "FAILED: SemanticException Should not happened"

      Properties which I am using

      set hive.exec.compress.output=true;
      set mapreduce.output.fileoutputformat.compress=true;
      set mapreduce.output.fileoutputformat.compress.codec=org.apache.hadoop.io.compress.SnappyCodec;
      set mapreduce.output.fileoutputformat.compress.type=BLOCK;
      SET hive.optimize.skewjoin=true;
      set hive.exec.compress.intermediate=true;
      set hive.intermediate.compression.codec=org.apache.hadoop.io.compress.SnappyCodec;
      set hive.intermediate.compression.type=BLOCK;
      set hive.optimize.skewjoin=true;
      set hive.skewjoin.key=100000;
      set hive.skewjoin.mapjoin.map.tasks=10000;
      set hive.skewjoin.mapjoin.min.split=33554432;

      Attachments

        Activity

          People

            Unassigned Unassigned
            amitVns Amit Pathak
            Votes:
            0 Vote for this issue
            Watchers:
            1 Start watching this issue

            Dates

              Created:
              Updated: