Uploaded image for project: 'Hive'
  1. Hive
  2. HIVE-6867

Bucketized Table feature fails in some cases

    XMLWordPrintableJSON

Details

    • Bug
    • Status: Patch Available
    • Major
    • Resolution: Unresolved
    • 0.12.0
    • None
    • HiveServer2
    • None

    Description

      Bucketized Table feature fails in some cases. if src & destination is bucketed on same key, and if actual data in the src is not bucketed (because data got loaded using LOAD DATA LOCAL INPATH ) then the data won't be bucketed while writing to destination.
      Example
      ----------------------------------------------------------------------
      CREATE TABLE P1(key STRING, val STRING)
      CLUSTERED BY (key) SORTED BY (key) INTO 2 BUCKETS STORED AS TEXTFILE;
      LOAD DATA LOCAL INPATH '/Users/jp/apache-hive1/data/files/P1.txt' INTO TABLE P1;
      – perform an insert to make sure there are 2 files
      INSERT OVERWRITE TABLE P1 select key, val from P1;
      --------------------------------------------------
      This is not a regression. This has never worked.
      This got only discovered due to Hadoop2 changes.
      In Hadoop1, in local mode, number of reducers will always be 1, regardless of what is requested by app. Hadoop2 now honors the number of reducer setting in local mode (by spawning threads).
      Long term solution seems to be to prevent load data for bucketed table.

      Attachments

        1. HIVE-6867.01.patch
          8 kB
          Pengcheng Xiong
        2. HIVE-6867.02.patch
          503 kB
          Pengcheng Xiong
        3. HIVE-6867.03.patch
          503 kB
          Pengcheng Xiong
        4. HIVE-6867.04.patch
          428 kB
          Pengcheng Xiong
        5. HIVE-6867.05.patch
          428 kB
          Pengcheng Xiong

        Issue Links

          Activity

            People

              Unassigned Unassigned
              jpullokkaran Laljo John Pullokkaran
              Votes:
              0 Vote for this issue
              Watchers:
              6 Start watching this issue

              Dates

                Created:
                Updated: