Uploaded image for project: 'Hive'
  1. Hive
  2. HIVE-10685

Alter table concatenate oparetor will cause duplicate data

    Details

    • Type: Bug
    • Status: Resolved
    • Priority: Critical
    • Resolution: Fixed
    • Affects Version/s: 0.14.0, 1.0.0, 1.2.0, 1.1.0, 1.3.0, 1.2.1
    • Fix Version/s: 1.2.1
    • Component/s: None
    • Labels:
      None

      Description

      "Orders" table has 1500000000 rows and stored as ORC.

      hive> select count(*) from orders;
      OK
      1500000000
      Time taken: 37.692 seconds, Fetched: 1 row(s)
      

      The table contain 14 files,the size of each file is about 2.1 ~ 3.2 GB.

      After executing command : ALTER TABLE orders CONCATENATE;
      The table is already 1530115000 rows.

      My hive version is 1.1.0.

        Attachments

        1. HIVE-10685.patch
          0.4 kB
          guoliming
        2. HIVE-10685.patch
          0.7 kB
          Prasanth Jayachandran

          Issue Links

            Activity

              People

              • Assignee:
                FanTn guoliming
                Reporter:
                FanTn guoliming
              • Votes:
                0 Vote for this issue
                Watchers:
                6 Start watching this issue

                Dates

                • Created:
                  Updated:
                  Resolved: