Uploaded image for project: 'Hive'
  1. Hive
  2. HIVE-10685

Alter table concatenate oparetor will cause duplicate data

    XMLWordPrintableJSON

Details

    • Bug
    • Status: Resolved
    • Critical
    • Resolution: Fixed
    • 0.14.0, 1.0.0, 1.1.0, 1.2.0, 1.2.1, 1.3.0
    • 1.2.1
    • None
    • None

    Description

      "Orders" table has 1500000000 rows and stored as ORC.

      hive> select count(*) from orders;
      OK
      1500000000
      Time taken: 37.692 seconds, Fetched: 1 row(s)
      

      The table contain 14 files,the size of each file is about 2.1 ~ 3.2 GB.

      After executing command : ALTER TABLE orders CONCATENATE;
      The table is already 1530115000 rows.

      My hive version is 1.1.0.

      Attachments

        1. HIVE-10685.patch
          0.7 kB
          Prasanth Jayachandran
        2. HIVE-10685.patch
          0.4 kB
          guoliming

        Issue Links

          Activity

            People

              FanTn guoliming
              FanTn guoliming
              Votes:
              0 Vote for this issue
              Watchers:
              7 Start watching this issue

              Dates

                Created:
                Updated:
                Resolved: