Hive / HIVE-10685

ALTER TABLE CONCATENATE operator causes duplicate data


    Details

    • Type: Bug
    • Status: Resolved
    • Priority: Critical
    • Resolution: Fixed
    • Affects Version/s: 0.14.0, 1.0.0, 1.1.0, 1.2.0, 1.2.1, 1.3.0
    • Fix Version/s: 1.2.1
    • Component/s: None
    • Labels:
      None

      Description

      The "orders" table has 1500000000 rows and is stored as ORC.

      hive> select count(*) from orders;
      OK
      1500000000
      Time taken: 37.692 seconds, Fetched: 1 row(s)
      

      The table contains 14 files, each about 2.1 to 3.2 GB in size.

      After executing the command: ALTER TABLE orders CONCATENATE;
      the table has 1530115000 rows.
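To put the discrepancy in proportion, the two row counts reported above can be compared directly. This is a minimal Python sketch of that arithmetic only; the variable names are illustrative and not part of the original report, and it says nothing about which rows were duplicated or why.

```python
# Row counts taken from the report: before and after ALTER TABLE orders CONCATENATE.
rows_before = 1_500_000_000
rows_after = 1_530_115_000

# Extra rows that appeared after the concatenate, absolute and relative.
extra_rows = rows_after - rows_before
extra_pct = 100 * extra_rows / rows_before

print(f"{extra_rows} extra rows ({extra_pct:.2f}% of the original)")
# prints "30115000 extra rows (2.01% of the original)"
```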

      My Hive version is 1.1.0.

        Attachments

        1. HIVE-10685.patch
          0.7 kB
          Prasanth Jayachandran
        2. HIVE-10685.patch
          0.4 kB
          guoliming
