Hive / HIVE-23347

MSCK REPAIR cannot discover partitions with upper case directory names.

Details

    Description

      In the following scenario, we expect MSCK REPAIR to discover the partitions, but it does not.
      1. Have partitioned data paths as follows:
      hdfs://mycluster/datapath/t1/Year=2020/Month=03/Day=10
      hdfs://mycluster/datapath/t1/Year=2020/Month=03/Day=11
      2. create external table t1 (key int, value string) partitioned by (Year int, Month int, Day int) stored as orc location 'hdfs://mycluster/datapath/t1';
      3. msck repair table t1;
      4. show partitions t1; --> Returns zero partitions
      5. select * from t1; --> Returns empty data.

      When the partition directory names are changed to lower case, the partitions are discovered as expected.
      hdfs://mycluster/datapath/t1/year=2020/month=03/day=10
      hdfs://mycluster/datapath/t1/year=2020/month=03/day=11
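
      The report does not state the root cause. One plausible reading is that the key part of each directory name (Year, Month, Day) is compared case-sensitively against the table's partition column names, which Hive stores in lower case. The sketch below illustrates that reading only; parse_partition_path and its ignore_case flag are hypothetical names, not Hive's code or API.

```python
from typing import Dict, List, Optional


def parse_partition_path(
    path: str,
    table_root: str,
    partition_cols: List[str],
    ignore_case: bool = True,
) -> Optional[Dict[str, str]]:
    """Map a partition directory to {column: value}, or None if it does not match.

    Hypothetical helper for illustration; this is not how Hive implements MSCK.
    """
    if not path.startswith(table_root):
        return None
    relative = path[len(table_root):].strip("/")
    components = relative.split("/") if relative else []
    # The directory depth must equal the number of partition columns.
    if len(components) != len(partition_cols):
        return None

    spec: Dict[str, str] = {}
    for col, component in zip(partition_cols, components):
        key, sep, value = component.partition("=")
        if not sep:
            return None  # not a key=value directory
        # Assumption: the bug corresponds to the case-sensitive branch here.
        matches = key.lower() == col.lower() if ignore_case else key == col
        if not matches:
            return None
        spec[col] = value
    return spec


root = "hdfs://mycluster/datapath/t1"
cols = ["year", "month", "day"]  # column names as stored in lower case
upper = root + "/Year=2020/Month=03/Day=10"

print(parse_partition_path(upper, root, cols, ignore_case=False))
print(parse_partition_path(upper, root, cols, ignore_case=True))
```

      With a case-sensitive comparison the upper case directory is rejected, which would match the zero partitions seen in step 4; with a case-insensitive comparison it resolves to year=2020, month=03, day=10.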

      Attachments

        1. HIVE-23347.01.patch
          6 kB
          Syed Shameerur Rahman
        2. HIVE-23347.10.patch
          23 kB
          Adesh Kumar Rao
        3. HIVE-23347.2.patch
          8 kB
          Adesh Kumar Rao
        4. HIVE-23347.3.patch
          8 kB
          Adesh Kumar Rao
        5. HIVE-23347.4.patch
          13 kB
          Adesh Kumar Rao
        6. HIVE-23347.5.patch
          21 kB
          Adesh Kumar Rao
        7. HIVE-23347.6.patch
          21 kB
          Adesh Kumar Rao
        8. HIVE-23347.7.patch
          21 kB
          Adesh Kumar Rao
        9. HIVE-23347.8.patch
          22 kB
          Adesh Kumar Rao
        10. HIVE-23347.9.patch
          23 kB
          Adesh Kumar Rao

            People

              adeshrao Adesh Kumar Rao
              sankarh Sankar Hariappan
              Votes: 0
              Watchers: 4

                Time Tracking

                  Estimated: Not Specified
                  Remaining: 0h
                  Logged: 0.5h