HIVE-4331

Integrated StorageHandler for Hive and HCat using the HiveStorageHandler


Details

    Description

      1) Deprecate the "HCatHBaseStorageHandler" and "RevisionManager" in HCatalog. They will continue to function, but internally they will use the "DefaultStorageHandler" from Hive. They will be removed in a future release of Hive.

      2) Design a HivePassThroughFormat so that any new StorageHandler in Hive will bypass the HiveOutputFormat. We will use this class in Hive's "HBaseStorageHandler" instead of the "HiveHBaseTableOutputFormat".

      3) Write new unit tests in HCat's "storagehandler" so that systems such as Pig and MapReduce can use Hive's "HBaseStorageHandler" instead of the "HCatHBaseStorageHandler".

      4) Make sure all the old and new unit tests pass without breaking backward compatibility (except for the known issues described in the design document).

      5) Replace all references to "HCatStorageHandler" in the HCat source code with "HiveStorageHandler", including in the "FosterStorageHandler".

      I have attached the design document and will attach a patch to this Jira.
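
As an illustration of item 2, the pass-through idea can be sketched as a thin wrapper that does no format-specific work and simply forwards writes to the storage handler's own output format. This is a minimal sketch, not the code in the attached patches: NativeOutputFormat, NativeWriter, EngineWriter, PassThroughFormat, and InMemoryFormat are hypothetical stand-ins and do not correspond to actual Hive or Hadoop classes.

```java
import java.util.ArrayList;
import java.util.List;

// Hypothetical stand-in for a storage system's native writer (not a Hive or Hadoop type).
interface NativeWriter {
    void write(String key, String value);
}

// Hypothetical stand-in for a storage handler's own output format.
interface NativeOutputFormat {
    NativeWriter getWriter(String table);
}

// Hypothetical stand-in for the writer interface the query engine calls.
interface EngineWriter {
    void write(String key, String value);
    void close();
}

// Pass-through wrapper: adds no format logic, only forwards to the wrapped format.
final class PassThroughFormat {
    private final NativeOutputFormat delegate;

    PassThroughFormat(NativeOutputFormat delegate) {
        this.delegate = delegate;
    }

    EngineWriter getEngineWriter(String table) {
        final NativeWriter writer = delegate.getWriter(table);
        return new EngineWriter() {
            @Override
            public void write(String key, String value) {
                // Forward the record unchanged to the native writer.
                writer.write(key, value);
            }

            @Override
            public void close() {
                // Nothing to flush in this in-memory sketch.
            }
        };
    }
}

// In-memory format used only to exercise the wrapper.
final class InMemoryFormat implements NativeOutputFormat {
    final List<String> rows = new ArrayList<>();

    @Override
    public NativeWriter getWriter(String table) {
        return (key, value) -> rows.add(table + ":" + key + "=" + value);
    }
}

class PassThroughSketch {
    public static void main(String[] args) {
        InMemoryFormat backing = new InMemoryFormat();
        EngineWriter w = new PassThroughFormat(backing).getEngineWriter("t1");
        w.write("row1", "a");
        w.close();
        System.out.println(backing.rows);
    }
}
```

The point of the wrapper is that a new storage handler only has to supply its native output format; the engine-facing side stays the same for every handler.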

      Attachments

        1. HIVE-4331.2.patch
          172 kB
          Viraj Bhat
        2. HIVE-4331.1.patch
          173 kB
          Viraj Bhat
        3. HIVE-4331.patch
          172 kB
          Viraj Bhat
        4. hive4331hcatrebase.patch
          184 kB
          Viraj Bhat
        5. HIVE4331_07-17.patch
          181 kB
          Viraj Bhat
        6. StorageHandlerDesign_HIVE4331.pdf
          135 kB
          Viraj Bhat

            People

              viraj Viraj Bhat
              ashutoshc Ashutosh Chauhan
                Time Tracking

                  Estimated: Not Specified
                  Remaining: 0h
                  Logged: 20m