Uploaded image for project: 'Hive'
  1. Hive
  2. HIVE-12052

automatically populate file metadata to HBase metastore based on config or table properties

Log workAgile BoardRank to TopRank to BottomBulk Copy AttachmentsBulk Move AttachmentsAdd voteVotersWatch issueWatchersCreate sub-taskConvert to sub-taskMoveLinkCloneLabelsUpdate Comment AuthorReplace String in CommentUpdate Comment VisibilityDelete Comments
    XMLWordPrintableJSON

Details

    • Bug
    • Status: Open
    • Major
    • Resolution: Unresolved
    • None
    • None
    • None
    • None

    Description

      As discussed in HIVE-11500
      Should use a table property similar to auto.purge.
      Then, when this setting is set, partitions are added (convertToMPart is a good source to find all the paths for that), after compactions, after load/non-ACID insert, and periodically (configurable), the storage locations should be scanned for new files and cache updated accordingly. All the updates should probably be in the background thread and taken from queue (high pri from most ops, low pri from enabling the property and from periodic updates) to avoid high load on HDFS from metastore.

      Attachments

        1. HIVE-12052.WIP.patch
          91 kB
          Sergey Shelukhin

        Issue Links

        Activity

          This comment will be Viewable by All Users Viewable by All Users
          Cancel

          People

            sershe Sergey Shelukhin Assign to me
            sershe Sergey Shelukhin

            Dates

              Created:
              Updated:

              Slack

                Issue deployment