Uploaded image for project: 'Falcon'
  1. Falcon
  2. FALCON-85 Hive (HCatalog) integration
  3. FALCON-143

Enable Late data handling for hive tables

    XMLWordPrintableJSON

Details

    • Sub-task
    • Status: Resolved
    • Major
    • Resolution: Fixed
    • 0.3
    • 0.4
    • None
    • None

    Description

      HCat nor Hive APIs expose internal stats about a given partition. The only way to get the partition size is to get the location of the partition on HDFS and then use globStatus and contentSummary APIs.

      With the addition of HIVE-5317, this is going to get more complicated with deltas and minor and major compactions with no locking.

      Need to work with hive to see if there will be an API or Falcon needs to understand the structure of the layout of the data on the file system.

      Attachments

        1. FALCON-143.patch
          119 kB
          Venkatesh Seetharam
        2. FALCON-143-r0.patch
          120 kB
          Venkatesh Seetharam

        Issue Links

          Activity

            People

              svenkat Venkatesh Seetharam
              svenkat Venkatesh Seetharam
              Votes:
              0 Vote for this issue
              Watchers:
              3 Start watching this issue

              Dates

                Created:
                Updated:
                Resolved: