  1. Hive
  2. HIVE-17204 support un-bucketed tables in acid
  3. HIVE-17214

check/fix conversion of unbucketed non-acid to acid

    Details

    • Type: Sub-task
    • Status: Closed
    • Priority: Minor
    • Resolution: Fixed
    • Affects Version/s: None
    • Fix Version/s: 3.0.0
    • Component/s: Transactions
    • Labels: None

      Description

      Bucketed tables have stricter rules for file layout on disk: bucket files are direct children of a partition directory.

      For un-bucketed tables I'm not sure there are any rules.
      For example, CTAS with Tez + Union operator creates one directory for each leg of the union.

      Supposedly Hive can read a table by picking up all files recursively.
      Can it also write arbitrarily (other than in the CTAS example above)?
      Does that mean an Acid write can also write anywhere?

      Figure out what can be supported and how the existing layout can be checked. Examining a full "ls -l -R" for a large table could be expensive.
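
A minimal sketch of the kind of layout check discussed above, under stated assumptions: it uses java.nio.file on a local directory as a stand-in for the table's real file system, and the class name LayoutCheck, the method names, and the directory name union_leg_1 are hypothetical, not taken from Hive or from the attached patch. It reports data files that are not direct children of a partition directory, which is the rule described above for bucketed tables.

```java
import java.io.IOException;
import java.nio.file.Files;
import java.nio.file.Path;
import java.util.List;
import java.util.stream.Collectors;
import java.util.stream.Stream;

// Hypothetical helper, not part of Hive: illustrates checking whether every
// file in a partition directory is a direct child of that directory.
class LayoutCheck {

    // Lists all regular files that sit below partitionDir in a subdirectory.
    // This is a full recursive walk, i.e. the expensive "ls -R" style check.
    static List<Path> findNestedFiles(Path partitionDir) throws IOException {
        try (Stream<Path> walk = Files.walk(partitionDir)) {
            return walk
                .filter(Files::isRegularFile)
                .filter(p -> !partitionDir.equals(p.getParent()))
                .sorted()
                .collect(Collectors.toList());
        }
    }

    // Cheaper yes/no variant: anyMatch stops at the first nested file found,
    // although a fully flat layout still requires walking everything.
    static boolean hasNestedFiles(Path partitionDir) throws IOException {
        try (Stream<Path> walk = Files.walk(partitionDir)) {
            return walk
                .filter(Files::isRegularFile)
                .anyMatch(p -> !partitionDir.equals(p.getParent()));
        }
    }

    public static void main(String[] args) throws IOException {
        // Build a throwaway layout: one flat file plus one file inside a
        // subdirectory, mimicking one leg of a union (name is made up).
        Path part = Files.createTempDirectory("part");
        Files.createFile(part.resolve("000000_0"));
        Path leg = Files.createDirectories(part.resolve("union_leg_1"));
        Files.createFile(leg.resolve("000000_0"));

        System.out.println("nested files: " + findNestedFiles(part));
        System.out.println("has nested:   " + hasNestedFiles(part));
    }
}
```

The short-circuiting variant only helps when a violation exists; confirming that a large table is fully flat still needs a complete listing, so the cost concern raised above remains.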

        Attachments

        1. HIVE-17214.01.patch (11 kB, Eugene Koifman)

              People

              • Assignee: Eugene Koifman (ekoifman)
              • Reporter: Eugene Koifman (ekoifman)
              • Votes: 0
              • Watchers: 6
