Hive / HIVE-17006

LLAP: Parquet caching v1

Details

    • Type: Bug
    • Status: Closed
    • Priority: Major
    • Resolution: Fixed
    • Affects Version/s: None
    • Fix Version/s: 3.0.0
    • Component/s: None
    • Labels: None

    Description

      There are multiple options for Parquet caching in LLAP:
      1) Full elevator (too intrusive for now).
      2) Page-based cache like ORC's (requires some changes to Parquet, or copy-pasted code).
      3) Cache the on-disk data at the column chunk level, as is.

      Given that Parquet reads at column chunk granularity, (2) is not as useful as it is for ORC, but it is still a good idea. I messaged the dev list about it but didn't get a response; we may follow up later.

      For now, do (3).
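
To make option (3) concrete, here is a minimal sketch of caching raw column chunk bytes keyed by file, offset, and length. It is not taken from the attached patches or from the LLAP cache API: `ColumnChunkCache`, `ChunkKey`, `getOrLoad`, and the simple LRU byte budget are all hypothetical names and choices made for illustration only.

```java
import java.nio.ByteBuffer;
import java.util.Iterator;
import java.util.LinkedHashMap;
import java.util.Map;
import java.util.Objects;
import java.util.function.Supplier;

/** Hypothetical sketch: caches column chunk bytes exactly as they sit on disk. */
final class ColumnChunkCache {

  /** Identifies one column chunk by the byte range it occupies in a file. */
  static final class ChunkKey {
    final String fileId;
    final long offset;
    final int length;

    ChunkKey(String fileId, long offset, int length) {
      this.fileId = fileId;
      this.offset = offset;
      this.length = length;
    }

    @Override
    public boolean equals(Object o) {
      if (!(o instanceof ChunkKey)) return false;
      ChunkKey k = (ChunkKey) o;
      return offset == k.offset && length == k.length && fileId.equals(k.fileId);
    }

    @Override
    public int hashCode() {
      return Objects.hash(fileId, offset, length);
    }
  }

  private final long maxBytes;
  private long usedBytes;
  // accessOrder = true makes iteration start at the least recently used entry.
  private final LinkedHashMap<ChunkKey, byte[]> chunks =
      new LinkedHashMap<>(16, 0.75f, true);

  ColumnChunkCache(long maxBytes) {
    this.maxBytes = maxBytes;
  }

  /** Returns the cached bytes, or reads them through diskReader and caches them. */
  synchronized ByteBuffer getOrLoad(ChunkKey key, Supplier<byte[]> diskReader) {
    byte[] data = chunks.get(key);
    if (data == null) {
      data = diskReader.get();
      if (data.length != key.length) {
        throw new IllegalStateException("Read " + data.length + " bytes, expected " + key.length);
      }
      // Chunks larger than the whole budget are served but never cached.
      if (data.length <= maxBytes) {
        chunks.put(key, data);
        usedBytes += data.length;
        evictUntilWithinBudget();
      }
    }
    return ByteBuffer.wrap(data).asReadOnlyBuffer();
  }

  synchronized long usedBytes() {
    return usedBytes;
  }

  /** Drops least recently used chunks until the byte budget is respected. */
  private void evictUntilWithinBudget() {
    Iterator<Map.Entry<ChunkKey, byte[]>> it = chunks.entrySet().iterator();
    while (usedBytes > maxBytes && it.hasNext()) {
      usedBytes -= it.next().getValue().length;
      it.remove();
    }
  }
}
```

Because the bytes are stored as is, the reader still has to decompress and decode pages on every access; the cache only saves the disk or remote filesystem read. That is the trade-off relative to the page-based approach in option (2).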

      Attachments

        1. HIVE-17006.WIP.patch (95 kB, Sergey Shelukhin)
        2. HIVE-17006.patch (115 kB, Sergey Shelukhin)
        3. HIVE-17006.01.patch (118 kB, Sergey Shelukhin)
        4. HIVE-17006.02.patch (118 kB, Sergey Shelukhin)
        5. HIVE-17006.03.patch (130 kB, Sergey Shelukhin)

            People

              Assignee: sershe Sergey Shelukhin
              Reporter: sershe Sergey Shelukhin
              Votes: 0
              Watchers: 7
