  1. Hive
  2. HIVE-12971 Hive Support for Kudu
  3. HIVE-22362

Support key-range splitting by size in the HiveKuduInputFormat

    Details

    • Type: Sub-task
    • Status: Open
    • Priority: Major
    • Resolution: Unresolved
    • Affects Version/s: None
    • Fix Version/s: None
    • Component/s: None
    • Labels:
      None

      Description

      To allow more parallelism and more predictable task sizes, we should support Kudu key-range splitting so that a single tablet can be scanned by multiple tasks. Without this, parallelism is limited by the number of tablets to scan.

      The implementation would likely be similar to the Spark implementation here:
      https://github.com/apache/kudu/commit/22a6faa44364dec3a171ec79c15b814ad9277d8f
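
      As a rough illustration of why size-based splitting increases parallelism, the following sketch plans scan tasks from per-tablet size estimates. It is not the HiveKuduInputFormat or Kudu client code: the class `KeyRangeSplitSketch`, the `Split` type, and `planSplits` are hypothetical names, and the sketch only models how many tasks result. The real feature would split each tablet along primary-key ranges using size estimates from the server.

```java
import java.util.ArrayList;
import java.util.List;

// Hypothetical sketch: models task counts only, not actual Kudu key ranges.
public class KeyRangeSplitSketch {

    /** One planned scan task: a tablet plus a chunk index within that tablet. */
    public static final class Split {
        public final String tabletId;
        public final int chunkIndex;
        public final long estimatedBytes;

        Split(String tabletId, int chunkIndex, long estimatedBytes) {
            this.tabletId = tabletId;
            this.chunkIndex = chunkIndex;
            this.estimatedBytes = estimatedBytes;
        }

        @Override
        public String toString() {
            return tabletId + "#" + chunkIndex + " (" + estimatedBytes + " bytes)";
        }
    }

    /**
     * Plans splits so that no split exceeds splitSizeBytes.
     * A non-positive splitSizeBytes disables splitting (one split per tablet),
     * which mirrors the current behaviour described in this issue.
     */
    public static List<Split> planSplits(String[] tabletIds, long[] tabletBytes, long splitSizeBytes) {
        if (tabletIds.length != tabletBytes.length) {
            throw new IllegalArgumentException("tabletIds and tabletBytes must have the same length");
        }
        List<Split> splits = new ArrayList<>();
        for (int i = 0; i < tabletIds.length; i++) {
            long remaining = tabletBytes[i];
            // Splitting disabled, or the tablet already fits in one split.
            if (splitSizeBytes <= 0 || remaining <= splitSizeBytes) {
                splits.add(new Split(tabletIds[i], 0, remaining));
                continue;
            }
            int chunk = 0;
            while (remaining > 0) {
                long size = Math.min(remaining, splitSizeBytes);
                splits.add(new Split(tabletIds[i], chunk++, size));
                remaining -= size;
            }
        }
        return splits;
    }

    public static void main(String[] args) {
        String[] ids = {"tablet-a", "tablet-b"};
        long[] sizes = {300L << 20, 100L << 20}; // assumed sizes: 300 MiB and 100 MiB
        for (Split s : planSplits(ids, sizes, 128L << 20)) {
            System.out.println(s);
        }
    }
}
```

      With the assumed sizes above and a 128 MiB split size, the 300 MiB tablet yields three tasks and the 100 MiB tablet one, so four tasks run instead of two, and no task covers more than the configured size.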

            People

            • Assignee:
              Unassigned
              Reporter:
              granthenke Grant Henke
            • Votes:
              0
              Watchers:
              1