Uploaded image for project: 'IMPALA'
  1. IMPALA
  2. IMPALA-11124

testdata loading should reuse TPCH/TPCDS local data if they exist

    XMLWordPrintableJSON

Details

    • Improvement
    • Status: Resolved
    • Major
    • Resolution: Fixed
    • None
    • Impala 4.1.0
    • Infrastructure
    • None
    • ghx-label-11

    Description

      When loading testdata for TPC-H/TPC-DS, we first run a preload script to generate local data, and then upload them to HDFS to be used by Hive. It's time-consuming to run the preload script in large scale factors (e.g. 30). We should reuse them if they exist.

      Attachments

        Activity

          People

            stigahuang Quanlong Huang
            stigahuang Quanlong Huang
            Votes:
            0 Vote for this issue
            Watchers:
            2 Start watching this issue

            Dates

              Created:
              Updated:
              Resolved: