Uploaded image for project: 'IMPALA'
  1. IMPALA
  2. IMPALA-2125

Improve perf when reading timestamps from parquet files written by hive

    XMLWordPrintableJSON

    Details

    • Type: Improvement
    • Status: Closed
    • Priority: Minor
    • Resolution: Duplicate
    • Affects Version/s: Impala 2.2
    • Fix Version/s: None
    • Component/s: Backend
    • Labels:
      None

      Description

      This is for tracking purposes. The improvement is already committed – 29de99c9d25c49b73488d2f75bc3644ae9ff9325.

      When using the flag -convert_legacy_hive_parquet_utc_timestamps=true, depending on the query the runtime may be 10x longer (possibly more). The commit above inlines some function calls which improves the 10x case to 5x.

        Attachments

          Issue Links

            Activity

              People

              • Assignee:
                Unassigned
                Reporter:
                caseyc casey
              • Votes:
                0 Vote for this issue
                Watchers:
                3 Start watching this issue

                Dates

                • Created:
                  Updated:
                  Resolved: