IMPALA-2125

Improve perf when reading timestamps from parquet files written by hive


    Details

    • Type: Improvement
    • Status: Closed
    • Priority: Minor
    • Resolution: Duplicate
    • Affects Version/s: Impala 2.2
    • Fix Version/s: None
    • Component/s: Backend
    • Labels:
      None

      Description

      This issue is for tracking purposes. The improvement is already committed as 29de99c9d25c49b73488d2f75bc3644ae9ff9325.

      When the flag -convert_legacy_hive_parquet_utc_timestamps=true is set, query runtime may be 10x longer (possibly more), depending on the query. The commit above inlines some function calls, which reduces the slowdown in the 10x case to 5x.
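
      For context, here is a minimal sketch of the kind of per-value work this flag implies, assuming Hive wrote the timestamps normalized to UTC and the reader shifts each one to local time. It is an illustration in Python, not Impala's backend code. The fixed UTC-8 offset and the name utc_to_local are assumptions made for the example.

```python
from datetime import datetime, timedelta, timezone

# Hypothetical fixed local offset (UTC-8), chosen only to keep the sketch
# self-contained. A real conversion would consult time zone rules,
# including daylight saving time, for each value.
LOCAL_TZ = timezone(timedelta(hours=-8))


def utc_to_local(naive_utc: datetime) -> datetime:
    """Treat a naive timestamp as UTC and return the naive local wall-clock time."""
    aware_utc = naive_utc.replace(tzinfo=timezone.utc)
    return aware_utc.astimezone(LOCAL_TZ).replace(tzinfo=None)


# Value as stored in the file (UTC), converted once per value on read.
stored = datetime(2015, 6, 1, 12, 0, 0)
print(utc_to_local(stored))
```

      Because a conversion like this runs once per timestamp value, even small per-call overhead adds up across a large scan, which is consistent with function inlining helping here.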

            People

            • Assignee: Unassigned
            • Reporter: caseyc casey

