Uploaded image for project: 'Tajo'
  1. Tajo
  2. TAJO-1975

Gathering fine-grained column statistics for range shuffle

    Details

    • Type: Task
    • Status: Resolved
    • Priority: Major
    • Resolution: Fixed
    • Affects Version/s: None
    • Fix Version/s: 0.12.0, 0.11.1
    • Component/s: None
    • Labels:
      None

      Description

      One of the stages where statistics is very useful is the shuffle stage during query execution.Tajo also utilizes statistics for range shuffle.

      Currently, once gathering statistics is enabled, it is collected on every column of the input schema rather than the shuffle key columns. This may cause unnecessary overhead, so we need to collect statistics on only the shuffle keys.

        Attachments

          Activity

            People

            • Assignee:
              jihoonson Jihoon Son
              Reporter:
              jihoonson Jihoon Son
            • Votes:
              0 Vote for this issue
              Watchers:
              3 Start watching this issue

              Dates

              • Created:
                Updated:
                Resolved: