Uploaded image for project: 'Tajo (Retired)'
  1. Tajo (Retired)
  2. TAJO-1975

Gathering fine-grained column statistics for range shuffle

    XMLWordPrintableJSON

Details

    • Task
    • Status: Resolved
    • Major
    • Resolution: Fixed
    • None
    • 0.12.0, 0.11.1
    • None
    • None

    Description

      One of the stages where statistics is very useful is the shuffle stage during query execution.Tajo also utilizes statistics for range shuffle.

      Currently, once gathering statistics is enabled, it is collected on every column of the input schema rather than the shuffle key columns. This may cause unnecessary overhead, so we need to collect statistics on only the shuffle keys.

      Attachments

        Activity

          People

            jihoonson Jihoon Son
            jihoonson Jihoon Son
            Votes:
            0 Vote for this issue
            Watchers:
            3 Start watching this issue

            Dates

              Created:
              Updated:
              Resolved: