IMPALA-7551

Inaccurate timeline for "Rows Available"

    Details

      Description

      While debugging IMPALA-6932, we noticed that the "Rows Available" entry in the query profile timeline was recorded after only about 1 second for a long-running LIMIT 1 query that took about 1 hour.

      Currently, "Rows Available" tracks when Open() on the top-most node in the plan returns, not when the first row is actually produced. This can be misleading. A more accurate timeline would mark "Rows Available" when the first non-empty batch is added to the PlanRootSink.

      We should consider changing the definition of the FINISHED state accordingly as well, so that we don't transition to FINISHED until a row is actually available to fetch immediately.
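
      The difference between the two definitions can be illustrated with a toy model. This is only a sketch under stated assumptions: ToyRootSink, on_open_returned, add_batch, and the state strings are hypothetical names chosen for illustration and are not Impala's actual PlanRootSink interface. Treating end-of-stream with no rows as a reason to move to FINISHED is likewise an assumption, not something this issue specifies.

```python
from dataclasses import dataclass, field
from typing import List, Optional, Tuple


@dataclass
class ToyRootSink:
    """Hypothetical stand-in for a plan root sink; NOT Impala's real API."""

    timeline: List[Tuple[str, float]] = field(default_factory=list)
    state: str = "RUNNING"
    rows_available_at: Optional[float] = None

    def on_open_returned(self, now: float) -> None:
        # Behavior described in the issue: Open() returning is the moment
        # that currently gets reported. Here it is logged under its own
        # label and deliberately does not change the query state.
        self.timeline.append(("Open() returned", now))

    def add_batch(self, rows: list, now: float, eos: bool = False) -> None:
        # Proposed behavior: only the first non-empty batch marks
        # "Rows Available" on the timeline.
        if rows and self.rows_available_at is None:
            self.rows_available_at = now
            self.timeline.append(("Rows Available", now))
        # Assumption: FINISHED means a row can be fetched immediately, or
        # the stream has ended so a fetch would return without blocking.
        if rows or eos:
            self.state = "FINISHED"


# A LIMIT 1 query whose Open() returns after 1 second but whose first row
# only shows up after an hour (times are seconds since query start).
sink = ToyRootSink()
sink.on_open_returned(now=1.0)
state_after_open = sink.state
sink.add_batch([], now=1800.0)           # empty batch: nothing to fetch yet
state_after_empty_batch = sink.state
sink.add_batch([("row",)], now=3600.0)   # first non-empty batch
```

      In this model the query stays in RUNNING through Open() and the empty batch, and both the "Rows Available" entry and the FINISHED transition land at the 3600-second mark, when a row can actually be fetched.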

              People

              • Assignee:
                stakiar Sahil Takiar
                Reporter:
                poojanilangekar Pooja Nilangekar
              • Votes:
                0
                Watchers:
                5
