IMPALA-9124

Transparently retry queries that fail due to cluster membership changes


Details

    • Type: New Feature
    • Status: In Progress
    • Priority: Critical
    • Resolution: Unresolved
    • Affects Version/s: None
    • Fix Version/s: None
    • Component/s: Backend, Clients
    • Labels: None

    Description

      Currently, if the Impala Coordinator or any Executor encounters an error during query execution, Impala fails the entire query. Transparently retrying the query for certain transient, recoverable errors would improve the user experience.

      This JIRA focuses on retrying queries that would otherwise fail due to cluster membership changes. Specifically, it covers two cases: node failures that change the cluster membership (currently, the Coordinator cancels all queries running on a node if it detects that the node is no longer part of the cluster), and node blacklisting (the Coordinator blacklists a node because it detects a problem with that node, for example it cannot execute RPCs against the node). It does not cover retrying general errors (e.g. frontend errors or MemLimitExceeded exceptions).
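
      To make the intended scope concrete, the following is a minimal sketch of the retry policy described above. It is not Impala code: `FailureKind`, `QueryFailure`, `run_with_retry`, the `max_retries` parameter, and the node names are all hypothetical and chosen only for illustration. It assumes that a failure carries the identity of the offending node, and that a retry simply re-runs the query on the remaining nodes.

```python
from enum import Enum, auto


class FailureKind(Enum):
    # Hypothetical failure categories, named after the cases in the description.
    NODE_FAILED = auto()         # node is no longer part of the cluster
    NODE_BLACKLISTED = auto()    # coordinator cannot execute RPCs against the node
    FRONTEND_ERROR = auto()      # general error, out of scope for retry
    MEM_LIMIT_EXCEEDED = auto()  # general error, out of scope for retry


# Only cluster membership changes are considered retryable in this sketch.
RETRYABLE = {FailureKind.NODE_FAILED, FailureKind.NODE_BLACKLISTED}


class QueryFailure(Exception):
    def __init__(self, kind, node=None):
        super().__init__(f"{kind.name} on {node}")
        self.kind = kind
        self.node = node


def run_with_retry(execute, nodes, max_retries=1):
    """Run execute(nodes); on a retryable failure, drop the bad node and retry."""
    healthy = list(nodes)
    attempt = 0
    while True:
        try:
            return execute(healthy)
        except QueryFailure as failure:
            # Non-retryable errors and exhausted retries surface to the client.
            if failure.kind not in RETRYABLE or attempt >= max_retries:
                raise
            healthy = [n for n in healthy if n != failure.node]
            if not healthy:
                raise
            attempt += 1


def flaky_execute(nodes):
    # Hypothetical executor: fails as long as "exec-2" is still scheduled.
    if "exec-2" in nodes:
        raise QueryFailure(FailureKind.NODE_FAILED, node="exec-2")
    return f"ok on {len(nodes)} nodes"


result = run_with_retry(flaky_execute, ["exec-1", "exec-2", "exec-3"])
```

      In this sketch the client only sees the final result. A `MEM_LIMIT_EXCEEDED` or `FRONTEND_ERROR` failure would be re-raised immediately, matching the statement that general errors are out of scope.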

            People

              Assignee: Sahil Takiar (stakiar)
              Reporter: Sahil Takiar (stakiar)
              Votes: 0
              Watchers: 7