Phoenix / PHOENIX-6694

Avoid unnecessary table metadata fetches from the region servers holding the system tables in batch-oriented Spark or Hive jobs; otherwise those region servers become hotspots


Details

    • Type: Task
    • Status: Resolved
    • Priority: Major
    • Resolution: Fixed

    Description

      Currently we prepare the query plan in both the data source and the partition readers, which creates a new connection in each worker and at job initialisation. Each of these connections unnecessarily touches the system catalog table, the system stats table, and meta. Jobs with millions of parallel workers therefore hotspot the region servers holding meta, the system catalog table, and the system stats table. Sharing the same query plan between the workers avoids the hotspot.
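      The effect described above can be illustrated with a minimal, self-contained sketch. It does not use any Phoenix API: `Plan`, `MetadataService`, and the two `lookups...` methods are hypothetical stand-ins invented for this illustration, and the actual change in Phoenix may look quite different. The sketch only counts how often a simulated metadata service is contacted when every worker compiles its own plan, compared with compiling once on the driver and shipping the serialized plan to the workers.

```java
import java.io.ByteArrayInputStream;
import java.io.ByteArrayOutputStream;
import java.io.IOException;
import java.io.ObjectInputStream;
import java.io.ObjectOutputStream;
import java.io.Serializable;
import java.util.concurrent.atomic.AtomicInteger;

final class SharedPlanSketch {

    /** Hypothetical stand-in for a compiled query plan; not a Phoenix class. */
    static final class Plan implements Serializable {
        private static final long serialVersionUID = 1L;
        final String table;
        Plan(String table) { this.table = table; }
    }

    /** Hypothetical stand-in for the region servers hosting meta and the system tables. */
    static final class MetadataService {
        final AtomicInteger lookups = new AtomicInteger();
        Plan compile(String table) {
            lookups.incrementAndGet(); // every compilation costs one metadata round trip
            return new Plan(table);
        }
    }

    static byte[] serialize(Plan plan) {
        try (ByteArrayOutputStream bytes = new ByteArrayOutputStream();
             ObjectOutputStream out = new ObjectOutputStream(bytes)) {
            out.writeObject(plan);
            out.flush();
            return bytes.toByteArray();
        } catch (IOException e) {
            throw new IllegalStateException(e);
        }
    }

    static Plan deserialize(byte[] data) {
        try (ObjectInputStream in = new ObjectInputStream(new ByteArrayInputStream(data))) {
            return (Plan) in.readObject();
        } catch (IOException | ClassNotFoundException e) {
            throw new IllegalStateException(e);
        }
    }

    /** The driver and every worker each compile the plan themselves. */
    static int lookupsWhenEachWorkerPlans(int workers) {
        MetadataService service = new MetadataService();
        service.compile("T"); // job initialisation
        for (int i = 0; i < workers; i++) {
            service.compile("T"); // each partition reader plans again
        }
        return service.lookups.get();
    }

    /** The driver compiles once; workers only deserialize the shipped plan. */
    static int lookupsWhenPlanIsShared(int workers) {
        MetadataService service = new MetadataService();
        byte[] shipped = serialize(service.compile("T"));
        for (int i = 0; i < workers; i++) {
            Plan plan = deserialize(shipped); // no metadata round trip here
            if (!"T".equals(plan.table)) {
                throw new IllegalStateException("plan corrupted in transit");
            }
        }
        return service.lookups.get();
    }

    public static void main(String[] args) {
        System.out.println(lookupsWhenEachWorkerPlans(1000)); // prints 1001
        System.out.println(lookupsWhenPlanIsShared(1000));    // prints 1
    }
}
```

      In this toy model the number of metadata lookups grows linearly with the worker count in the first case and stays constant in the second, which is the property the issue relies on to remove the hotspot.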


          People

            rajeshbabu Rajeshbabu Chintaguntla
            Votes: 0
            Watchers: 3
