Uploaded image for project: 'Apache Drill'
  1. Apache Drill
  2. DRILL-4207

Queries appear to hang in scenarios when hive plugin is enabled

Attach filesAttach ScreenshotAdd voteVotersWatch issueWatchersCreate sub-taskLinkCloneUpdate Comment AuthorReplace String in CommentUpdate Comment VisibilityDelete Comments
    XMLWordPrintableJSON

Details

    Description

      On a on-demand test environment, which brings up a Hadoop cluster with Drill & Hive, I observed that queries submitted to Drill appear to hang (but eventually do complete - after several minutes). This occurs frequently and can be reproduced.

      The web UI shows them to be in a PENDING state and additional queries cannot be submitted at that time (they too go into PENDING state). These queries also take a very long time to cancel. Basically the whole Drill setup is in a unusable state.

      Logs indicate that the query takes prolonged time in registering with the Hive metastore. It could be either the hive metastore hasn't come up completely or because Drill is having trouble communicating with Hive server. Log snippet attached.

      Disabling Hive helps avoid this issue.

      Since the queries being run are not related to Hive and do not need to use the Hive plugin, I'm not quite sure why Drill would need to connect to Hive in the first place. And if Drill does requires every query to register with storage plugins such as Hive, then there should possibly be a timeout if the query is unrelated to the plugin taking time to load/register.

      Attachments

        Issue Links

        Activity

          This comment will be Viewable by All Users Viewable by All Users
          Cancel

          People

            Unassigned Unassigned
            agirish Abhishek Girish

            Dates

              Created:
              Updated:

              Slack

                Issue deployment