Uploaded image for project: 'Hive'
  1. Hive
  2. HIVE-10817

Blacklist For Bad MetaStore

    XMLWordPrintableJSON

Details

    • Improvement
    • Status: Open
    • Major
    • Resolution: Unresolved
    • 1.2.0
    • None
    • HiveServer2, Metastore
    • None

    Description

      During a reliability test ,when one of MetaStore 's machine power down ,HiveServer2 then never submit jobs to YARN.
      There are 100 JDBC clients (Beeline) running concurrently.And all the 100 JDBC clients hangs.
      After checking HiveServer2's thread stack,i find that most of the threads waiting to lock AbstractService while the one holding it is trying to connect to
      the bad MetaStore which has been power down.When the thread which hold this lock finally return SocketTimeoutException and release this lock,another thread will hold this lock and again stuck until socket time out.
      Adding a new blacklist mechanism finally solved this issue.

      Attachments

        1. HIVE-10817
          3 kB
          Nemon Lou

        Activity

          People

            nemon Nemon Lou
            nemon Nemon Lou
            Votes:
            0 Vote for this issue
            Watchers:
            1 Start watching this issue

            Dates

              Created:
              Updated: