  1. HBase
  2. HBASE-19848

Zookeeper thread leaks in hbase-spark bulkLoad method

    Details

    • Type: Bug
    • Status: Resolved
    • Priority: Major
    • Resolution: Fixed
    • Affects Version/s: 1.2.0
    • Fix Version/s: None
    • Component/s: spark, Zookeeper
    • Labels:
    • Environment:

      hbase-spark 1.2.0-cdh5.12.1

      Spark 1.6

    • Hadoop Flags:
      Reviewed
    • Flags:
      Patch, Important

      Description

      In the hbase-spark project, HBaseContext provides a bulkLoad method for easily loading Spark RDD data into HBase. But when I call it frequently, the program throws a "cannot create native thread" exception.

      Running the pstack command against the Spark driver process shows that the thread count keeps increasing.

      jstack shows a large number of threads named "main-SendThread" and "main-EventThread".

      It seems that a connection is created before the bulk load, but its close method is never invoked afterwards.
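
      The suspected leak can be illustrated with a minimal sketch of the loan pattern, which closes a resource even when the body throws. This is not the attached patch. `FakeConnection`, `ConnectionCleanup` and `withResource` are hypothetical names used only for illustration; in real code the resource would presumably be the connection returned by `ConnectionFactory.createConnection(config)`, which is `Closeable`.

```scala
import java.util.concurrent.atomic.AtomicInteger

// Hypothetical stand-in for an HBase client Connection. A real connection
// owns ZooKeeper client threads that only exit once close() is called.
final class FakeConnection(liveCount: AtomicInteger) extends AutoCloseable {
  liveCount.incrementAndGet()
  private var closed = false

  // Idempotent close: the live counter is decremented at most once.
  override def close(): Unit = {
    if (!closed) {
      closed = true
      liveCount.decrementAndGet()
    }
  }
}

object ConnectionCleanup {
  // Loan pattern: open the resource, run the body, and always close the
  // resource afterwards, whether the body returns or throws.
  def withResource[C <: AutoCloseable, A](open: => C)(body: C => A): A = {
    val resource = open
    try body(resource)
    finally resource.close()
  }
}
```

      Under these assumptions, a caller would wrap the bulk load work in `ConnectionCleanup.withResource(...) { conn => ... }`, so that repeated calls do not accumulate open connections and their threads. A hand-written try/finally is used here because `scala.util.Using` is not available in the Scala versions that Spark 1.6 builds against.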

        Attachments

        1. HBaseContext.scala
          41 kB
          Key Hutu
        2. HBaseContext.patch
          16 kB
          Key Hutu
        3. HBASE-19848-V2.patch
          1 kB
          Key Hutu
        4. HBASE-19848-V3.patch
          0.7 kB
          Key Hutu

            People

            • Assignee:
              Key Hutu
              Reporter:
              Key Hutu
            • Votes:
              0
              Watchers:
              5

                Time Tracking

                Estimated: 72h
                Remaining: 72h
                Logged: Not Specified