Uploaded image for project: 'Spark'
  1. Spark
  2. SPARK-18641

Show databases NullPointerException while Sentry turned on

    XMLWordPrintableJSON

Details

    • Bug
    • Status: Closed
    • Major
    • Resolution: Invalid
    • 2.0.0
    • None
    • SQL
    • None
    • CentOS 6.5 / Hive 1.1.0 / Sentry 1.5.1

    Description

      I've traced into source code, and it seems that <hive.sentry.subject.name> of Sentry not set when spark sql started a session. This operation should be done in org.apache.sentry.binding.hive.HiveAuthzBindingSessionHook which is not called in spark sql.

      Edit: I copyed hive-site.xml(which turns on Sentry) and all sentry jars into spark's classpath.

      Here is the stack:

      16/11/30 10:54:50 WARN SentryMetaStoreFilterHook: Error getting DB list
      java.lang.NullPointerException
              at java.util.concurrent.ConcurrentHashMap.hash(ConcurrentHashMap.java:333)
              at java.util.concurrent.ConcurrentHashMap.get(ConcurrentHashMap.java:988)
              at org.apache.hadoop.security.Groups.getGroups(Groups.java:162)
              at org.apache.sentry.provider.common.HadoopGroupMappingService.getGroups(HadoopGroupMappingService.java:60)
              at org.apache.sentry.binding.hive.HiveAuthzBindingHook.getHiveBindingWithPrivilegeCache(HiveAuthzBindingHook.java:956)
              at org.apache.sentry.binding.hive.HiveAuthzBindingHook.filterShowDatabases(HiveAuthzBindingHook.java:826)
              at org.apache.sentry.binding.metastore.SentryMetaStoreFilterHook.filterDb(SentryMetaStoreFilterHook.java:131)
              at org.apache.sentry.binding.metastore.SentryMetaStoreFilterHook.filterDatabases(SentryMetaStoreFilterHook.java:59)
              at org.apache.hadoop.hive.metastore.HiveMetaStoreClient.getAllDatabases(HiveMetaStoreClient.java:1031)
              at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
              at sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:57)
              at sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43)
              at java.lang.reflect.Method.invoke(Method.java:606)
              at org.apache.hadoop.hive.metastore.RetryingMetaStoreClient.invoke(RetryingMetaStoreClient.java:156)
              at com.sun.proxy.$Proxy38.getAllDatabases(Unknown Source)
              at org.apache.hadoop.hive.ql.metadata.Hive.getAllDatabases(Hive.java:1234)
              at org.apache.hadoop.hive.ql.metadata.Hive.reloadFunctions(Hive.java:174)
              at org.apache.hadoop.hive.ql.metadata.Hive.<clinit>(Hive.java:166)
              at org.apache.hadoop.hive.ql.session.SessionState.start(SessionState.java:503)
              at org.apache.spark.sql.hive.client.HiveClientImpl.<init>(HiveClientImpl.scala:170)
              at sun.reflect.NativeConstructorAccessorImpl.newInstance0(Native Method)
              at sun.reflect.NativeConstructorAccessorImpl.newInstance(NativeConstructorAccessorImpl.java:57)
              at sun.reflect.DelegatingConstructorAccessorImpl.newInstance(DelegatingConstructorAccessorImpl.java:45)
              at java.lang.reflect.Constructor.newInstance(Constructor.java:526)
              at org.apache.spark.sql.hive.client.IsolatedClientLoader.createClient(IsolatedClientLoader.scala:258)
              at org.apache.spark.sql.hive.HiveUtils$.newClientForMetadata(HiveUtils.scala:359)
              at org.apache.spark.sql.hive.HiveUtils$.newClientForMetadata(HiveUtils.scala:263)
              at org.apache.spark.sql.hive.HiveSharedState.metadataHive$lzycompute(HiveSharedState.scala:39)
              at org.apache.spark.sql.hive.HiveSharedState.metadataHive(HiveSharedState.scala:38)
              at org.apache.spark.sql.hive.HiveSessionState.metadataHive$lzycompute(HiveSessionState.scala:43)
              at org.apache.spark.sql.hive.HiveSessionState.metadataHive(HiveSessionState.scala:43)
              at org.apache.spark.sql.hive.thriftserver.SparkSQLEnv$.init(SparkSQLEnv.scala:62)
              at org.apache.spark.sql.hive.thriftserver.HiveThriftServer2$.main(HiveThriftServer2.scala:84)
              at org.apache.spark.sql.hive.thriftserver.HiveThriftServer2.main(HiveThriftServer2.scala)
              at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
              at sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:57)
              at sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43)
              at java.lang.reflect.Method.invoke(Method.java:606)
              at org.apache.spark.deploy.SparkSubmit$.org$apache$spark$deploy$SparkSubmit$$runMain(SparkSubmit.scala:729)
              at org.apache.spark.deploy.SparkSubmit$.doRunMain$1(SparkSubmit.scala:185)
              at org.apache.spark.deploy.SparkSubmit$.submit(SparkSubmit.scala:210)
              at org.apache.spark.deploy.SparkSubmit$.main(SparkSubmit.scala:124)
              at org.apache.spark.deploy.SparkSubmit.main(SparkSubmit.scala)
      

      Attachments

        Activity

          People

            Unassigned Unassigned
            zhangqw zhangqw
            Votes:
            0 Vote for this issue
            Watchers:
            2 Start watching this issue

            Dates

              Created:
              Updated:
              Resolved: