Uploaded image for project: 'Hive'
  1. Hive
  2. HIVE-19475

Issue when streaming data to Azure Data Lake Store

    XMLWordPrintableJSON

Details

    • Bug
    • Status: Open
    • Major
    • Resolution: Unresolved
    • 2.2.0
    • None
    • Streaming
    • None

    Description

      I am trying to stream data from a Java (Play2 api) to  HDInsight Hive interactive query with Azure Data Lake Store as storage back-end. The following code is ran on one of the head nodes of the cluster.

      When fetching a transaction-batch:

      TransactionBatch txnBatch = this.connection.fetchTransactionBatch(10, (RecordWriter)writer);
      

      I receive the following error:

      play.api.UnexpectedException: Unexpected exception[StreamingIOFailure: Failed creating RecordUpdaterS for adl://home/hive/warehouse/raw_telemetry_data/ingest_date=2018-05-07 txnIds[506,515]]
          at play.api.http.HttpErrorHandlerExceptions$.throwableToUsefulException(HttpErrorHandler.scala:251)
          at play.api.http.DefaultHttpErrorHandler.onServerError(HttpErrorHandler.scala:182)
          at play.core.server.AkkaHttpServer$$anonfun$2.applyOrElse(AkkaHttpServer.scala:343)
          at play.core.server.AkkaHttpServer$$anonfun$2.applyOrElse(AkkaHttpServer.scala:341)
          at scala.concurrent.Future.$anonfun$recoverWith$1(Future.scala:414)
          at scala.concurrent.impl.Promise.$anonfun$transformWith$1(Promise.scala:37)
          at scala.concurrent.impl.CallbackRunnable.run(Promise.scala:60)
          at akka.dispatch.BatchingExecutor$AbstractBatch.processBatch(BatchingExecutor.scala:55)
          at akka.dispatch.BatchingExecutor$BlockableBatch.$anonfun$run$1(BatchingExecutor.scala:91)
          at scala.runtime.java8.JFunction0$mcV$sp.apply(JFunction0$mcV$sp.java:12)
      Caused by: org.apache.hive.hcatalog.streaming.StreamingIOFailure: Failed creating RecordUpdaterS for adl://home/hive/warehouse/raw_telemetry_data/ingest_date=2018-05-07 txnIds[506,515]
          at org.apache.hive.hcatalog.streaming.AbstractRecordWriter.newBatch(AbstractRecordWriter.java:208)
          at org.apache.hive.hcatalog.streaming.HiveEndPoint$TransactionBatchImpl.<init>(HiveEndPoint.java:608)
          at org.apache.hive.hcatalog.streaming.HiveEndPoint$TransactionBatchImpl.<init>(HiveEndPoint.java:556)
          at org.apache.hive.hcatalog.streaming.HiveEndPoint$ConnectionImpl.fetchTransactionBatchImpl(HiveEndPoint.java:442)
          at org.apache.hive.hcatalog.streaming.HiveEndPoint$ConnectionImpl.fetchTransactionBatch(HiveEndPoint.java:422)
          at hive.HiveRepository.createMany(HiveRepository.java:76)
          at controllers.HiveController.create(HiveController.java:40)
          at router.Routes$$anonfun$routes$1.$anonfun$applyOrElse$2(Routes.scala:70)
          at play.core.routing.HandlerInvokerFactory$$anon$4.resultCall(HandlerInvoker.scala:137)
          at play.core.routing.HandlerInvokerFactory$JavaActionInvokerFactory$$anon$8$$anon$2$$anon$1.invocation(HandlerInvoker.scala:108)
      Caused by: java.io.IOException: No FileSystem for scheme: adl
          at org.apache.hadoop.fs.FileSystem.getFileSystemClass(FileSystem.java:2798)
          at org.apache.hadoop.fs.FileSystem.createFileSystem(FileSystem.java:2809)
          at org.apache.hadoop.fs.FileSystem.access$200(FileSystem.java:100)
          at org.apache.hadoop.fs.FileSystem$Cache.getInternal(FileSystem.java:2848)
          at org.apache.hadoop.fs.FileSystem$Cache.get(FileSystem.java:2830)
          at org.apache.hadoop.fs.FileSystem.get(FileSystem.java:389)
          at org.apache.hadoop.fs.Path.getFileSystem(Path.java:356)
          at org.apache.hadoop.hive.ql.io.orc.OrcRecordUpdater.<init>(OrcRecordUpdater.java:187)
          at org.apache.hadoop.hive.ql.io.orc.OrcOutputFormat.getRecordUpdater(OrcOutputFormat.java:278)
          at org.apache.hive.hcatalog.streaming.AbstractRecordWriter.createRecordUpdater(AbstractRecordWriter.java:268)

       

      Any help would be greatly appreciated.

       

       
       

      Attachments

        Activity

          People

            Unassigned Unassigned
            thomas_nys Thomas Nys
            Votes:
            0 Vote for this issue
            Watchers:
            2 Start watching this issue

            Dates

              Created:
              Updated: