Uploaded image for project: 'Phoenix'
  1. Phoenix
  2. PHOENIX-5092 Client like PHERF tool's thread dies because of unhandled exception in MutationState#commit()
  3. PHOENIX-5194

Thread Cache is not update for Index retries in for MutationState#send()#doMutation()

    XMLWordPrintableJSON

    Details

    • Type: Sub-task
    • Status: Closed
    • Priority: Major
    • Resolution: Resolved
    • Affects Version/s: 4.14.0, 5.0.0, 4.15.0, 4.14.1
    • Fix Version/s: 4.15.0, 5.1.0, 4.14.2
    • Component/s: None
    • Labels:

      Description

      Wwhen Client is writing and Index Failures happens, MutationState#send() will use PhoenixIndexFailurePolicy#doBatchWithRetries to apply index mutations. If during this retires Index region and Data table region moves , Index/Data table region location cache does not get updated. Because of this client is keep trying to write in same location and get failures. After all retries are finished, it will simply disable Index and aborts the client thread.

      2019-03-08 09:41:32,678 WARN [pool-8-thread-25] execute.MutationState - THREAD_ABORT MutationState#send(Iterator<TableRef>) :-
      org.apache.hadoop.hbase.client.RetriesExhaustedWithDetailsException: Failed 36 actions: org.apache.hadoop.hbase.DoNotRetryIOException: ERROR 2008 (INT10): ERROR 2008 (INT10): Unable to find cached index metadata. key=1873403620592046670 region=PHERF:TABLE1,1552037797977.20beae29172b4bec422a6984e088eeae.host=phoenix-host1,60020,1552037496260 Index update failed
      at org.apache.phoenix.util.ServerUtil.createIOException(ServerUtil.java:112)
      at org.apache.phoenix.util.ServerUtil.throwIOException(ServerUtil.java:86)
      at org.apache.phoenix.index.PhoenixIndexMetaDataBuilder.getIndexMetaDataCache(PhoenixIndexMetaDataBuilder.java:101)
      at org.apache.phoenix.index.PhoenixIndexMetaDataBuilder.getIndexMetaData(PhoenixIndexMetaDataBuilder.java:51)
      at org.apache.phoenix.index.PhoenixIndexBuilder.getIndexMetaData(PhoenixIndexBuilder.java:100)
      at org.apache.phoenix.index.PhoenixIndexBuilder.getIndexMetaData(PhoenixIndexBuilder.java:73)
      at org.apache.phoenix.hbase.index.builder.IndexBuildManager.getIndexMetaData(IndexBuildManager.java:79)
      at org.apache.phoenix.hbase.index.Indexer.preBatchMutateWithExceptions(Indexer.java:385)
      at org.apache.phoenix.hbase.index.Indexer.preBatchMutate(Indexer.java:345)
      at org.apache.hadoop.hbase.regionserver.RegionCoprocessorHost$35.call(RegionCoprocessorHost.java:1025)
      at org.apache.hadoop.hbase.regionserver.RegionCoprocessorHost$RegionOperation.call(RegionCoprocessorHost.java:1693)
      at org.apache.hadoop.hbase.regionserver.RegionCoprocessorHost.execOperation(RegionCoprocessorHost.java:1771)
      at org.apache.hadoop.hbase.regionserver.RegionCoprocessorHost.execOperation(RegionCoprocessorHost.java:1727)
      at org.apache.hadoop.hbase.regionserver.RegionCoprocessorHost.preBatchMutate(RegionCoprocessorHost.java:1021)
      at org.apache.hadoop.hbase.regionserver.HRegion.doMiniBatchMutation(HRegion.java:3309)
      at org.apache.hadoop.hbase.regionserver.HRegion.batchMutate(HRegion.java:3076)
      at org.apache.hadoop.hbase.regionserver.HRegion.batchMutate(HRegion.java:3018)
      at org.apache.hadoop.hbase.regionserver.RSRpcServices.doBatchOp(RSRpcServices.java:914)
      at org.apache.hadoop.hbase.regionserver.RSRpcServices.doNonAtomicRegionMutation(RSRpcServices.java:842)
      at org.apache.hadoop.hbase.regionserver.RSRpcServices.multi(RSRpcServices.java:2397)
      at org.apache.hadoop.hbase.protobuf.generated.ClientProtos$ClientService$2.callBlockingMethod(ClientProtos.java:35080)
      at org.apache.hadoop.hbase.ipc.RpcServer.call(RpcServer.java:2399)
      at org.apache.hadoop.hbase.ipc.CallRunner.run(CallRunner.java:124)
      at org.apache.hadoop.hbase.ipc.RpcExecutor$Handler.run(RpcExecutor.java:188)
      at org.apache.hadoop.hbase.ipc.RpcExecutor$Handler.run(RpcExecutor.java:168)
      Caused by: java.sql.SQLException: ERROR 2008 (INT10): Unable to find cached index metadata. key=1873403620592046670 region=PHERF:TABLE1,1552037797977.20beae29172b4bec422a6984e088eeae.host=phoenix-host1,60020,1552037496260
      at org.apache.phoenix.exception.SQLExceptionCode$Factory$1.newException(SQLExceptionCode.java:494)
      at org.apache.phoenix.exception.SQLExceptionInfo.buildException(SQLExceptionInfo.java:150)
      at org.apache.phoenix.index.PhoenixIndexMetaDataBuilder.getIndexMetaDataCache(PhoenixIndexMetaDataBuilder.java:100)
      ... 22 more
      : 36 times, servers with issues: phoenix-host1,60020,1552037496260
      at org.apache.hadoop.hbase.client.AsyncProcess$BatchErrors.makeException(AsyncProcess.java:260)
      at org.apache.hadoop.hbase.client.AsyncProcess$BatchErrors.access$2400(AsyncProcess.java:240)
      at org.apache.hadoop.hbase.client.AsyncProcess$AsyncRequestFutureImpl.getErrors(AsyncProcess.java:1711)
      at org.apache.hadoop.hbase.client.HTable.batch(HTable.java:917)
      at org.apache.hadoop.hbase.client.HTable.batch(HTable.java:931)
      at org.apache.phoenix.execute.MutationState$3.doMutation(MutationState.java:992)
      at org.apache.phoenix.index.PhoenixIndexFailurePolicy.doBatchWithRetries(PhoenixIndexFailurePolicy.java:480)
      at org.apache.phoenix.execute.MutationState.send(MutationState.java:988)
      at org.apache.phoenix.execute.MutationState.send(MutationState.java:1368)
      at org.apache.phoenix.execute.MutationState.commit(MutationState.java:1188)
      at org.apache.phoenix.jdbc.PhoenixConnection$3.call(PhoenixConnection.java:670)
      at org.apache.phoenix.jdbc.PhoenixConnection$3.call(PhoenixConnection.java:666)
      at org.apache.phoenix.call.CallRunner.run(CallRunner.java:53)
      at org.apache.phoenix.jdbc.PhoenixConnection.commit(PhoenixConnection.java:666)
      at org.apache.phoenix.pherf.workload.WriteWorkload$2.call(WriteWorkload.java:297)
      at org.apache.phoenix.pherf.workload.WriteWorkload$2.call(WriteWorkload.java:256)
      at java.util.concurrent.FutureTask.run(FutureTask.java:266)
      at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1149)
      at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:624)
      at java.lang.Thread.run(Thread.java:748)
      

        Attachments

        1. PHOENIX-5194.patch
          6 kB
          Mihir Monani
        2. PHOENIX-5194-4.x-HBase-1.3.01.patch
          3 kB
          Mihir Monani
        3. PHOENIX-5194-4.x-HBase-1.3.02.patch
          3 kB
          Mihir Monani
        4. PHOENIX-5194-4.x-HBase-1.3.03.patch
          7 kB
          Mihir Monani
        5. PHOENIX-5194-4.x-HBase-1.3.04.patch
          7 kB
          Mihir Monani
        6. PHOENIX-5194-4.x-HBase-1.3.05.patch
          8 kB
          Mihir Monani
        7. PHOENIX-5194-4.x-HBase-1.3.06.patch
          6 kB
          Mihir Monani

          Issue Links

            Activity

              People

              • Assignee:
                mihir6692 Mihir Monani
                Reporter:
                mihir6692 Mihir Monani
              • Votes:
                0 Vote for this issue
                Watchers:
                8 Start watching this issue

                Dates

                • Created:
                  Updated:
                  Resolved: