Uploaded image for project: 'Apache Ozone'
  1. Apache Ozone
  2. HDDS-10421

[Snapshot] OM corruption due to NullPointerException

    XMLWordPrintableJSON

Details

    • Bug
    • Status: Resolved
    • Major
    • Resolution: Duplicate
    • None
    • None
    • Snapshot

    Description

      Scenario :

      • Generate data over parallel threads over various volume/buckets
      • Perform parallel snapshot create/delete/list operations over above buckets
      • Perform parallel snapdiff operations over each bucket
      • Perform parallel read operations of snapshot contents

      Nullpointer exception in OM logs -

      2024-02-14 21:50:59,574 [OM StateMachine ApplyTransaction Thread - 0] WARN org.apache.hadoop.ozone.om.snapshot.SnapshotCache: Key: '/vol-yu9w1/buck-kmkp4/snap1707922796' does not exist in cache.
      2024-02-14 21:50:59,907 [OM StateMachine ApplyTransaction Thread - 0] ERROR org.apache.hadoop.ozone.om.ratis.OzoneManagerStateMachine: Terminating with exit status 1: Request cmdType: SnapshotPurge
      clientId: "client-62D7790F8E6C"
      SnapshotPurgeRequest {
        snapshotDBKeys: "/vol-yu9w1/buck-kmkp4/snap1707922796"
      }
       failed with exception
      java.lang.NullPointerException
              at org.apache.hadoop.ozone.om.snapshot.SnapshotUtils.getNextActiveSnapshot(SnapshotUtils.java:152)
              at org.apache.hadoop.ozone.om.request.snapshot.OMSnapshotPurgeRequest.validateAndUpdateCache(OMSnapshotPurgeRequest.java:99)
              at org.apache.hadoop.ozone.protocolPB.OzoneManagerRequestHandler.handleWriteRequest(OzoneManagerRequestHandler.java:375)
              at org.apache.hadoop.ozone.om.ratis.OzoneManagerStateMachine.runCommand(OzoneManagerStateMachine.java:568)
              at org.apache.hadoop.ozone.om.ratis.OzoneManagerStateMachine.lambda$1(OzoneManagerStateMachine.java:359)
              at java.util.concurrent.CompletableFuture$AsyncSupply.run(CompletableFuture.java:1604)
              at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1149)
              at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:624)
              at java.lang.Thread.run(Thread.java:748)
      2024-02-14 21:50:59,912 [shutdown-hook-0] INFO org.apache.ranger.audit.provider.AuditProviderFactory: ==> JVMShutdownHook.run() 

      Attachments

        Issue Links

          Activity

            People

              Unassigned Unassigned
              jyosin Jyotirmoy Sinha
              Votes:
              0 Vote for this issue
              Watchers:
              2 Start watching this issue

              Dates

                Created:
                Updated:
                Resolved: