Apache YuniKorn / YUNIKORN-1178

Nil pointer dereference during schedule recovery

Details

    Description

      Observed a SIGSEGV during context recovery:

      2022-04-13T20:34:42.675Z    INFO    cache/node.go:109    add existing allocation    {"nodeID": "yk8s-worker", "allocation": "allocationKey:\"8bf3b33c-4dd9
      panic: runtime error: invalid memory address or nil pointer dereference
      [signal SIGSEGV: segmentation violation code=0x1 addr=0x58 pc=0x17b34d7]

      goroutine 196 [running]:
      github.com/apache/yunikorn-k8shim/pkg/cache.(*Context).recover(0xc00013d2c0, 0xc003de8210, 0x1, 0x1, 0x45d964b800, 0x1, 0xc003d03d28)
          /Users/ccondit/yunikorn/yunikorn-k8shim/pkg/cache/context_recovery.go:121 +0x1e77
      github.com/apache/yunikorn-k8shim/pkg/cache.(*Context).WaitForRecovery(0xc00013d2c0, 0xc003de8210, 0x1, 0x1, 0x45d964b800, 0xc003de8210, 0x0)
          /Users/ccondit/yunikorn/yunikorn-k8shim/pkg/cache/context_recovery.go:46 +0xb3
      github.com/apache/yunikorn-k8shim/pkg/shim.(*KubernetesShim).recoverSchedulerState.func1(0xc0005295c0)
          /Users/ccondit/yunikorn/yunikorn-k8shim/pkg/shim/scheduler.go:216 +0x1e5
      created by github.com/apache/yunikorn-k8shim/pkg/shim.(*KubernetesShim).recoverSchedulerState
          /Users/ccondit/yunikorn/yunikorn-k8shim/pkg/shim/scheduler.go:191 +0x3f
      Stream closed EOF for yunikorn/yunikorn-scheduler-5b67d97447-7mds6 (yunikorn-scheduler-k8s)
      

            People

              ccondit Craig Condit