Kudu / KUDU-2624

CatalogManagerTskITest.LeadershipChangeOnTskGeneration fails in DEBUG mode at assertion in RefCountedThreadSafe destructor

Details

    • Type: Bug
    • Status: Resolved
    • Priority: Minor
    • Resolution: Cannot Reproduce
    • Affects Version/s: 1.6.0, 1.7.0, 1.8.0, 1.7.1
    • Fix Version/s: NA
    • Component/s: master, test

    Description

      When the CatalogManager::ProcessTabletReport() method returns unexpectedly with an error, the destructor of AsyncDeleteReplica and the corresponding destructor of RefCountedThreadSafeBase are called:

      E1115 00:56:54.353895   378 catalog_manager.cc:3906] Error updating tablets from 643d19b068d9455ab8bbebc475716158: Illegal state: Replica 47262025ef764044bd3d269f7b524b2f is not leader of this config. Role: FOLLOWER. Consensus state: current_term: 72 committed_config { opid_index: -1 OBSOLETE_local: false peers { permanent_uuid: "299def043c104b1f8eaeed5f89344d07" member_type: VOTER last_known_addr { host: "127.0.56.126" port: 37561 } } peers { permanent_uuid: "07e81f368ada48f38b80fbab0b450dd9" member_type: VOTER last_known_addr { host: "127.0.56.125" port: 41377 } } peers { permanent_uuid: "47262025ef764044bd3d269f7b524b2f" member_type: VOTER last_known_addr { host: "127.0.56.124" port: 41201 } } }. Tablet report was: is_incremental: false updated_tablets { tablet_id: "fac11d48413a42aeb73e79549134d681" state: BOOTSTRAPPING consensus_state { current_term: 1 leader_uuid: "643d19b068d9455ab8bbebc475716158" committed_config { opid_index: -1 OBSOLETE_local: true peers { permanent_uuid: "643d19b068d9455ab8bbebc475716158" member_type: VOTER last_known_addr { host: "127.0.56.65" port: 42231 } health_report { overall_health: HEALTHY } } } } schema_version: 0 tablet_data_state: TABLET_DATA_READY } updated_tablets { tablet_id: "e960e0a4329e4b56bc3e53129c711837" state: SHUTDOWN schema_version: 0 tablet_data_state: TABLET_DATA_DELETED } sequence_number: 1460
      

      However, since instances of AsyncDeleteReplica are wrapped in std::unique_ptr rather than scoped_refptr, the object is deleted directly instead of through Release(), and the assertion in the RefCountedThreadSafeBase destructor triggers:

      F1115 00:56:54.354007   378 ref_counted.cc:69] Check failed: in_dtor_ RefCountedThreadSafe object deleted without calling Release()
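
The failure mode described above can be illustrated with a minimal, self-contained sketch. This is not Kudu's code: RefCountedSketch, DeleteTaskSketch, and g_bad_deletes are hypothetical stand-ins, and the sketch counts violations in a global instead of aborting the way the real DEBUG-mode check does. It only shows why destroying a ref-counted object through std::unique_ptr bypasses Release() and leaves the "in destructor" flag unset.

```cpp
#include <atomic>
#include <cassert>
#include <memory>

// Hypothetical counter: incremented whenever an object is destroyed
// without going through Release(). Real code would CHECK-fail instead.
static int g_bad_deletes = 0;

// Hypothetical stand-in for a thread-safe ref-counted base class.
class RefCountedSketch {
 public:
  void AddRef() { ref_count_.fetch_add(1); }

  // Drops one reference; the last reference marks the object as being
  // destroyed legitimately and then deletes it.
  void Release() {
    assert(ref_count_.load() > 0);
    if (ref_count_.fetch_sub(1) == 1) {
      in_dtor_ = true;
      delete this;
    }
  }

 protected:
  virtual ~RefCountedSketch() {
    // Mirrors the idea of the debug assertion: in_dtor_ is only set
    // when destruction was initiated by Release().
    if (!in_dtor_) ++g_bad_deletes;
  }

 private:
  std::atomic<int> ref_count_{0};
  bool in_dtor_ = false;
};

// Hypothetical task type playing the role of a ref-counted async task.
class DeleteTaskSketch : public RefCountedSketch {
 public:
  ~DeleteTaskSketch() override = default;
};

// Owning the task with std::unique_ptr deletes it directly on scope
// exit (for example, on an early error return), so Release() never runs.
int BadDeletesWithUniquePtr() {
  g_bad_deletes = 0;
  {
    std::unique_ptr<DeleteTaskSketch> task(new DeleteTaskSketch);
  }
  return g_bad_deletes;
}

// Owning the task through its reference count destroys it via Release(),
// which is what a scoped_refptr-style smart pointer would do.
int BadDeletesWithRefCounting() {
  g_bad_deletes = 0;
  DeleteTaskSketch* task = new DeleteTaskSketch;
  task->AddRef();
  task->Release();
  return g_bad_deletes;
}
```

In this sketch, BadDeletesWithUniquePtr() records one violation and BadDeletesWithRefCounting() records none, which matches the shape of the "deleted without calling Release()" message in the log above.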
      

      The full log is attached: catalog_manager_tsk-itest.txt.xz

      Attachments

        1. catalog_manager_tsk-itest.txt.xz
          93 kB
          Alexey Serbin

          People

            Assignee: Unassigned
            Reporter: Alexey Serbin (aserbin)