Uploaded image for project: 'Apache Ozone'
  1. Apache Ozone
  2. HDDS-4344

Block Deletion Performance Improvements

Log workAgile BoardRank to TopRank to BottomAttach filesAttach ScreenshotBulk Copy AttachmentsBulk Move AttachmentsAdd voteVotersWatch issueWatchersCreate sub-taskLinkCloneLabelsUpdate Comment AuthorReplace String in CommentUpdate Comment VisibilityDelete Comments
    XMLWordPrintableJSON

Details

    • Bug
    • Status: Open
    • Major
    • Resolution: Unresolved
    • None
    • None
    • None
    • None

    Description

      In cluster deployments it was observed that block deletion can be slow. For example if a user writes a million keys in Ozone, the time it takes for those million keys to be deleted from datanodes can be high. The jira would cover various improvements which can be made for better deletion speeds.

      Attachments

        1.
        Allow multiple transactions per container to be sent for deletion by SCM Sub-task Resolved Lokesh Jain Actions
        2.
        SCM deletion service should delete configured number of blocks every interval Sub-task Resolved Lokesh Jain Actions
        3.
        Configuration for deletion service intervals should be different for OM, SCM and datanodes Sub-task Resolved Aryan Gupta Actions
        4.
        Datanode should store the delete transaction as is in rocksDB Sub-task Resolved Aryan Gupta Actions
        5.
        Datanode deletion service can avoid storing deleted blocks Sub-task Resolved Aryan Gupta Actions
        6.
        SCM should create transactions using all blocks received from OM Sub-task Resolved Aryan Gupta Actions
        7.
        Avoid counting skipped transactions belonging to open containers Sub-task Resolved Lokesh Jain Actions
        8.
        With HA OM can send deletion blocks to SCM multiple times Sub-task Resolved Bharat Viswanadham Actions
        9.
        SCM can avoid using RETRIABLE_DATANODE_COMMAND for datanode deletion commands Sub-task Resolved Aryan Gupta Actions
        10.
        Datanode deletion config should be based on number of blocks Sub-task Resolved Aryan Gupta Actions
        11.
        SCM can avoid creating RetriableDatanodeEventWatcher for deletion command ACK Sub-task Resolved Aryan Gupta Actions
        12.
        Increase default value for key and block deletion limit Sub-task Resolved Lokesh Jain Actions
        13.
        SCM should avoid sending delete transactions for under-replicated containers Sub-task Resolved Aryan Gupta Actions
        14.
        Datanode should have deletion service thread per volume Sub-task Open YI-CHEN WANG Actions
        15.
        Datanode delete operations should not impact rocksDB cache Sub-task Open Unassigned Actions
        16.
        SCM should reduce RPC payLoad to OM for deletion ACK Sub-task Open Aryan Gupta Actions

        Activity

          This comment will be Viewable by All Users Viewable by All Users
          Cancel

          People

            ljain Lokesh Jain Assign to me
            ljain Lokesh Jain

            Dates

              Created:
              Updated:

              Slack

                Issue deployment