Uploaded image for project: 'HBase'
  1. HBase
  2. HBASE-28328

Add an option to count different types of Delete Markers in RowCounter

    XMLWordPrintableJSON

Details

    • Reviewed
    • Hide
      The JIRA adds a feature in RowCounter tool to count the various types of delete markers (DELETE_COLUMN, DELETE_FAMILY, DELETE_FAMILY_VERSION) and the number of rows containing at least one delete marker. The feature can be enabled by passing the flag --countDeleteMarkers as a CLI option. When the feature is enabled, raw scan is performed without FirstKeyOnlyFilter.
      Show
      The JIRA adds a feature in RowCounter tool to count the various types of delete markers (DELETE_COLUMN, DELETE_FAMILY, DELETE_FAMILY_VERSION) and the number of rows containing at least one delete marker. The feature can be enabled by passing the flag --countDeleteMarkers as a CLI option. When the feature is enabled, raw scan is performed without FirstKeyOnlyFilter.

    Description

      Add an option (count-delete-markers) to the RowCounter tool to count the number of Delete Markers of all types, i.e. (DELETE, DELETE_COLUMN, DELETE_FAMILY,DELETE_FAMILY_VERSION)

      We already have such a feature within our internal implementation of RowCounter and it's very useful.

      Implementation Ideas:
      1. If the option is passed, initialize the empty job counters for all 4 types of deletes.
      2. Within mapper, increase the respective delete counts while processing each row.

      Attachments

        Activity

          People

            shubhamroy Shubham Roy
            hgwalani81 Himanshu Gwalani
            Votes:
            0 Vote for this issue
            Watchers:
            6 Start watching this issue

            Dates

              Created:
              Updated:
              Resolved: