Uploaded image for project: 'Apache Trafodion (Retired)'
  1. Apache Trafodion (Retired)
  2. TRAFODION-3223

Row count estimation code works poorly on time-ordered aged-out data

    XMLWordPrintableJSON

Details

    • Bug
    • Status: Resolved
    • Major
    • Resolution: Fixed
    • any
    • 2.4
    • sql-cmp
    • None

    Description

      The estimateRowCountBody method in module HBaseClient.java samples cells from the first 500 rows from the first HFile it sees in order to estimate the number of rows in a Trafodion table. If the table happens to have a time-ordered key, and data are aged out over time, we can get large clumps of "delete" tombstones in one or more HFiles. If estimateRowCountBody happens to look at such an HFile, it will incorrectly conclude that most cells are "delete" tombstones and therefore drastically underestimate the row count.

      Attachments

        Issue Links

          Activity

            People

              dbirdsall Dave Birdsall
              dbirdsall Dave Birdsall
              Votes:
              0 Vote for this issue
              Watchers:
              2 Start watching this issue

              Dates

                Created:
                Updated:
                Resolved: