Details

    • Sub-task
    • Status: Resolved
    • Normal
    • Resolution: Fixed
    • 0.7 beta 1
    • None
    • None

    Description

      during flush and compaction we should keep row size statistics using EstimatedHistogram (column count, and row size), replacing min/max/total sizes in CFS.

      having this detail will let us estimate, given an index CF, how many nodes we need to query to get the number of matching rows requested by the client.

      Attachments

        1. 1155.txt
          21 kB
          Brandon Williams
        2. 1155-v2.txt
          22 kB
          Jonathan Ellis
        3. 1155-v3.txt
          28 kB
          Brandon Williams
        4. 1155-v4.txt
          25 kB
          Jonathan Ellis
        5. 1155-v5.txt
          35 kB
          Brandon Williams

        Activity

          People

            brandon.williams Brandon Williams
            jbellis Jonathan Ellis
            Brandon Williams
            Votes:
            0 Vote for this issue
            Watchers:
            2 Start watching this issue

            Dates

              Created:
              Updated:
              Resolved: