CASSANDRA-1050

Too many splits for ColumnFamily with only a few rows

    Details

      Description

      ColumnFamilyInputFormat creates splits for the entire keyspace rather than for the ColumnFamily being read. If one ColumnFamily has 100 million rows and another has only 100 rows, the number of splits will be 1,526 (assuming 64k rows per split) for either one, because the count is based on the total number of unique keys across the whole keyspace and not on the number of rows in the ColumnFamily.
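The arithmetic behind the 1,526 figure can be checked with a short sketch. This is not the ColumnFamilyInputFormat code. It only reproduces the numbers in the description, assuming that "64k" means 65,536 rows, that the split count is rounded up, and that the two ColumnFamilies share no keys (so the keyspace total is the sum of their row counts). The names `split_count`, `big_cf`, and `small_cf` are hypothetical.

```python
import math

# Assumption: "64k rows per split" is read as 64 * 1024 = 65,536.
ROWS_PER_SPLIT = 64 * 1024


def split_count(row_count, rows_per_split=ROWS_PER_SPLIT):
    """Number of splits needed to cover row_count rows (at least one)."""
    return max(1, math.ceil(row_count / rows_per_split))


# Hypothetical keyspace matching the example in the description.
# Assumption: the ColumnFamilies have disjoint keys.
keyspace = {"big_cf": 100_000_000, "small_cf": 100}

# Reported behaviour: every ColumnFamily gets a split count derived
# from the key total of the whole keyspace.
total_keys = sum(keyspace.values())
splits_from_keyspace = split_count(total_keys)

# Expected behaviour: each ColumnFamily is sized by its own row count.
splits_per_cf = {name: split_count(rows) for name, rows in keyspace.items()}

print("splits based on keyspace total:", splits_from_keyspace)
for name, count in splits_per_cf.items():
    print(f"splits based on {name} alone:", count)
```

Under these assumptions the 100-row ColumnFamily needs a single split, yet it is assigned the same 1,526 splits as the 100-million-row one.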

        Attachments

        1. CASSANDRA-0.6-1050.patch (7 kB, Johan Oskarsson)
        2. CASSANDRA-1050.patch (7 kB, Johan Oskarsson)


            People

            • Assignee: Johan Oskarsson (johanoskarsson)
            • Reporter: Joost Ouwerkerk (joosto)
            • Votes: 0
            • Watchers: 4
