CASSANDRA-3945

Support incremental/batch sizes for BulkRecordWriter, due to GC overhead issues

    Details

    • Type: Bug
    • Status: Resolved
    • Priority: Minor
    • Resolution: Duplicate
    • Fix Version/s: None
    • Component/s: None
    • Labels:
      None

      Description

      When loading large amounts of data, the BulkRecordWriter currently writes out all the sstables and only then streams them. This caused GC overhead issues for us, due to the heap sizes of our reducers. We ran into a problem where the number of SSTables on disk that had to be open would cause the JVM process to die. We also wanted a way to stream them incrementally as they are created. I added a setting for this. The default behavior is still to wait until all sstables have been created, but if you set the value to >= 1, it determines the batch size.
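
A minimal sketch of the batching behavior described above, not the actual patch. The names `BatchedStreamer`, `batchSize`, `onSSTableFinished` and `streamFn` are hypothetical and are not part of BulkRecordWriter or any Cassandra API. It also assumes that a value of 0 stands for the default "wait for everything" mode.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.function.Consumer;

// Hypothetical illustration only: not the BulkRecordWriter API.
class BatchedStreamer {
    private final int batchSize;                    // assumed: 0 = stream everything at close()
    private final Consumer<List<String>> streamFn;  // stands in for the real streaming step
    private final List<String> pending = new ArrayList<>();

    BatchedStreamer(int batchSize, Consumer<List<String>> streamFn) {
        if (batchSize < 0) {
            throw new IllegalArgumentException("batchSize must be >= 0");
        }
        this.batchSize = batchSize;
        this.streamFn = streamFn;
    }

    // Called each time an sstable has been fully written to disk.
    void onSSTableFinished(String path) {
        pending.add(path);
        // With batchSize >= 1, stream as soon as a full batch is available,
        // so only a bounded number of sstables is held at once.
        if (batchSize >= 1 && pending.size() >= batchSize) {
            flush();
        }
    }

    // Called once at the end of the job; streams whatever is left.
    void close() {
        if (!pending.isEmpty()) {
            flush();
        }
    }

    private void flush() {
        streamFn.accept(new ArrayList<>(pending)); // hand over a copy of the batch
        pending.clear();
    }
}
```

With `batchSize` set to 0 the sketch reproduces the write-everything-then-stream behavior. With a value of 2, for example, the streaming step would run after every second finished sstable and once more in `close()` for any remainder.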

        Issue Links

          Activity

          Transition: Open -> Resolved
          Time In Source Status: 491d 16h 2m
          Execution Times: 1
          Last Executer: Jonathan Ellis
          Last Execution Date: 28/Jun/13 00:15
          Jonathan Ellis made changes -
          Status Open [ 1 ] Resolved [ 5 ]
          Assignee Chris Goffinet [ lenn0x ]
          Resolution Duplicate [ 3 ]
          Jonathan Ellis added a comment -

          CASSANDRA-5555 fixes this

          Gavin made changes -
          Workflow patch-available, re-open possible [ 12749525 ] reopen-resolved, no closed status, patch-avail, testing [ 12757076 ]
          Gavin made changes -
          Workflow no-reopen-closed, patch-avail [ 12654352 ] patch-available, re-open possible [ 12749525 ]
          Jonathan Ellis made changes -
          Fix Version/s 1.1.2 [ 12321445 ]
          Jonathan Ellis made changes -
          Fix Version/s 1.1.2 [ 12321445 ]
          Fix Version/s 1.1.1 [ 12319857 ]
          Jonathan Ellis added a comment -

          3859 has been committed.

          Sylvain Lebresne made changes -
          Fix Version/s 1.1.1 [ 12319857 ]
          Fix Version/s 1.1.0 [ 12317615 ]
          Sylvain Lebresne added a comment -

          Moving to 1.1.1. This sounds too involved and too late for 1.1.0.

          Chris Goffinet added a comment -

          Need to wait on CASSANDRA-3859 getting committed before I can submit a patch.

          Chris Goffinet made changes -
          Link This issue is blocked by CASSANDRA-3859 [ CASSANDRA-3859 ]
          Chris Goffinet made changes -
          Field: Description
          Original Value: When loading large amounts of data, currently the BulkRecordWriter will write out all the sstables, then stream them. This actually caused us GC overhead issues, due to our heap sizes for reducers. We ran into a problem where the number of SSTables on disk that had to be open would cause the jvm process to die. We also wanted a watch to incrementally stream them as we created them. I created support for setting this, the default behavior is wait for them to be created. But if you increase to >= 1, you can determine the batch size.
          New Value: When loading large amounts of data, currently the BulkRecordWriter will write out all the sstables, then stream them. This actually caused us GC overhead issues, due to our heap sizes for reducers. We ran into a problem where the number of SSTables on disk that had to be open would cause the jvm process to die. We also wanted a way to incrementally stream them as we created them. I created support for setting this, the default behavior is wait for them to be created. But if you increase to >= 1, you can determine the batch size.
          Chris Goffinet created issue -

            People

            • Assignee:
              Unassigned
              Reporter:
              Chris Goffinet
            • Votes:
              0
              Watchers:
              0
