Uploaded image for project: 'SystemDS'
  1. SystemDS
  2. SYSTEMDS-1029

Perftest: Data generator for descriptive statistics (slowdown due to update in-place)

    XMLWordPrintableJSON

Details

    • Bug
    • Status: Closed
    • Blocker
    • Resolution: Fixed
    • None
    • SystemML 0.11
    • None
    • None

    Description

      The data generator for descriptive statistics shows severe performance issues, due to update-in-place. In order to avoid buffer pool serialization and evictions via shallow serialize, we represent sparse matrices which are subject to update in-place in CSR. This works very well for row major updates but has severe problems on column-wise updates (due to shifting of values).

      There are three potential options:

      • (1) Workaround: Change the data generator script from column-wise indexing to row-wise indexing and transpose after the parfor loop.
      • (2) Parfor rewrite: Disable update in-place, if sparse input, no parfor update in-place, and column-wise access pattern.
      • (3) Improved left indexing over CSR: Reallocate and shift the output once. This is feasible as left indexing (specifically sparse-sparse and dense-sparse) are the only operations which update the CSR representation.

      Attachments

        Issue Links

          Activity

            People

              mboehm7 Matthias Boehm
              mboehm7 Matthias Boehm
              Votes:
              0 Vote for this issue
              Watchers:
              1 Start watching this issue

              Dates

                Created:
                Updated:
                Resolved: