SystemDS / SYSTEMDS-3031

Improve SPOOF CUDA rowwise handling of intermediate memory

Details

    • Type: Improvement
    • Status: Open
    • Priority: Major
    • Resolution: Unresolved
    • SystemDS 2.2
    • None
    • NativeRuntime

    Description

      Temporary memory for SPOOF CUDA rowwise operators is currently allocated in native code. To let all thread blocks work concurrently, the allocation requires rows * columns * num_intermediates * sizeof(usedDataType) bytes.
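      To give a sense of scale, the formula above can be evaluated for a concrete case. This is only a sketch: the class name, method name, and the input sizes are hypothetical and are not taken from the SystemDS code base.

```java
public class RowwiseTempMemory {
    // Bytes needed when every row has its own set of intermediate vectors:
    // rows * columns * num_intermediates * sizeof(usedDataType).
    // Math.multiplyExact throws ArithmeticException on long overflow.
    static long currentBytes(long rows, long columns,
                             long numIntermediates, long bytesPerValue) {
        return Math.multiplyExact(
                Math.multiplyExact(rows, columns),
                Math.multiplyExact(numIntermediates, bytesPerValue));
    }

    public static void main(String[] args) {
        // Hypothetical sizes: 100,000 x 1,000 input, 2 intermediates, doubles.
        long bytes = currentBytes(100_000, 1_000, 2, Double.BYTES);
        System.out.println(bytes + " bytes"); // prints "1600000000 bytes"
    }
}
```

      For these assumed sizes the temporary buffers alone take about 1.6 GB, which is why the footprint matters for the GPU memory manager.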

      This improvement should:

      • move the allocation to the Java side of SystemDS so that it integrates better with the GPU memory manager
      • reduce the memory footprint, either by having fewer thread blocks process more rows each or by queueing fewer blocks at once for execution
      • update the estimate of intermediate memory at the HOP level
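      One way the reduced footprint could be estimated on the Java side is sketched below. It assumes that each concurrently running thread block owns one set of row-sized intermediate buffers and reuses it for every row it processes. That assumption, the class and method names, and the block count of 160 are all hypothetical; this is not the actual SystemDS estimator.

```java
public class BoundedRowwiseTempMemory {
    // Temporary bytes if only `concurrentBlocks` buffer sets exist at once
    // (assumption: one buffer set per resident block, reused across rows).
    static long boundedBytes(long rows, long columns, long numIntermediates,
                             long bytesPerValue, long concurrentBlocks) {
        if (rows <= 0 || concurrentBlocks <= 0)
            throw new IllegalArgumentException("rows and concurrentBlocks must be positive");
        long buffers = Math.min(rows, concurrentBlocks);
        long perBuffer = Math.multiplyExact(
                Math.multiplyExact(columns, numIntermediates), bytesPerValue);
        return Math.multiplyExact(buffers, perBuffer);
    }

    // Maximum number of rows a single block has to loop over (ceiling division).
    static long rowsPerBlock(long rows, long concurrentBlocks) {
        if (rows <= 0 || concurrentBlocks <= 0)
            throw new IllegalArgumentException("rows and concurrentBlocks must be positive");
        long blocks = Math.min(rows, concurrentBlocks);
        return (rows + blocks - 1) / blocks;
    }

    public static void main(String[] args) {
        // Hypothetical: 100,000 x 1,000 input, 2 intermediates, doubles, 160 blocks.
        System.out.println(boundedBytes(100_000, 1_000, 2, Double.BYTES, 160)); // prints 2560000
        System.out.println(rowsPerBlock(100_000, 160));                         // prints 625
    }
}
```

      Under these assumptions the temporary memory no longer grows with the number of rows. It depends only on the row length, the number of intermediates, and how many blocks are allowed to run at once.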

          People

            markd Mark Dokter