Uploaded image for project: 'SystemDS'
  1. SystemDS
  2. SYSTEMDS-2698

Initial cuda codegen functionality

    XMLWordPrintableJSON

Details

    Description

      Cleaned up/improved pull request for initial CUDA codegen. This includes code cleanup and grouping the changes into a few commits (original PR had 60+ commits and was hard to review)

      In terms of functionality, this PR will include:

      • CUDA code reorganization
      • JNI parts to launch generated CUDA kernels
      • SPOOF compiler extensions
      • CPlan templates
      • Runtime instruction for dense input
      • Code generation for the SpoofCellwise template

      Attachments

        Issue Links

          Activity

            People

              markd Mark Dokter
              markd Mark Dokter
              Votes:
              0 Vote for this issue
              Watchers:
              1 Start watching this issue

              Dates

                Created:
                Updated:
                Resolved: