Uploaded image for project: 'SystemDS'
  1. SystemDS
  2. SYSTEMDS-2163

Performance large partitioned broadcasts

    XMLWordPrintableJSON

Details

    • Bug
    • Status: Closed
    • Major
    • Resolution: Fixed
    • None
    • SystemML 1.1
    • None
    • None

    Description

      Due to Spark's limitations with broadcasts larger than 2G, in SystemML we use partitioned broadcasts that split a large side input into potentially many broadcast variables. For historic reasons the meta data is still maintained in the individual partitioned blocks. However, for many operations this meta data is accessed on the first partitioned block which leads to potentially unnecessary broadcast fetches.

      Attachments

        Activity

          People

            mboehm7 Matthias Boehm
            mboehm7 Matthias Boehm
            Votes:
            0 Vote for this issue
            Watchers:
            1 Start watching this issue

            Dates

              Created:
              Updated:
              Resolved: