Uploaded image for project: 'Spark'
  1. Spark
  2. SPARK-23989

When using `SortShuffleWriter`, the data will be overwritten

    XMLWordPrintableJSON

Details

    • Bug
    • Status: Resolved
    • Critical
    • Resolution: Fixed
    • 2.3.0
    • 2.3.1, 2.4.0
    • Spark Core
    • None

    Description

      When using `SortShuffleWriter`, we only insert  'AnyRef' into 'PartitionedAppendOnlyMap' or 'PartitionedPairBuffer'.

      For this function:

      override def write(records: Iterator[Product2[K, V]])

      the value of 'records' is `UnsafeRow`, so  the value will be overwritten

       

      Attachments

        Activity

          People

            cloud_fan Wenchen Fan
            10110346 liuxian
            Votes:
            0 Vote for this issue
            Watchers:
            7 Start watching this issue

            Dates

              Created:
              Updated:
              Resolved: