Uploaded image for project: 'Spark'
  1. Spark
  2. SPARK-27185

mapPartition to replace map to speedUp Dataset's toLocalIterator process

    XMLWordPrintableJSON

Details

    • Improvement
    • Status: Resolved
    • Minor
    • Resolution: Invalid
    • 2.0.0, 2.2.0, 2.3.0, 2.4.0
    • None
    • SQL
    • Patch, Important

    Description

      In my case, I will use DataSet's toLocalIterator function, and I found that underlying code can be improved,it can be changed from map to mapPartitionsInternal to speed Up the process of  decode data to Internal Row 

      Attachments

        Activity

          People

            Unassigned Unassigned
            angerszhuuu angerszhu
            Votes:
            0 Vote for this issue
            Watchers:
            1 Start watching this issue

            Dates

              Created:
              Updated:
              Resolved:

              Time Tracking

                Estimated:
                Original Estimate - 24h
                24h
                Remaining:
                Remaining Estimate - 24h
                24h
                Logged:
                Time Spent - Not Specified
                Not Specified