Uploaded image for project: 'Spark'
  1. Spark
  2. SPARK-47386

Ability to only take first match for join when there are miltiple match

    XMLWordPrintableJSON

Details

    • Improvement
    • Status: Open
    • Minor
    • Resolution: Unresolved
    • 3.5.1
    • None
    • Spark Core, SQL

    Description

      When joining 2 datasets, it can be desirable to only get first matching row instead of all. This can be helpful when the right table has fuplicates, and dropping duplicate is not desired due to data volume. 

       

      Attachments

        Activity

          People

            Unassigned Unassigned
            vijayjangir Vijay Jangir
            Votes:
            0 Vote for this issue
            Watchers:
            1 Start watching this issue

            Dates

              Created:
              Updated: