Uploaded image for project: 'Spark'
  1. Spark
  2. SPARK-37055

Apply 'compute.eager_check' across all the codebase

    XMLWordPrintableJSON

Details

    • Umbrella
    • Status: Open
    • Major
    • Resolution: Unresolved
    • 3.3.0
    • None
    • PySpark
    • None

    Description

      As hyukjin.kwon guide

      1 Make every input validation like this covered by the new configuration. For example:

      - a == b
      + def eager_check(f): # Utility function 
      + return not config.compute.eager_check and f() 
      + 
      + eager_check(lambda: a == b)
      

      2 We should check if the output makes sense although the behaviour is not matched with pandas'. If the output does not make sense, we shouldn't cover it with this configuration.

      3 Make this configuration enabled by default so we match the behaviour to pandas' by default.

       

      We have to make sure listing which API is affected in the description of 'compute.eager_check'

      Attachments

        Issue Links

          Activity

            People

              Unassigned Unassigned
              dchvn dch nguyen
              hyukjin.kwon hyukjin.kwon
              Votes:
              0 Vote for this issue
              Watchers:
              2 Start watching this issue

              Dates

                Created:
                Updated: