Uploaded image for project: 'Spark'
  1. Spark
  2. SPARK-26591

Scalar Pandas UDF fails with 'illegal hardware instruction' in a certain environment

    XMLWordPrintableJSON

Details

    • Bug
    • Status: Closed
    • Major
    • Resolution: Feedback Received
    • 2.4.0
    • None
    • PySpark
    • None

    Description

      When I try to use pandas_udf from examples in documentation:

      from pyspark.sql.functions import pandas_udf, PandasUDFType
      
      from pyspark.sql.types import IntegerType, StringType
      
      slen = pandas_udf(lambda s: s.str.len(), IntegerType()) #here it is crashed

      I get the error:

      [1]    17969 illegal hardware instruction (core dumped)  python3

      The environment is:

      Python 3.6.7
      PySpark 2.4.0
      PyArrow: 0.11.1
      Pandas: 0.23.4
      NumPy: 1.15.4
      OS: Linux 4.15.0-43-generic #46-Ubuntu SMP Thu Dec 6 14:45:28 UTC 2018 x86_64 x86_64 x86_64 GNU/Linux

      Attachments

        1. core
          49.67 MB
          Elchin

        Activity

          People

            Unassigned Unassigned
            elch10 Elchin
            Votes:
            0 Vote for this issue
            Watchers:
            2 Start watching this issue

            Dates

              Created:
              Updated:
              Resolved: