Description
As you can see below, the name of the columns in SQL vs Dataset is different.
scala> val timesTwoUDF = spark.udf.register("timesTwo", (x: Int) => x * 2) timesTwoUDF: org.apache.spark.sql.expressions.UserDefinedFunction = UserDefinedFunction(<function1>,IntegerType,Some(List(IntegerType))) scala> spark.sql("SELECT timesTwo(1)").show +---------------+ |UDF:timesTwo(1)| +---------------+ | 2| +---------------+ scala> spark.range(1, 2).toDF("x").select(timesTwoUDF($"x")).show +------+ |UDF(x)| +------+ | 2| +------+