Spark / SPARK-11319

PySpark silently accepts null values in non-nullable DataFrame fields.


Details

    • Type: Bug
    • Status: Resolved
    • Priority: Major
    • Resolution: Duplicate
    • Affects Version/s: None
    • Fix Version/s: 2.0.0
    • Component/s: PySpark, SQL
    • Labels: None

    Description

      Running the following code, which puts a null value into a non-nullable column, silently succeeds. This makes the code incredibly hard to trust.

      In [2]: from pyspark.sql.types import *
      In [3]: sqlContext.createDataFrame([(None,)], StructType([StructField("a", TimestampType(), False)])).collect()
      Out[3]: [Row(a=None)]
      
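      The check the reporter expects could be sketched in plain Python. This is a minimal sketch, not PySpark's actual verification code; `validate_row` is a hypothetical helper, and the `(name, nullable)` tuples stand in for a `StructType`'s fields:

      ```python
      def validate_row(row, fields):
          """Reject None in non-nullable fields.

          row:    a tuple of values
          fields: list of (name, nullable) pairs mirroring a StructType
          """
          for value, (name, nullable) in zip(row, fields):
              if value is None and not nullable:
                  raise ValueError(f"field '{name}' is non-nullable but got None")
          return row

      # A null in a nullable field passes validation...
      validate_row((None,), [("a", True)])

      # ...but in a non-nullable field it should raise, instead of
      # silently producing Row(a=None) as in the report above.
      try:
          validate_row((None,), [("a", False)])
      except ValueError as e:
          print(e)
      ```

      Under this sketch, the example in the description would raise at `createDataFrame` time rather than returning `[Row(a=None)]`.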

      Attachments

        Issue Links

        Activity


          People

            Assignee: Wenchen Fan (cloud_fan)
            Reporter: Kevin Cox (kevincox)
            Votes: 2
            Watchers: 12

            Dates

              Created:
              Updated:
              Resolved:
