Details
Type: Bug
Status: Resolved
Priority: Major
Resolution: Fixed
3.3.0
Description
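Calling astype(bool) on a Series that contains np.nan should convert the np.nan entry to True, rather than False. In pandas, each of the following returns the same result: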
>>> pd.Series([1, 2, np.nan], dtype=float).astype(bool)
>>> pd.Series([1, 2, np.nan], dtype=str).astype(bool)
>>> pd.Series([datetime.date(1994, 1, 31), datetime.date(1994, 2, 1), np.nan]).astype(bool)
0    True
1    True
2    True
dtype: bool
But in pyspark, it is:
0     True
1     True
2    False
dtype: bool
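
A minimal sketch for checking the expected pandas behavior locally, assuming only pandas and NumPy are installed. The variable names are illustrative and not taken from the report. The dtype=str case is left out because its result may depend on the pandas version.

```python
import datetime

import numpy as np
import pandas as pd

# np.nan is a non-zero float, so plain Python truthiness already treats it as True.
assert bool(np.nan) is True

# Float series: the NaN entry casts to True, like any other non-zero float.
float_result = pd.Series([1, 2, np.nan], dtype=float).astype(bool)

# Object series mixing dates and NaN: each element is cast via its truthiness.
date_result = pd.Series(
    [datetime.date(1994, 1, 31), datetime.date(1994, 2, 1), np.nan]
).astype(bool)

print(float_result.tolist())
print(date_result.tolist())
```

Running the same astype(bool) calls on a pyspark.pandas Series is presumably how the differing output above was obtained. That comparison needs a Spark installation and is not covered by this sketch.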