[SPARK-10533] DataFrame filter is not handling float/double with Scientific Notation 'e' / 'E' - ASF JIRA

Log work

Agile Board

Rank to Top

Rank to Bottom

Attach files

Attach Screenshot

Bulk Copy Attachments

Bulk Move Attachments

Voters

Watch issue

Watchers

Create sub-task

Convert to sub-task

Move

Link

Clone

Labels

Update Comment Author

Replace String in Comment

Update Comment Visibility

Delete Comments

Delete

XML

Word

Printable

JSON

Details

Type: Bug
Status: Resolved
Priority: Minor
Resolution: Fixed
Affects Version/s: 1.4.1
Fix Version/s: 1.6.0
Component/s: SQL
Labels:
- newbie

Description

In DataFrames filter operation,when giving float comparison with e (2.0e2) it is not converting the comparison constant as expected (200.0 in this case).
For example:

val df = sqlContext.createDataFrame(Seq(("a",1.0),("b",2.0),("c",3.0)))
df.filter("_2 < 2.0e1").show()

+--+---+
|_1| _2|
+--+---+
| a|1.0|
+--+---+

It should return all the three records from the dataframe,but is return record which is less than 2.0.
It seems it is just comparing with the mantissa/coefficient.

On the other hand,sqlContext is handling the above case and giving the desired output.

df.resgisterTempTable("df")
sqlContext.sql("select * from df where `_2` < 2.0e1").show()

+--+---+
|_1| _2|
+--+---+
| a|1.0|
| b|2.0|
| c|3.0|
+--+---+

Attachments

Issue Links

Add Link

links to

[Github] Pull Request #9085 (adrian-wang)

Delete this link

[Github] Pull Request #9482 (cloud-fan)

Delete this link

Activity

Comment

This comment will be Viewable by All Users Viewable by All Users

Cancel

People

Assignee:: Adrian Wang Assign to me

Reporter:: Rishabh Bhardwaj

Votes:: 0 Vote for this issue

Watchers:: 6 Start watching this issue

Dates

Created:: 10/Sep/15 10:46

Updated:: 06/Nov/15 14:49

Resolved:: 03/Nov/15 14:31

Agile

View on Board

DataFrame filter is not handling float/double with Scientific Notation 'e' / 'E'

Details

Description

Attachments

Attachments

Issue Links

Activity

People

Dates

Agile

Slack

Issue deployment