[SPARK-21153] Time windowing for tumbling windows can use a project instead of expand + filter - ASF JIRA

XML

Word

Printable

JSON

Details

Type: Improvement
Status: Resolved
Priority: Major
Resolution: Fixed
Affects Version/s: 2.1.1
Fix Version/s: 2.3.0
Component/s: SQL
Labels:
None

Target Version/s:

2.3.0

Description

Time windowing in Spark currently performs an Expand + Filter, because there is no way to guarantee the amount of windows a timestamp will fall in, in the general case. However, for tumbling windows, a record is guaranteed to fall into a single bucket. In this case, doubling the number of records with Expand is wasteful, and can be improved by using a simple Projection instead.

Attachments

Issue Links

links to

[Github] Pull Request #18364 (brkyvz)

Activity

People

Assignee:: Burak Yavuz

Reporter:: Burak Yavuz

Votes:: 0 Vote for this issue

Watchers:: 3 Start watching this issue

Dates

Created:: 20/Jun/17 16:12

Updated:: 26/Jun/17 08:27

Resolved:: 26/Jun/17 08:27