[FLINK-10972] Enhancements to Flink Table API - ASF JIRA

Details

Type: New Feature
Status: Closed
Priority: Major
Resolution: Fixed
Affects Version/s: 1.9.0
Fix Version/s: 1.9.0
Component/s: Table SQL / API
Labels: None

Description

With the continuous efforts of the community, Flink has steadily improved and has attracted more and more users. Flink SQL is a canonical, widely used relational query language. However, there are still scenarios where Flink SQL fails to meet user needs in terms of functionality and ease of use, such as:

In terms of functionality

Iteration, user-defined windows, user-defined joins, user-defined GroupReduce, etc. cannot be expressed in SQL.

In terms of ease of use

Map - e.g. “dataStream.map(mapFun)”. Although “table.select(udf1(), udf2(), udf3()....)” can accomplish the same thing, a map() function that returns 100 columns requires defining and calling 100 UDFs when using SQL, which is quite involved.

FlatMap - e.g. “dataStream.flatMap(flatMapFun)”. Similarly, it can be implemented with “table.join(udtf).select()”. However, the DataStream form is clearly easier to use than the SQL form.

For these two reasons, this group of JIRA issues will enhance the Table API in stages.
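
To make the ease-of-use argument about Map and FlatMap concrete, here is a minimal plain-Python sketch. It is not Flink code: a table is modeled as a list of row tuples, and the helper names `select`, `table_map`, and `table_flat_map` are hypothetical stand-ins that only mimic the semantics discussed above (one UDF per output column versus a single row-level function).

```python
from typing import Callable, Iterable, List, Tuple

Row = Tuple  # assumption for this sketch: a row is a plain tuple


def select(rows: List[Row], *udfs: Callable[[Row], object]) -> List[Row]:
    """SQL-style projection: one scalar UDF per output column."""
    return [tuple(udf(row) for udf in udfs) for row in rows]


def table_map(rows: List[Row], map_fun: Callable[[Row], Row]) -> List[Row]:
    """Row-level map: a single function produces all output columns."""
    return [map_fun(row) for row in rows]


def table_flat_map(
    rows: List[Row], flat_map_fun: Callable[[Row], Iterable[Row]]
) -> List[Row]:
    """Row-level flatMap: each input row yields zero or more output rows."""
    return [out for row in rows for out in flat_map_fun(row)]


rows = [("a b", 1), ("c", 2)]

# Projection needs a separate UDF for every output column.
via_select = select(rows, lambda r: r[0].upper(), lambda r: r[1] * 10)

# map() expresses the same result with one function.
via_map = table_map(rows, lambda r: (r[0].upper(), r[1] * 10))

# flatMap() splits each row into one output row per word.
words = table_flat_map(rows, lambda r: [(w, r[1]) for w in r[0].split()])
```

With two output columns the difference is small. With 100 output columns, the `select` form would need 100 separate functions, while the `table_map` form would still need one.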

-----------------------

In the first stage we seek to support the following (details are described in the sub-issues):

Table.map()
Table.flatMap()
GroupedTable.aggregate()
GroupedTable.flatAggregate()

The FLIP can be found here: FLIP-29
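
The difference between the two grouped operators listed above can be sketched in plain Python. This is not Flink code, and the helpers `group_by`, `aggregate`, and `flat_aggregate` are hypothetical. The sketch assumes the usual meaning of these operators: an aggregate produces exactly one (possibly multi-column) row per group, while a flat aggregate may produce several rows per group, as in a top-2 query.

```python
from collections import defaultdict
from typing import Callable, Dict, Iterable, List, Tuple


def group_by(rows: List[Tuple], key_index: int) -> Dict[object, List[Tuple]]:
    """Collect rows per group key, keeping keys in first-seen order."""
    groups: Dict[object, List[Tuple]] = defaultdict(list)
    for row in rows:
        groups[row[key_index]].append(row)
    return groups


def aggregate(
    groups: Dict[object, List[Tuple]], agg_fun: Callable[[List[Tuple]], Tuple]
) -> List[Tuple]:
    """Exactly one output row per group; agg_fun may return several columns."""
    return [(key,) + agg_fun(rows) for key, rows in groups.items()]


def flat_aggregate(
    groups: Dict[object, List[Tuple]],
    table_agg_fun: Callable[[List[Tuple]], Iterable[Tuple]],
) -> List[Tuple]:
    """Zero or more output rows per group."""
    return [
        (key,) + out
        for key, rows in groups.items()
        for out in table_agg_fun(rows)
    ]


rows = [("x", 3), ("y", 7), ("x", 9), ("x", 5)]
groups = group_by(rows, 0)

# Multi-column aggregate: (min, max) of the value column for each key.
min_max = aggregate(
    groups, lambda rs: (min(r[1] for r in rs), max(r[1] for r in rs))
)

# Table aggregate: the two largest values per key, one output row each.
top2 = flat_aggregate(
    groups,
    lambda rs: [(v,) for v in sorted((r[1] for r in rs), reverse=True)[:2]],
)
```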

The second stage is about column operators and operations:

1) Table(schema) operators

Add columns
Replace columns
Drop columns
Rename columns

2) Fine-grained column/row operations

Column selection
Row package and flatten
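
The four schema operators listed under 1) above can be illustrated with a small plain-Python model. This is only a sketch of the intended semantics, not the Flink API: a table is modeled as a list of dictionaries, and the function names `add_columns`, `replace_columns`, `drop_columns`, and `rename_columns` are hypothetical. Rejecting an add that collides with an existing column is an assumption of this sketch.

```python
from typing import Callable, Dict, List

Row = Dict[str, object]  # assumption: a row maps column name -> value


def add_columns(rows: List[Row], **cols: Callable[[Row], object]) -> List[Row]:
    """Append computed columns; refuse to overwrite existing ones."""
    for row in rows:
        clash = set(cols) & set(row)
        if clash:
            raise ValueError(f"columns already exist: {sorted(clash)}")
    return [{**row, **{name: fn(row) for name, fn in cols.items()}} for row in rows]


def replace_columns(rows: List[Row], **cols: Callable[[Row], object]) -> List[Row]:
    """Recompute existing columns, keeping the original column order."""
    return [
        {name: cols[name](row) if name in cols else value for name, value in row.items()}
        for row in rows
    ]


def drop_columns(rows: List[Row], *names: str) -> List[Row]:
    """Remove the named columns from every row."""
    return [{k: v for k, v in row.items() if k not in names} for row in rows]


def rename_columns(rows: List[Row], **mapping: str) -> List[Row]:
    """Rename columns; mapping is old name -> new name."""
    return [{mapping.get(k, k): v for k, v in row.items()} for row in rows]


table = [{"a": 1, "b": 2}, {"a": 3, "b": 4}]
step1 = add_columns(table, c=lambda r: r["a"] + r["b"])  # add c = a + b
step2 = replace_columns(step1, b=lambda r: r["b"] * 10)  # replace b
step3 = drop_columns(step2, "a")                         # drop a
result = rename_columns(step3, c="total")                # rename c -> total
```

Each helper returns a new list and leaves its input untouched, so the steps can be chained in the same way as operators on an immutable table.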

Issue Links

is related to: FLINK-13470 Enhancements to Flink Table API for blink planner (Open)

Sub-Tasks

1. Add Map operator to Table API
   Status: Closed; Assignee: Dian Fu; Progress: 100%; Original Estimate: Not Specified; Time Spent: 10m

2. Add FlatMap to TableAPI
   Status: Closed; Assignee: Dian Fu; Progress: 100%; Original Estimate: Not Specified; Time Spent: 10m

3. Add support for TimeAttribute in Map/FlatMap operator
   Status: Closed; Assignee: sunjincheng

4. Add Aggregate operator to Table API
   Status: Closed; Assignee: Hequn Cheng; Progress: 100%; Original Estimate: Not Specified; Time Spent: 0.5h

5. Add FlatAggregate operator to unbounded streaming Table API
   Status: Closed; Assignee: Hequn Cheng; Progress: 100%; Original Estimate: Not Specified; Time Spent: 0.5h

6. Add support for group keys in Unbounded Aggregate/FlatAggregate operator
   Status: Closed; Assignee: Wei Zhong

7. Add Bounded(Group Window) FlatAggregate operator to streaming Table API
   Status: Closed; Assignee: Hequn Cheng; Progress: 100%; Original Estimate: Not Specified; Time Spent: 20m

8. Add documentation for TableAggregate Function
   Status: Closed; Assignee: Hequn Cheng; Progress: 100%; Original Estimate: Not Specified; Time Spent: 20m

9. BatchTableEnvironment and StreamTableEnvironment should transparent to users
   Status: Closed; Assignee: Wei Zhong

10. Add Column Operators/Operations
   Status: Closed; Assignee: sunjincheng

11. Add Column Operators(add/rename/drop)
   Status: Closed; Assignee: sunjincheng; Progress: 100%; Original Estimate: Not Specified; Time Spent: 20m

12. Add Column selections
   Status: Closed; Assignee: Hequn Cheng; Progress: 100%; Original Estimate: Not Specified; Time Spent: 20m

13. Support incremental emit under AccRetract mode for non-window streaming FlatAggregate on Table API
   Status: Closed; Assignee: Hequn Cheng; Progress: 100%; Original Estimate: Not Specified; Time Spent: 20m

14. Add group window Aggregate operator to Table API
   Status: Closed; Assignee: Hequn Cheng; Progress: 100%; Original Estimate: Not Specified; Time Spent: 20m

People

Assignee: sunjincheng

Reporter: sunjincheng

Votes: 0

Watchers: 9

Dates

Created: 22/Nov/18 01:27

Updated: 29/Jul/19 12:28

Resolved: 29/Jul/19 12:27

Time Tracking

Estimated: Not Specified

Remaining: 0h

Logged: 3h 20m (including sub-tasks)