[FLINK-14398] Further split input unboxing code into separate methods - ASF JIRA

XML

Word

Printable

JSON

Details

Type: Bug
Status: Resolved
Priority: Major
Resolution: Fixed
Affects Version/s: 1.8.0
Fix Version/s: 1.8.3, 1.9.2, 1.10.0
Component/s: Table SQL / Legacy Planner, Table SQL / Planner
Labels:
- pull-request-available

Description

In one of our production pipelines, we have a table with 1200+ columns. At runtime, it failed due to a method inside the generated code exceeding 64kb when compiled to bytecode.

After we investigated the generated code, it appeared that the map method inside a generated RichMapFunction was too long. See attached file (codegen.example.txt).

In the problematic map method, result setters were correctly split into individual methods and did not have the largest footprint.

However, there were also 1000+ input unboxing expressions inside reusableInputUnboxingExprs, which, individually were not trivial and when flattened linearly in the map function here, pushed the method size beyond 64kb in bytecode.

We think it is worthwhile to split these input unboxing code snippets into individual methods. We were able to verify, in our production environment, that splitting input unboxing code snippets into individual methods resolves the issue. Would love to hear thoughts from the team and find the best path to fix it.

Attachments

- Sort By Name
- Sort By Date
- Ascending
- Descending

codegen.example.txt
15/Oct/19 13:42
731 kB
Hao Dang

Issue Links

links to

GitHub Pull Request #9980

GitHub Pull Request #10000

Activity

People

Assignee:: Hao Dang

Reporter:: Hao Dang

Votes:: 0 Vote for this issue

Watchers:: 5 Start watching this issue

Dates

Created:: 15/Oct/19 13:55

Updated:: 30/Oct/19 06:04

Resolved:: 30/Oct/19 06:04

Time Tracking

Estimated:

Not Specified

Remaining:

Logged:

40m