[SPARK-41791] Create distinct metadata attributes for metadata that is constant per file and metadata that is generated during the scan - ASF JIRA

Details

Type: Improvement
Status: Resolved
Priority: Major
Resolution: Fixed
Affects Version/s: 3.3.1
Fix Version/s: 3.4.0
Component/s: Optimizer
Labels: None

Description

There are two types of metadata in Spark:

Metadata that is constant per file (file_name, file_size, ...)
Metadata that is not constant (currently only row_index)

The two types are generated differently:

File-constant metadata is appended to the output after the scan
Non-constant metadata is generated during the scan
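
The difference between the two generation paths can be illustrated with a small model. This is a minimal sketch in plain Python, not Spark code: `FileInfo`, `scan_rows`, and `append_file_constants` are hypothetical names chosen for illustration, and the file name and size are made-up sample values.

```python
from dataclasses import dataclass
from typing import Dict, Iterable, Iterator, List


@dataclass(frozen=True)
class FileInfo:
    # Hypothetical stand-in for per-file values known without reading rows.
    file_name: str
    file_size: int


def scan_rows(values: List[str]) -> Iterator[Dict[str, object]]:
    # Non-constant metadata: row_index depends on each row's position,
    # so it has to be produced while the rows are being read.
    for row_index, value in enumerate(values):
        yield {"value": value, "row_index": row_index}


def append_file_constants(
    rows: Iterable[Dict[str, object]], info: FileInfo
) -> Iterator[Dict[str, object]]:
    # File-constant metadata: the same values are attached to every row
    # after the scan has already produced it.
    for row in rows:
        yield {**row, "file_name": info.file_name, "file_size": info.file_size}


# Sample values are assumptions for illustration only.
info = FileInfo(file_name="part-00000.parquet", file_size=1024)
rows = list(append_file_constants(scan_rows(["a", "b", "c"]), info))
```

In this model `row_index` differs per row, while `file_name` and `file_size` are identical across all rows of the file, which is why the two kinds can be handled at different stages.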

The proposal is to create distinct metadata attributes so that the two types can be told apart throughout the code.
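
One way to picture the proposal is a separate attribute type per kind of metadata, so that code can dispatch on the type instead of comparing names. The sketch below is a plain-Python illustration under that assumption; the class names and `split_by_kind` are hypothetical and are not taken from the linked pull request.

```python
from dataclasses import dataclass
from typing import List, Tuple


@dataclass(frozen=True)
class MetadataAttribute:
    # Hypothetical common base for all metadata attributes.
    name: str


@dataclass(frozen=True)
class FileConstantMetadataAttribute(MetadataAttribute):
    # Hypothetical marker type: value is constant for a whole file.
    pass


@dataclass(frozen=True)
class GeneratedMetadataAttribute(MetadataAttribute):
    # Hypothetical marker type: value is generated during the scan.
    pass


def split_by_kind(
    attrs: List[MetadataAttribute],
) -> Tuple[List[MetadataAttribute], List[MetadataAttribute]]:
    # Partition attributes by their type rather than by a hard-coded name list.
    constant = [a for a in attrs if isinstance(a, FileConstantMetadataAttribute)]
    generated = [a for a in attrs if isinstance(a, GeneratedMetadataAttribute)]
    return constant, generated


requested = [
    FileConstantMetadataAttribute("file_name"),
    GeneratedMetadataAttribute("row_index"),
    FileConstantMetadataAttribute("file_size"),
]
constant_attrs, generated_attrs = split_by_kind(requested)
```

With this shape, adding another generated field later would only require constructing it with the generated type; the partitioning logic would not need to change.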

Issue Links

links to

[Github] Pull Request #39314 (olaky)

People

Assignee: Jan-Ole Sasse

Reporter: Jan-Ole Sasse

Votes: 0

Watchers: 3

Dates

Created: 30/Dec/22 14:15

Updated: 05/Jan/23 11:41

Resolved: 05/Jan/23 11:40