Spark / SPARK-37896

ConstantColumnVector: a column vector with same values


Details

    • Type: Improvement
    • Status: Resolved
    • Priority: Major
    • Resolution: Fixed
    • Affects Version/s: 3.2.0
    • Fix Version/s: 3.3.0
    • Component/s: SQL
    • Labels: None

    Description

      Introduce a new column vector named `ConstantColumnVector`. It represents a column vector in which every row has the same constant value.

      It could help improve performance for hidden file metadata columns in columnar file formats: because the metadata fields are exactly the same for every row in a file, we don't need to copy and keep multiple copies of the data.
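
      The idea can be illustrated with a minimal, self-contained sketch. This is not Spark's implementation or API: the class names `MaterializedVector` and `ConstantVector`, the `get` method, and the example file path are all hypothetical and chosen only to contrast storing one slot per row with storing the value once.

```python
from typing import Any, List


class MaterializedVector:
    """Baseline (hypothetical): keeps one slot per row, even if all rows are equal."""

    def __init__(self, num_rows: int, value: Any) -> None:
        # Storage grows linearly with the number of rows.
        self.values: List[Any] = [value] * num_rows

    def get(self, row_id: int) -> Any:
        return self.values[row_id]


class ConstantVector:
    """Hypothetical constant vector: keeps the value once, whatever the row count."""

    def __init__(self, num_rows: int, value: Any) -> None:
        self.num_rows = num_rows
        self.value = value  # the single stored value shared by every row

    def get(self, row_id: int) -> Any:
        # Every valid row index resolves to the same stored value.
        if not 0 <= row_id < self.num_rows:
            raise IndexError(row_id)
        return self.value


# Example: a per-file metadata field (hypothetical path) for a 4096-row batch.
file_path = "/data/part-0000.parquet"
materialized = MaterializedVector(4096, file_path)
constant = ConstantVector(4096, file_path)
```

      Both vectors return the same value for any valid row, but the constant variant does no per-row writes when the batch is filled and holds a single field instead of a 4096-slot buffer.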

          People

            Yaohua Zhao (yaohua)