Uploaded image for project: 'Beam'
  1. Beam
  2. BEAM-11457

Enable skip key-value clone for HadoopFormatIO

Attach filesAttach ScreenshotVotersWatch issueWatchersCreate sub-taskLinkCloneUpdate Comment AuthorReplace String in CommentUpdate Comment VisibilityDelete Comments
    XMLWordPrintableJSON

    Details

    • Type: Improvement
    • Status: Resolved
    • Priority: P3
    • Resolution: Fixed
    • Affects Version/s: 2.25.0
    • Fix Version/s: 2.28.0
    • Component/s: io-java-hadoop-format
    • Labels:
      None

      Description

      HadoopFormatIO eagerly clone key-values if they are not a well known immutable types. This make sense due to how hadoop Writables behave. However, user can use key value translation functions which possibly already output immutable types. In such case it would be of benefit if extra clone via coder can be avoided.

      It would be great if coder can be consulted on the type an it's need for clone. However I am not aware if such detection is possible. I propose to add config parameter for skipping the clone which can be used by IO user.

        Attachments

          Activity

            People

            • Assignee:
              jvilcek Jozef Vilcek
              Reporter:
              jvilcek Jozef Vilcek

              Dates

              • Created:
                Updated:
                Resolved:

                Time Tracking

                Estimated:
                Original Estimate - Not Specified
                Not Specified
                Remaining:
                Remaining Estimate - 0h
                0h
                Logged:
                Time Spent - 1.5h
                1.5h

                  Issue deployment