Uploaded image for project: 'Beam'
  1. Beam
  2. BEAM-2831

Pipeline crashes due to Beam encoder breaking Flink memory management

Details

    • Bug
    • Status: Resolved
    • P2
    • Resolution: Fixed
    • 2.0.0, 2.1.0
    • 2.5.0
    • runner-flink
    • None
    • Flink 1.2.1 and 1.3.0, Java HotSpot and OpenJDK 8, macOS 10.12.6 and unknown Linux

    Description

      I’ve been running a Beam pipeline on Flink. Depending on the dataset size and the heap memory configuration of the jobmanager and taskmanager, I may run into an EOFException, which causes the job to fail.

      As discussed on Flink's mailinglist (stacktrace enclosed), Flink catches these EOFExceptions and activates disk spillover. Because Beam wraps these exceptions, this mechanism fails, the exception travels up the stack, and the job aborts.

      Hopefully this is enough information and this is something that can be adjusted for in Beam. I'd be glad to provide more information where needed.

      Attachments

        Issue Links

          Activity

            People

              dwysakowicz Dawid Wysakowicz
              rjkip Reinier Kip
              Votes:
              0 Vote for this issue
              Watchers:
              5 Start watching this issue

              Dates

                Created:
                Updated:
                Resolved:

                Time Tracking

                  Estimated:
                  Original Estimate - Not Specified
                  Not Specified
                  Remaining:
                  Remaining Estimate - 0h
                  0h
                  Logged:
                  Time Spent - 1h 20m
                  1h 20m