FLINK-26105

Rolling log filenames cause end-to-end test to fail (example test failure "Running HA (hashmap, async)")


Details

    Description

      Feb 14 01:31:29 Killed TM @ 255483
      Feb 14 01:31:29 Starting new TM.
      Feb 14 01:31:42 Killed TM @ 258722
      Feb 14 01:31:42 Starting new TM.
      Feb 14 01:32:00 Checking for non-empty .out files...
      Feb 14 01:32:00 No non-empty .out files.
      Feb 14 01:32:00 FAILURE: A JM did not take over.
      Feb 14 01:32:00 One or more tests FAILED.
      Feb 14 01:32:00 Stopping job timeout watchdog (with pid=250820)
      Feb 14 01:32:00 Killing JM watchdog @ 252644
      Feb 14 01:32:00 Killing TM watchdog @ 253262
      Feb 14 01:32:00 [FAIL] Test script contains errors.
      Feb 14 01:32:00 Checking of logs skipped.
      Feb 14 01:32:00 
      Feb 14 01:32:00 [FAIL] 'Running HA (hashmap, async) end-to-end test' failed after 2 minutes and 51 seconds! Test exited with exit code 1
      Feb 14 01:32:00 
      01:32:00 ##[group]Environment Information
      Feb 14 01:32:01 Searching for .dump, .dumpstream and related files in '/home/vsts/work/1/s'
      dmesg: read kernel buffer failed: Operation not permitted
      Feb 14 01:32:06 Stopping taskexecutor daemon (pid: 259377) on host fv-az313-602.
      Feb 14 01:32:07 Stopping standalonesession daemon (pid: 256528) on host fv-az313-602.
      Feb 14 01:32:08 Stopping zookeeper...
      Feb 14 01:32:08 Stopping zookeeper daemon (pid: 251023) on host fv-az313-602.
      Feb 14 01:32:09 Skipping taskexecutor daemon (pid: 251636), because it is not running anymore on fv-az313-602.
      Feb 14 01:32:09 Skipping taskexecutor daemon (pid: 255483), because it is not running anymore on fv-az313-602.
      Feb 14 01:32:09 Skipping taskexecutor daemon (pid: 258722), because it is not running anymore on fv-az313-602.
      The STDIO streams did not close within 10 seconds of the exit event from process '/usr/bin/bash'. This may indicate a child process inherited the STDIO streams and has not yet exited.
      ##[error]Bash exited with code '1'.
       

      https://dev.azure.com/apache-flink/apache-flink/_build/results?buildId=31347&view=logs&j=e9d3d34f-3d15-59f4-0e3e-35067d100dfe&t=f8a6d3eb-38cf-5cca-9a99-d0badeb5fe62&l=8020


          People

            mapohl Matthias Pohl
            gaoyunhaii Yun Gao
            Votes: 0
            Watchers: 2
