Uploaded image for project: 'Mesos'
  1. Mesos
  2. MESOS-7111

HttpFaultToleranceTest.SchedulerFailoverFrameworkToExecutorMessage segfaults

    XMLWordPrintableJSON

Details

    • Bug
    • Status: Accepted
    • Major
    • Resolution: Unresolved
    • 1.2.0, 1.8.0
    • None
    • test
    • ubuntu-16, macOS

    Description

      We observed a segfault in HttpFaultToleranceTest.SchedulerFailoverFrameworkToExecutorMessage in internal CI on an ubuntu16 machine. Note that ubuntu16 uses gcc-6.

      [ RUN      ] HttpFaultToleranceTest.SchedulerFailoverFrameworkToExecutorMessage
      I0210 02:47:31.260174 19578 cluster.cpp:160] Creating default 'local' authorizer
      I0210 02:47:31.261225 19597 master.cpp:383] Master d8129420-2a04-48e7-9b28-6b0a0af73168 (ip-10-150-111-24.ec2.internal) started on 10.150.111.24:33608
      I0210 02:47:31.261281 19597 master.cpp:385] Flags at startup: --acls="" --agent_ping_timeout="15secs" --agent_reregister_timeout="10mins" --allocation_interval="1secs" --allocator="HierarchicalDRF" --authenticate_agents="true" --authenticate_frameworks="false" --authenticate_http_frameworks="true" --authenticate_http_readonly="true" --authenticate_http_readwrite="true" --authenticators="crammd5" --authorizers="local" --credentials="/tmp/fBrqHi/credentials" --framework_sorter="drf" --help="false" --hostname_lookup="true" --http_authenticators="basic" --http_framework_authenticators="basic" --initialize_driver_logging="true" --log_auto_initialize="true" --logbufsecs="0" --logging_level="INFO" --max_agent_ping_timeouts="5" --max_completed_frameworks="50" --max_completed_tasks_per_framework="1000" --max_unreachable_tasks_per_framework="1000" --quiet="false" --recovery_agent_removal_limit="100%" --registry="in_memory" --registry_fetch_timeout="1mins" --registry_gc_interval="15mins" --registry_max_agent_age="2weeks" --registry_max_agent_count="102400" --registry_store_timeout="100secs" --registry_strict="false" --root_submissions="true" --user_sorter="drf" --version="false" --webui_dir="/usr/local/share/mesos/webui" --work_dir="/tmp/fBrqHi/master" --zk_session_timeout="10secs"
      I0210 02:47:31.261404 19597 master.cpp:437] Master allowing unauthenticated frameworks to register
      I0210 02:47:31.261411 19597 master.cpp:449] Master only allowing authenticated agents to register
      I0210 02:47:31.261415 19597 master.cpp:462] Master only allowing authenticated HTTP frameworks to register
      I0210 02:47:31.261420 19597 credentials.hpp:37] Loading credentials for authentication from '/tmp/fBrqHi/credentials'
      I0210 02:47:31.261488 19597 master.cpp:507] Using default 'crammd5' authenticator
      I0210 02:47:31.261530 19597 http.cpp:919] Using default 'basic' HTTP authenticator for realm 'mesos-master-readonly'
      I0210 02:47:31.261591 19597 http.cpp:919] Using default 'basic' HTTP authenticator for realm 'mesos-master-readwrite'
      I0210 02:47:31.261631 19597 http.cpp:919] Using default 'basic' HTTP authenticator for realm 'mesos-master-scheduler'
      I0210 02:47:31.261698 19597 master.cpp:587] Authorization enabled
      I0210 02:47:31.261754 19601 whitelist_watcher.cpp:77] No whitelist given
      I0210 02:47:31.261754 19602 hierarchical.cpp:161] Initialized hierarchical allocator process
      I0210 02:47:31.262462 19597 master.cpp:2124] Elected as the leading master!
      I0210 02:47:31.262482 19597 master.cpp:1646] Recovering from registrar
      I0210 02:47:31.262545 19603 registrar.cpp:329] Recovering registrar
      I0210 02:47:31.262774 19602 registrar.cpp:362] Successfully fetched the registry (0B) in 201984ns
      I0210 02:47:31.262809 19602 registrar.cpp:461] Applied 1 operations in 2963ns; attempting to update the registry
      I0210 02:47:31.263062 19599 registrar.cpp:506] Successfully updated the registry in 214016ns
      I0210 02:47:31.263119 19599 registrar.cpp:392] Successfully recovered registrar
      I0210 02:47:31.263267 19597 master.cpp:1762] Recovered 0 agents from the registry (172B); allowing 10mins for agents to re-register
      I0210 02:47:31.263295 19598 hierarchical.cpp:188] Skipping recovery of hierarchical allocator: nothing to recover
      I0210 02:47:31.264645 19578 cluster.cpp:446] Creating default 'local' authorizer
      I0210 02:47:31.265029 19598 slave.cpp:211] Mesos agent started on (105)@10.150.111.24:33608
      I0210 02:47:31.265187 19578 scheduler.cpp:184] Version: 1.3.0
      I0210 02:47:31.265043 19598 slave.cpp:212] Flags at startup: --acls="" --appc_simple_discovery_uri_prefix="http://" --appc_store_dir="/tmp/mesos/store/appc" --authenticate_http_readonly="true" --authenticate_http_readwrite="true" --authenticatee="crammd5" --authentication_backoff_factor="1secs" --authorizer="local" --cgroups_cpu_enable_pids_and_tids_count="false" --cgroups_enable_cfs="false" --cgroups_hierarchy="/sys/fs/cgroup" --cgroups_limit_swap="false" --cgroups_root="mesos" --container_disk_watch_interval="15secs" --containerizers="mesos" --credential="/tmp/HttpFaultToleranceTest_SchedulerFailoverFrameworkToExecutorMessage_I1eg7W/credential" --default_role="*" --disk_watch_interval="1mins" --docker="docker" --docker_kill_orphans="true" --docker_registry="https://registry-1.docker.io" --docker_remove_delay="6hrs" --docker_socket="/var/run/docker.sock" --docker_stop_timeout="0ns" --docker_store_dir="/tmp/mesos/store/docker" --docker_volume_checkpoint_dir="/var/run/mesos/isolators/docker/volume" --enforce_container_disk_quota="false" --executor_registration_timeout="1mins" --executor_shutdown_grace_period="5secs" --fetcher_cache_dir="/tmp/HttpFaultToleranceTest_SchedulerFailoverFrameworkToExecutorMessage_I1eg7W/fetch" --fetcher_cache_size="2GB" --frameworks_home="" --gc_delay="1weeks" --gc_disk_headroom="0.1" --hadoop_home="" --help="false" --hostname_lookup="true" --http_authenticators="basic" --http_command_executor="false" --http_credentials="/tmp/HttpFaultToleranceTest_SchedulerFailoverFrameworkToExecutorMessage_I1eg7W/http_credentials" --http_heartbeat_interval="30secs" --initialize_driver_logging="true" --isolation="posix/cpu,posix/mem" --launcher="linux" --launcher_dir="/home/ubuntu/workspace/mesos/Mesos_CI-build/FLAG/Plain/label/mesos-ec2-ubuntu-16.04/mesos/build/src" --logbufsecs="0" --logging_level="INFO" --max_completed_executors_per_framework="150" --oversubscribed_resources_interval="15secs" --perf_duration="10secs" --perf_interval="1mins" --qos_correction_interval_min="0ns" --quiet="false" --recover="reconnect" --recovery_timeout="15mins" --registration_backoff_factor="10ms" --resources="cpus:2;gpus:0;mem:1024;disk:1024;ports:[31000-32000]" --revocable_cpu_low_priority="true" --runtime_dir="/tmp/HttpFaultToleranceTest_SchedulerFailoverFrameworkToExecutorMessage_I1eg7W" --sandbox_directory="/mnt/mesos/sandbox" --strict="true" --switch_user="true" --systemd_enable_support="true" --systemd_runtime_directory="/run/systemd/system" --version="false" --work_dir="/tmp/HttpFaultToleranceTest_SchedulerFailoverFrameworkToExecutorMessage_g77kUC"
      I0210 02:47:31.265250 19598 credentials.hpp:86] Loading credential for authentication from '/tmp/HttpFaultToleranceTest_SchedulerFailoverFrameworkToExecutorMessage_I1eg7W/credential'
      I0210 02:47:31.265290 19598 slave.cpp:354] Agent using credential for: test-principal
      I0210 02:47:31.265303 19598 credentials.hpp:37] Loading credentials for authentication from '/tmp/HttpFaultToleranceTest_SchedulerFailoverFrameworkToExecutorMessage_I1eg7W/http_credentials'
      I0210 02:47:31.265349 19598 http.cpp:919] Using default 'basic' HTTP authenticator for realm 'mesos-agent-readonly'
      I0210 02:47:31.265388 19598 http.cpp:919] Using default 'basic' HTTP authenticator for realm 'mesos-agent-readwrite'
      I0210 02:47:31.265653 19599 scheduler.cpp:470] New master detected at master@10.150.111.24:33608
      I0210 02:47:31.265671 19599 scheduler.cpp:479] Waiting for 0ns before initiating a re-(connection) attempt with the master
      I0210 02:47:31.266012 19598 slave.cpp:541] Agent resources: cpus(*):2; mem(*):1024; disk(*):1024; ports(*):[31000-32000]
      I0210 02:47:31.266049 19598 slave.cpp:549] Agent attributes: [  ]
      I0210 02:47:31.266055 19598 slave.cpp:554] Agent hostname: ip-10-150-111-24.ec2.internal
      I0210 02:47:31.266111 19602 status_update_manager.cpp:177] Pausing sending status updates
      I0210 02:47:31.266312 19604 state.cpp:62] Recovering state from '/tmp/HttpFaultToleranceTest_SchedulerFailoverFrameworkToExecutorMessage_g77kUC/meta'
      I0210 02:47:31.266428 19597 status_update_manager.cpp:203] Recovering status update manager
      I0210 02:47:31.266559 19600 slave.cpp:5555] Finished recovery
      I0210 02:47:31.266736 19600 slave.cpp:5729] Querying resource estimator for oversubscribable resources
      I0210 02:47:31.266881 19600 status_update_manager.cpp:177] Pausing sending status updates
      I0210 02:47:31.266877 19598 slave.cpp:931] New master detected at master@10.150.111.24:33608
      I0210 02:47:31.266935 19598 slave.cpp:966] Detecting new master
      I0210 02:47:31.266990 19598 slave.cpp:5743] Received oversubscribable resources {} from the resource estimator
      I0210 02:47:31.267329 19603 scheduler.cpp:361] Connected with the master at http://10.150.111.24:33608/master/api/v1/scheduler
      I0210 02:47:31.267763 19601 scheduler.cpp:243] Sending SUBSCRIBE call to http://10.150.111.24:33608/master/api/v1/scheduler
      I0210 02:47:31.268118 19597 process.cpp:3704] Handling HTTP event for process 'master' with path: '/master/api/v1/scheduler'
      I0210 02:47:31.268470 19600 http.cpp:420] HTTP POST for /master/api/v1/scheduler from 10.150.111.24:58632
      I0210 02:47:31.268532 19600 master.cpp:2423] Received subscription request for HTTP framework 'default'
      I0210 02:47:31.268556 19600 master.cpp:2160] Authorizing framework principal 'test-principal' to receive offers for role '*'
      I0210 02:47:31.268748 19599 master.cpp:2537] Subscribing framework 'default' with checkpointing disabled and capabilities [  ]
      I0210 02:47:31.268887 19597 master.hpp:2119] Sending heartbeat to d8129420-2a04-48e7-9b28-6b0a0af73168-0000
      I0210 02:47:31.268909 19604 hierarchical.cpp:286] Added framework d8129420-2a04-48e7-9b28-6b0a0af73168-0000
      I0210 02:47:31.268954 19604 hierarchical.cpp:1810] No allocations performed
      I0210 02:47:31.268977 19604 hierarchical.cpp:1900] No inverse offers to send out!
      I0210 02:47:31.268992 19604 hierarchical.cpp:1394] Performed allocation for 0 agents in 45649ns
      I0210 02:47:31.269172 19598 scheduler.cpp:676] Enqueuing event SUBSCRIBED received from http://10.150.111.24:33608/master/api/v1/scheduler
      I0210 02:47:31.269356 19598 scheduler.cpp:676] Enqueuing event HEARTBEAT received from http://10.150.111.24:33608/master/api/v1/scheduler
      I0210 02:47:31.273422 19602 slave.cpp:993] Authenticating with master master@10.150.111.24:33608
      I0210 02:47:31.273448 19602 slave.cpp:1004] Using default CRAM-MD5 authenticatee
      I0210 02:47:31.273509 19599 authenticatee.cpp:121] Creating new client SASL connection
      I0210 02:47:31.273859 19599 master.cpp:7173] Authenticating slave(105)@10.150.111.24:33608
      I0210 02:47:31.273927 19598 authenticator.cpp:414] Starting authentication session for crammd5-authenticatee(241)@10.150.111.24:33608
      I0210 02:47:31.274020 19601 authenticator.cpp:98] Creating new server SASL connection
      I0210 02:47:31.274341 19601 authenticatee.cpp:213] Received SASL authentication mechanisms: CRAM-MD5
      I0210 02:47:31.274369 19601 authenticatee.cpp:239] Attempting to authenticate with mechanism 'CRAM-MD5'
      I0210 02:47:31.274416 19601 authenticator.cpp:204] Received SASL authentication start
      I0210 02:47:31.274457 19601 authenticator.cpp:326] Authentication requires more steps
      I0210 02:47:31.274494 19601 authenticatee.cpp:259] Received SASL authentication step
      I0210 02:47:31.274559 19598 authenticator.cpp:232] Received SASL authentication step
      I0210 02:47:31.274586 19598 auxprop.cpp:109] Request to lookup properties for user: 'test-principal' realm: 'ip-10-150-111-24.ec2.internal' server FQDN: 'ip-10-150-111-24.ec2.internal' SASL_AUXPROP_VERIFY_AGAINST_HASH: false SASL_AUXPROP_OVERRIDE: false SASL_AUXPROP_AUTHZID: false 
      I0210 02:47:31.274600 19598 auxprop.cpp:181] Looking up auxiliary property '*userPassword'
      I0210 02:47:31.274615 19598 auxprop.cpp:181] Looking up auxiliary property '*cmusaslsecretCRAM-MD5'
      I0210 02:47:31.274629 19598 auxprop.cpp:109] Request to lookup properties for user: 'test-principal' realm: 'ip-10-150-111-24.ec2.internal' server FQDN: 'ip-10-150-111-24.ec2.internal' SASL_AUXPROP_VERIFY_AGAINST_HASH: false SASL_AUXPROP_OVERRIDE: false SASL_AUXPROP_AUTHZID: true 
      I0210 02:47:31.274641 19598 auxprop.cpp:131] Skipping auxiliary property '*userPassword' since SASL_AUXPROP_AUTHZID == true
      I0210 02:47:31.274682 19598 auxprop.cpp:131] Skipping auxiliary property '*cmusaslsecretCRAM-MD5' since SASL_AUXPROP_AUTHZID == true
      I0210 02:47:31.274701 19598 authenticator.cpp:318] Authentication success
      I0210 02:47:31.274760 19603 authenticatee.cpp:299] Authentication success
      I0210 02:47:31.274780 19601 authenticator.cpp:432] Authentication session cleanup for crammd5-authenticatee(241)@10.150.111.24:33608
      I0210 02:47:31.274793 19598 master.cpp:7203] Successfully authenticated principal 'test-principal' at slave(105)@10.150.111.24:33608
      I0210 02:47:31.274907 19603 slave.cpp:1088] Successfully authenticated with master master@10.150.111.24:33608
      I0210 02:47:31.274969 19603 slave.cpp:1515] Will retry registration in 1.825425ms if necessary
      I0210 02:47:31.275038 19602 master.cpp:5388] Registering agent at slave(105)@10.150.111.24:33608 (ip-10-150-111-24.ec2.internal) with id d8129420-2a04-48e7-9b28-6b0a0af73168-S0
      I0210 02:47:31.275151 19602 registrar.cpp:461] Applied 1 operations in 12479ns; attempting to update the registry
      I0210 02:47:31.275413 19600 registrar.cpp:506] Successfully updated the registry in 237056ns
      I0210 02:47:31.275595 19601 slave.cpp:4343] Received ping from slave-observer(98)@10.150.111.24:33608
      I0210 02:47:31.275621 19600 master.cpp:5462] Registered agent d8129420-2a04-48e7-9b28-6b0a0af73168-S0 at slave(105)@10.150.111.24:33608 (ip-10-150-111-24.ec2.internal) with cpus(*):2; mem(*):1024; disk(*):1024; ports(*):[31000-32000]
      I0210 02:47:31.275703 19597 hierarchical.cpp:516] Added agent d8129420-2a04-48e7-9b28-6b0a0af73168-S0 (ip-10-150-111-24.ec2.internal) with cpus(*):2; mem(*):1024; disk(*):1024; ports(*):[31000-32000] (allocated: {})
      I0210 02:47:31.275722 19601 slave.cpp:1134] Registered with master master@10.150.111.24:33608; given agent ID d8129420-2a04-48e7-9b28-6b0a0af73168-S0
      I0210 02:47:31.275737 19601 fetcher.cpp:90] Clearing fetcher cache
      I0210 02:47:31.275805 19599 status_update_manager.cpp:184] Resuming sending status updates
      I0210 02:47:31.275915 19601 slave.cpp:1162] Checkpointing SlaveInfo to '/tmp/HttpFaultToleranceTest_SchedulerFailoverFrameworkToExecutorMessage_g77kUC/meta/slaves/d8129420-2a04-48e7-9b28-6b0a0af73168-S0/slave.info'
      I0210 02:47:31.275993 19597 hierarchical.cpp:1900] No inverse offers to send out!
      I0210 02:47:31.276023 19597 hierarchical.cpp:1394] Performed allocation for 1 agents in 263025ns
      I0210 02:47:31.276077 19601 slave.cpp:1200] Forwarding total oversubscribed resources {}
      I0210 02:47:31.276083 19602 master.cpp:7002] Sending 1 offers to framework d8129420-2a04-48e7-9b28-6b0a0af73168-0000 (default)
      I0210 02:47:31.276250 19602 master.cpp:6019] Received update of agent d8129420-2a04-48e7-9b28-6b0a0af73168-S0 at slave(105)@10.150.111.24:33608 (ip-10-150-111-24.ec2.internal) with total oversubscribed resources {}
      I0210 02:47:31.276347 19599 hierarchical.cpp:589] Agent d8129420-2a04-48e7-9b28-6b0a0af73168-S0 (ip-10-150-111-24.ec2.internal) updated with oversubscribed resources {} (total: cpus(*):2; mem(*):1024; disk(*):1024; ports(*):[31000-32000], allocated: cpus(*)(allocated: *):2; mem(*)(allocated: *):1024; disk(*)(allocated: *):1024; ports(*)(allocated: *):[31000-32000])
      I0210 02:47:31.276420 19599 hierarchical.cpp:1810] No allocations performed
      I0210 02:47:31.276445 19599 hierarchical.cpp:1900] No inverse offers to send out!
      I0210 02:47:31.276461 19599 hierarchical.cpp:1394] Performed allocation for 1 agents in 56740ns
      I0210 02:47:31.276593 19598 scheduler.cpp:676] Enqueuing event OFFERS received from http://10.150.111.24:33608/master/api/v1/scheduler
      I0210 02:47:31.277225 19601 scheduler.cpp:243] Sending ACCEPT call to http://10.150.111.24:33608/master/api/v1/scheduler
      I0210 02:47:31.277544 19597 process.cpp:3704] Handling HTTP event for process 'master' with path: '/master/api/v1/scheduler'
      I0210 02:47:31.277899 19598 http.cpp:420] HTTP POST for /master/api/v1/scheduler from 10.150.111.24:58630
      I0210 02:47:31.278055 19598 master.cpp:3776] Processing ACCEPT call for offers: [ d8129420-2a04-48e7-9b28-6b0a0af73168-O0 ] on agent d8129420-2a04-48e7-9b28-6b0a0af73168-S0 at slave(105)@10.150.111.24:33608 (ip-10-150-111-24.ec2.internal) for framework d8129420-2a04-48e7-9b28-6b0a0af73168-0000 (default)
      I0210 02:47:31.278100 19598 master.cpp:3352] Authorizing framework principal 'test-principal' to launch task 512412f2-4982-435f-a963-8dc60a3fb03e
      W0210 02:47:31.278590 19598 validation.cpp:1064] Executor 'default' for task '512412f2-4982-435f-a963-8dc60a3fb03e' uses less CPUs (None) than the minimum required (0.01). Please update your executor, as this will be mandatory in future releases.
      W0210 02:47:31.278615 19598 validation.cpp:1076] Executor 'default' for task '512412f2-4982-435f-a963-8dc60a3fb03e' uses less memory (None) than the minimum required (32MB). Please update your executor, as this will be mandatory in future releases.
      I0210 02:47:31.278728 19598 master.cpp:9063] Adding task 512412f2-4982-435f-a963-8dc60a3fb03e with resources cpus(*)(allocated: *):2; mem(*)(allocated: *):1024; disk(*)(allocated: *):1024; ports(*)(allocated: *):[31000-32000] on agent d8129420-2a04-48e7-9b28-6b0a0af73168-S0 at slave(105)@10.150.111.24:33608 (ip-10-150-111-24.ec2.internal)
      I0210 02:47:31.278810 19598 master.cpp:4426] Launching task 512412f2-4982-435f-a963-8dc60a3fb03e of framework d8129420-2a04-48e7-9b28-6b0a0af73168-0000 (default) with resources cpus(*)(allocated: *):2; mem(*)(allocated: *):1024; disk(*)(allocated: *):1024; ports(*)(allocated: *):[31000-32000] on agent d8129420-2a04-48e7-9b28-6b0a0af73168-S0 at slave(105)@10.150.111.24:33608 (ip-10-150-111-24.ec2.internal)
      I0210 02:47:31.278962 19599 slave.cpp:1625] Got assigned task '512412f2-4982-435f-a963-8dc60a3fb03e' for framework d8129420-2a04-48e7-9b28-6b0a0af73168-0000
      I0210 02:47:31.279145 19599 slave.cpp:1785] Launching task '512412f2-4982-435f-a963-8dc60a3fb03e' for framework d8129420-2a04-48e7-9b28-6b0a0af73168-0000
      I0210 02:47:31.279255 19603 hierarchical.cpp:807] Updated allocation of framework d8129420-2a04-48e7-9b28-6b0a0af73168-0000 on agent d8129420-2a04-48e7-9b28-6b0a0af73168-S0 from cpus(*)(allocated: *):2; mem(*)(allocated: *):1024; disk(*)(allocated: *):1024; ports(*)(allocated: *):[31000-32000] to cpus(*)(allocated: *):2; mem(*)(allocated: *):1024; disk(*)(allocated: *):1024; ports(*)(allocated: *):[31000-32000]
      I0210 02:47:31.279412 19599 paths.cpp:547] Trying to chown '/tmp/HttpFaultToleranceTest_SchedulerFailoverFrameworkToExecutorMessage_g77kUC/slaves/d8129420-2a04-48e7-9b28-6b0a0af73168-S0/frameworks/d8129420-2a04-48e7-9b28-6b0a0af73168-0000/executors/default/runs/f287deb2-0a21-4be0-ad54-613d7a35087d' to user 'root'
      I0210 02:47:31.279538 19599 slave.cpp:6477] Launching executor 'default' of framework d8129420-2a04-48e7-9b28-6b0a0af73168-0000 with resources {} in work directory '/tmp/HttpFaultToleranceTest_SchedulerFailoverFrameworkToExecutorMessage_g77kUC/slaves/d8129420-2a04-48e7-9b28-6b0a0af73168-S0/frameworks/d8129420-2a04-48e7-9b28-6b0a0af73168-0000/executors/default/runs/f287deb2-0a21-4be0-ad54-613d7a35087d'
      I0210 02:47:31.279744 19599 slave.cpp:2115] Queued task '512412f2-4982-435f-a963-8dc60a3fb03e' for executor 'default' of framework d8129420-2a04-48e7-9b28-6b0a0af73168-0000
      I0210 02:47:31.279788 19599 slave.cpp:884] Successfully attached file '/tmp/HttpFaultToleranceTest_SchedulerFailoverFrameworkToExecutorMessage_g77kUC/slaves/d8129420-2a04-48e7-9b28-6b0a0af73168-S0/frameworks/d8129420-2a04-48e7-9b28-6b0a0af73168-0000/executors/default/runs/f287deb2-0a21-4be0-ad54-613d7a35087d'
      I0210 02:47:31.280185 19604 executor.cpp:192] Version: 1.3.0
      I0210 02:47:31.280642 19597 executor.cpp:401] Connected with the agent
      I0210 02:47:31.280867 19603 executor.cpp:302] Sending SUBSCRIBE call to http://10.150.111.24:33608/slave(105)/api/v1/executor
      I0210 02:47:31.281189 19597 process.cpp:3704] Handling HTTP event for process 'slave(105)' with path: '/slave(105)/api/v1/executor'
      I0210 02:47:31.281433 19597 http.cpp:307] HTTP POST for /slave(105)/api/v1/executor from 10.150.111.24:58634
      I0210 02:47:31.281486 19597 slave.cpp:3166] Received Subscribe request for HTTP executor 'default' of framework d8129420-2a04-48e7-9b28-6b0a0af73168-0000
      I0210 02:47:31.281761 19597 slave.cpp:2328] Sending queued task '512412f2-4982-435f-a963-8dc60a3fb03e' to executor 'default' of framework d8129420-2a04-48e7-9b28-6b0a0af73168-0000 (via HTTP)
      I0210 02:47:31.282199 19602 executor.cpp:716] Enqueuing event SUBSCRIBED received from http://10.150.111.24:33608/slave(105)/api/v1/executor
      I0210 02:47:31.282369 19602 executor.cpp:716] Enqueuing event LAUNCH received from http://10.150.111.24:33608/slave(105)/api/v1/executor
      I0210 02:47:31.282842 19578 scheduler.cpp:184] Version: 1.3.0
      I0210 02:47:31.283253 19604 scheduler.cpp:470] New master detected at master@10.150.111.24:33608
      I0210 02:47:31.283275 19604 scheduler.cpp:479] Waiting for 0ns before initiating a re-(connection) attempt with the master
      I0210 02:47:31.284811 19602 scheduler.cpp:361] Connected with the master at http://10.150.111.24:33608/master/api/v1/scheduler
      I0210 02:47:31.285277 19599 scheduler.cpp:243] Sending SUBSCRIBE call to http://10.150.111.24:33608/master/api/v1/scheduler
      I0210 02:47:31.285604 19604 process.cpp:3704] Handling HTTP event for process 'master' with path: '/master/api/v1/scheduler'
      I0210 02:47:31.285926 19602 http.cpp:420] HTTP POST for /master/api/v1/scheduler from 10.150.111.24:58640
      I0210 02:47:31.285991 19602 master.cpp:2423] Received subscription request for HTTP framework 'default'
      I0210 02:47:31.286016 19602 master.cpp:2160] Authorizing framework principal 'test-principal' to receive offers for role '*'
      I0210 02:47:31.286171 19598 master.cpp:2537] Subscribing framework 'default' with checkpointing disabled and capabilities [  ]
      I0210 02:47:31.286193 19598 master.cpp:2583] Updating info for framework d8129420-2a04-48e7-9b28-6b0a0af73168-0000
      I0210 02:47:31.286348 19603 master.hpp:2119] Sending heartbeat to d8129420-2a04-48e7-9b28-6b0a0af73168-0000
      I0210 02:47:31.286393 19601 slave.cpp:2824] Updating framework d8129420-2a04-48e7-9b28-6b0a0af73168-0000 pid to @0.0.0.0:0
      I0210 02:47:31.286487 19603 status_update_manager.cpp:184] Resuming sending status updates
      I0210 02:47:31.286502 19600 scheduler.cpp:676] Enqueuing event ERROR received from http://10.150.111.24:33608/master/api/v1/scheduler
      I0210 02:47:31.286542 19603 master.cpp:1237] Ignoring disconnection for framework d8129420-2a04-48e7-9b28-6b0a0af73168-0000 (default) as it has already reconnected
      I0210 02:47:31.286800 19603 scheduler.cpp:676] Enqueuing event SUBSCRIBED received from http://10.150.111.24:33608/master/api/v1/scheduler
      E0210 02:47:31.286847 19602 scheduler.cpp:649] End-Of-File received from master. The master closed the event stream
      I0210 02:47:31.287020 19601 scheduler.cpp:444] Re-detecting master
      I0210 02:47:31.287117 19603 scheduler.cpp:676] Enqueuing event HEARTBEAT received from http://10.150.111.24:33608/master/api/v1/scheduler
      I0210 02:47:31.287179 19601 scheduler.cpp:395] Ignoring disconnection attempt from stale connection
      I0210 02:47:31.287207 19601 scheduler.cpp:395] Ignoring disconnection attempt from stale connection
      I0210 02:47:31.287338 19598 scheduler.cpp:470] New master detected at master@10.150.111.24:33608
      I0210 02:47:31.287356 19598 scheduler.cpp:479] Waiting for 0ns before initiating a re-(connection) attempt with the master
      I0210 02:47:31.287582 19598 scheduler.cpp:243] Sending MESSAGE call to http://10.150.111.24:33608/master/api/v1/scheduler
      I0210 02:47:31.288050 19600 process.cpp:3704] Handling HTTP event for process 'master' with path: '/master/api/v1/scheduler'
      I0210 02:47:31.288568 19604 http.cpp:420] HTTP POST for /master/api/v1/scheduler from 10.150.111.24:58638
      I0210 02:47:31.288591 19601 scheduler.cpp:361] Connected with the master at http://10.150.111.24:33608/master/api/v1/scheduler
      I0210 02:47:31.288615 19604 master.cpp:5271] Processing MESSAGE call from framework d8129420-2a04-48e7-9b28-6b0a0af73168-0000 (default) to agent d8129420-2a04-48e7-9b28-6b0a0af73168-S0 at slave(105)@10.150.111.24:33608 (ip-10-150-111-24.ec2.internal)
      I0210 02:47:31.289027 19603 executor.cpp:716] Enqueuing event MESSAGE received from http://10.150.111.24:33608/slave(105)/api/v1/executor
      I0210 02:47:31.289861 19597 master.cpp:1381] Framework d8129420-2a04-48e7-9b28-6b0a0af73168-0000 (default) disconnected
      I0210 02:47:31.289906 19597 master.cpp:3081] Deactivating framework d8129420-2a04-48e7-9b28-6b0a0af73168-0000 (default)
      I0210 02:47:31.289947 19597 master.cpp:3058] Disconnecting framework d8129420-2a04-48e7-9b28-6b0a0af73168-0000 (default)
      I0210 02:47:31.289978 19601 hierarchical.cpp:415] Deactivated framework d8129420-2a04-48e7-9b28-6b0a0af73168-0000
      I0210 02:47:31.290050 19597 master.cpp:1396] Giving framework d8129420-2a04-48e7-9b28-6b0a0af73168-0000 (default) 0ns to failover
      I0210 02:47:31.291277 19598 master.cpp:6842] Framework failover timeout, removing framework d8129420-2a04-48e7-9b28-6b0a0af73168-0000 (default)
      I0210 02:47:31.291296 19598 master.cpp:7727] Removing framework d8129420-2a04-48e7-9b28-6b0a0af73168-0000 (default)
      I0210 02:47:31.291358 19598 master.cpp:8312] Updating the state of task 512412f2-4982-435f-a963-8dc60a3fb03e of framework d8129420-2a04-48e7-9b28-6b0a0af73168-0000 (latest state: TASK_KILLED, status update state: TASK_KILLED)
      I0210 02:47:31.291368 19599 slave.cpp:2655] Asked to shut down framework d8129420-2a04-48e7-9b28-6b0a0af73168-0000 by master@10.150.111.24:33608
      I0210 02:47:31.291404 19599 slave.cpp:2680] Shutting down framework d8129420-2a04-48e7-9b28-6b0a0af73168-0000
      I0210 02:47:31.291416 19599 slave.cpp:5080] Shutting down executor 'default' of framework d8129420-2a04-48e7-9b28-6b0a0af73168-0000 (via HTTP)
      I0210 02:47:31.291527 19598 master.cpp:8406] Removing task 512412f2-4982-435f-a963-8dc60a3fb03e with resources cpus(*)(allocated: *):2; mem(*)(allocated: *):1024; disk(*)(allocated: *):1024; ports(*)(allocated: *):[31000-32000] of framework d8129420-2a04-48e7-9b28-6b0a0af73168-0000 on agent d8129420-2a04-48e7-9b28-6b0a0af73168-S0 at slave(105)@10.150.111.24:33608 (ip-10-150-111-24.ec2.internal)
      I0210 02:47:31.291599 19598 master.cpp:8435] Removing executor 'default' with resources {} of framework d8129420-2a04-48e7-9b28-6b0a0af73168-0000 on agent d8129420-2a04-48e7-9b28-6b0a0af73168-S0 at slave(105)@10.150.111.24:33608 (ip-10-150-111-24.ec2.internal)
      I0210 02:47:31.291600 19603 hierarchical.cpp:1064] Recovered cpus(*)(allocated: *):2; mem(*)(allocated: *):1024; disk(*)(allocated: *):1024; ports(*)(allocated: *):[31000-32000] (total: cpus(*):2; mem(*):1024; disk(*):1024; ports(*):[31000-32000], allocated: {}) on agent d8129420-2a04-48e7-9b28-6b0a0af73168-S0 from framework d8129420-2a04-48e7-9b28-6b0a0af73168-0000
      I0210 02:47:31.291684 19603 hierarchical.cpp:362] Removed framework d8129420-2a04-48e7-9b28-6b0a0af73168-0000
      I0210 02:47:31.292042 19603 executor.cpp:716] Enqueuing event SHUTDOWN received from http://10.150.111.24:33608/slave(105)/api/v1/executor
      *** Aborted at 1486694851 (unix time) try "date -d @1486694851" if you are using GNU date ***
      PC: @     0x55e75dde42b2 testing::internal::UntypedFunctionMockerBase::SetOwnerAndName()
      *** SIGBUS (@0x0) received by PID 19578 (TID 0x7fd07ac1f700) from PID 0; stack trace: ***
          @     0x7fd082e30390 (unknown)
          @     0x55e75dde42b2 testing::internal::UntypedFunctionMockerBase::SetOwnerAndName()
          @     0x55e75d11ebe6 mesos::internal::tests::executor::TestMesos<>::events()
          @     0x55e75d10f34f std::_Function_handler<>::_M_invoke()
          @     0x7fd084a840b2 process::AsyncExecutorProcess::execute<>()
          @     0x7fd084a9595a _ZNSt17_Function_handlerIFvPN7process11ProcessBaseEEZNS0_8dispatchI7NothingNS0_20AsyncExecutorProcessERKSt8functionIFvRKSt5queueIN5mesos2v18executor5EventESt5dequeISC_SaISC_EEEEESG_PvSK_SG_SN_EENS0_6FutureIT_EERKNS0_3PIDIT0_EEMSS_FSP_T1_T2_T3_ET4_T5_T6_EUlS2_E_E9_M_invokeERKSt9_Any_dataOS2_
          @     0x7fd0857b340c process::ProcessBase::visit()
          @     0x7fd0857c5c33 process::ProcessManager::resume()
          @     0x7fd0857d06e7 _ZNSt6thread5_ImplISt12_Bind_simpleIFZN7process14ProcessManager12init_threadsEvEUt_vEEE6_M_runEv
          @     0x7fd083613c80 (unknown)
          @     0x7fd082e266ba start_thread
          @     0x7fd082b5c82d (unknown)
      

      Attachments

        Issue Links

          Activity

            People

              Unassigned Unassigned
              bbannier Benjamin Bannier
              Votes:
              0 Vote for this issue
              Watchers:
              4 Start watching this issue

              Dates

                Created:
                Updated: