Uploaded image for project: 'Flume'
  1. Flume
  2. FLUME-948

[Agent Reliability] In BEChain mode,if primary collector is off,but the secondary collector is on,agent's event send to null sink, instead of seconday collector

    XMLWordPrintableJSON

    Details

    • Type: Bug
    • Status: Resolved
    • Priority: Critical
    • Resolution: Won't Fix
    • Affects Version/s: 0.9.3
    • Fix Version/s: 0.9.5
    • Component/s: Sinks+Sources
    • Labels:
      None

      Description

      Here is my config for flume.
      agent: : rpcSource(333)| agentDFOChain("192.168.130.15:17876","192.168.130.14:17876")
      collector: collector(17876)|myCustomplugin

      Here is my test case:

      1. use an rpcClient send one event to agent every munite.
      2. shutdown the primay collector and sendory collector
      3. wait about 1.5h, start the sendory collector

      In my expect, the events received by agent at the posted time of the secondary recovered, should be send to sendory collector, but in actually, the events just be discard as it be send to the null sink in BEChain.

      Here is my log:[2012-01-20 10:19:00,098] [INFO ] [logicalNode ESC01_agent-24] [com.cloudera.flume.handlers.debug.StubbornAppendSink 76] Append failed java.net.SocketException: No route to host
      [2012-01-20 10:19:00,098] [INFO ] [logicalNode ESC01_agent-24] [com.cloudera.flume.handlers.thrift.ThriftEventSink 89] ThriftEventSink on port 17876 closed
      [2012-01-20 10:19:01,066] [WARN ] [Thread-2] [com.cloudera.flume.agent.MultiMasterRPC 198] Could not connect to any master nodes (tried 1: [192.168.130.13:17872])
      [2012-01-20 10:19:01,067] [INFO ] [Heartbeat] [com.cloudera.flume.agent.MultiMasterRPC 194] MasterRPC called while disconnected.
      [2012-01-20 10:19:03,102] [INFO ] [logicalNode ESC01_agent-24] [com.cloudera.flume.core.BackOffFailOverSink 143] Failed to open thrift event sink at 192.168.130.15:17876 : java.net.NoRouteToHostException: No route to host

      [2012-01-20 10:19:04,070] [WARN ] [Heartbeat] [com.cloudera.flume.agent.MultiMasterRPC 198] Could not connect to any master nodes (tried 1: [192.168.130.13:17872])
      [2012-01-20 10:19:06,070] [INFO ] [Thread-2] [com.cloudera.flume.agent.MultiMasterRPC 194] MasterRPC called while disconnected.
      [2012-01-20 10:19:06,106] [INFO ] [logicalNode ESC01_agent-24] [com.cloudera.flume.core.BackOffFailOverSink 143] Failed to open thrift event sink at 192.168.130.14:17876 : java.net.NoRouteToHostException: No route to host

      [2012-01-20 10:20:39,830] [INFO ] [logicalNode ESC01_agent-24] [com.cloudera.flume.core.BackOffFailOverSink 143] Failed to open thrift event sink at 192.168.130.15:17876 : java.net.NoRouteToHostException: No route to host
      [2012-01-20 10:21:47,997] [INFO ] [Thread-2] [com.cloudera.flume.agent.ThriftMasterRPC 78] Connected to master at 192.168.130.13:17872

      [2012-01-20 10:22:44,874] [INFO ] [logicalNode ESC01_agent-24] [com.cloudera.flume.core.BackOffFailOverSink 143] Failed to open thrift event sink at 192.168.130.15:17876 : java.net.NoRouteToHostException: No route to host
      [2012-01-20 10:23:35,951] [INFO ] [logicalNode ESC01_agent-24] [com.cloudera.flume.core.BackOffFailOverSink 143] Failed to open thrift event sink at 192.168.130.15:17876 : java.net.NoRouteToHostException: No route to host
      [2012-01-20 10:24:40,049] [INFO ] [logicalNode ESC01_agent-24] [com.cloudera.flume.core.BackOffFailOverSink 143] Failed to open thrift event sink at 192.168.130.15:17876 : java.net.NoRouteToHostException: No route to host
      [2012-01-20 10:25:44,139] [INFO ] [logicalNode ESC01_agent-24] [com.cloudera.flume.core.BackOffFailOverSink 143] Failed to open thrift event sink at 192.168.130.15:17876 : java.net.NoRouteToHostException: No route to host
      [2012-01-20 10:27:39,987] [INFO ] [logicalNode ESC01_agent-24] [com.cloudera.flume.core.BackOffFailOverSink 143] Failed to open thrift event sink at 192.168.130.15:17876 : java.net.NoRouteToHostException: No route to host
      [2012-01-20 10:29:39,914] [INFO ] [logicalNode ESC01_agent-24] [com.cloudera.flume.core.BackOffFailOverSink 143] Failed to open thrift event sink at 192.168.130.15:17876 : java.net.NoRouteToHostException: No route to host
      [2012-01-20 10:31:43,054] [INFO ] [logicalNode ESC01_agent-24] [com.cloudera.flume.core.BackOffFailOverSink 143] Failed to open thrift event sink at 192.168.130.15:17876 : java.net.NoRouteToHostException: No route to host

        Attachments

          Activity

            People

            • Assignee:
              Unassigned
              Reporter:
              csulyb@gmail.com jeff
            • Votes:
              0 Vote for this issue
              Watchers:
              1 Start watching this issue

              Dates

              • Created:
                Updated:
                Resolved:

                Time Tracking

                Estimated:
                Original Estimate - 96h
                96h
                Remaining:
                Remaining Estimate - 96h
                96h
                Logged:
                Time Spent - Not Specified
                Not Specified