Uploaded image for project: 'CouchDB'
  1. CouchDB
  2. COUCHDB-1194

CouchDB 1.1 _replicator replications do not retry on timeout

    XMLWordPrintableJSON

Details

    • Bug
    • Status: Resolved
    • Major
    • Resolution: Fixed
    • 1.1
    • 1.1.1
    • Replication
    • None
    • Linux/amd64, Erlang 13B4

    Description

      With 1.1, we started using the _replicator database to try and ensure that two continuous replication processes always run. To help with this, we have max_replication_retry_count = 100 in the replicator section of the configuration. However, it seems that, when there's a "changes loop timeout", the retry process isn't used, so that I end up without active replication (and the replication state in the documents gets set to error). This seems like a bug.

      [Wed, 15 Jun 2011 05:52:33 GMT] [debug] [<0.159.0>] target doesn't need a full commit
      [Wed, 15 Jun 2011 05:52:33 GMT] [info] [<0.159.0>] recording a checkpoint for http://10.8.0.1:5984/fix/ -> fix at source update_seq 1340615
      [Wed, 15 Jun 2011 05:53:09 GMT] [error] [<0.167.0>] changes loop timeout, no data received from http://10.8.0.1:5984/fix/
      [Wed, 15 Jun 2011 05:53:09 GMT] [error] [<0.167.0>] ** Generic server <0.167.0> terminating

        • Last message in was changes_timeout
        • When Server state == {state,nil,nil,
          [<0.159.0>,
          Unknown macro: {http_db,"http}

          ,
          <0.169.0>,<0.172.0>,[],0,

          {[],[]},
          {<0.173.0>,#Ref<0.0.3.202798>},
          false,0,nil,[],
          [1340615]}]},
          {gen_server,handle_msg,5},
          {proc_lib,init_p_do_apply,3}]}},
          {'EXIT',<0.173.0>,
          {{function_clause,
          [{couch_rep_reader,handle_info,
          [{'EXIT',<0.172.0>,
          {{changes_timeout,
          {gen_server,call,
          [<0.167.0>,next_changes,infinity]}},
          {gen_server,call, [<0.169.0>,next_missing_revs,infinity]}}},
          {state,<0.159.0>,
          {http_db,"http://10.8.0.1:5984/fix/",[],[],
          [{"User-Agent","CouchDB/1.1.0"},
          {"Accept","application/json"},
          {"Accept-Encoding","gzip"}],
          [],get,nil,
          [{response_format,binary},
          {inactivity_timeout,30000}],
          10,500,nil},
          <0.169.0>,<0.172.0>,[],0,
          {[],[]}

          ,

          {<0.173.0>,#Ref<0.0.3.202798>}

          ,
          false,0,nil,[],
          [1340615]}]},

          {gen_server,handle_msg,5}

          ,

          {proc_lib,init_p_do_apply,3}

          ]},

          {gen_server,call, [<0.171.0>,next_docs,infinity]}

          }}]},

          {links,[<0.83.0>]}

          ,
          {dictionary,
          [{task_status_update,{{1308,117148,88291},0}}]},

          {trap_exit,true}

          ,

          {status,running}

          ,

          {heap_size,6765}

          ,

          {stack_size,24}

          ,

          {reductions,271204}

          ],
          []]}}
          [Wed, 15 Jun 2011 05:53:09 GMT] [error] [<0.83.0>] {error_report,<0.32.0>,
          {<0.83.0>,supervisor_report,
          [{supervisor,{local,couch_rep_sup}},

          {errorContext,child_terminated}

          ,

          {reason,changes_timeout}

          ,
          {offender,
          [

          {pid,<0.159.0>}

          ,

          {name, "812005cfdfded6c80a444f85d71caffd+continuous"}

          ,
          {mfa,
          {gen_server,start_link,
          [couch_rep,
          ["812005cfdfded6c80a444f85d71caffd",

          Unknown macro: {[{<<"_id">>, <<"c7311d13160739f3d1b9458b2300242e">>}, {<<"_rev">>, <<"18-94d8df687ec8434de355fa1c630fb14d">>}, {<<"source">>, <<"http://10.8.0.1:5984/fix">>}, {<<"target">>,<<"fix">>}, {<<"continuous">>,true}, {<<"_replication_state_time">>, <<"2011-06-15T07:23:15+02:00">>}, {<<"_replication_id">>, <<"812005cfdfded6c80a444f85d71caffd">>}, {<<"_replication_state">>, <<"triggered">>}]}

          ,

          {user_ctx,null,[],undefined}

          ],
          []]}},

          {restart_type,temporary}

          ,

          {shutdown,1}

          ,

          {child_type,worker}

          ]}]}}
          [Wed, 15 Jun 2011 05:53:09 GMT] [debug] [<0.134.0>] Include Doc: <<"c7311d13160739f3d1b9458b2300242e">>

          {19, <<220,6, 99,17, 13,51, 140,159, 244,19, 55,198, 109,204, 109,149>>}

      Attachments

        Activity

          People

            fdmanana Filipe David Borba Manana
            djc Dirkjan
            Votes:
            0 Vote for this issue
            Watchers:
            0 Start watching this issue

            Dates

              Created:
              Updated:
              Resolved: