Apache Storm / STORM-1915

Supervisor keeps restarting forever


Details

    • Type: Bug
    • Status: Resolved
    • Priority: Major
    • Resolution: Fixed
    • Affects Version/s: 1.0.1
    • Fix Version/s: 2.0.0, 1.0.2, 1.1.0
    • Component/s: storm-core
    • Labels: None
    • Environment: Linode 4GB running on KVM - Ubuntu 14.04 LTS

    Description

      When I submit a topology to a cluster of 20 nodes and 40 workers, the supervisor keeps throwing errors and keeps restarting the workers it is supervising.

      Because of this, the topology never starts. Instead, the bolts and spouts keep getting reassigned forever.

      I'd love to attach the logs here but I can't find any upload button in the JIRA form.

      The error basically says:

      2016-06-18 12:04:26.589 o.a.s.config [WARN] Failed to get worker user for . #error {
       :cause /home/fogetti/downloads/apache-storm-1.0.1/storm-local/workers-users (Is a directory)
       :via
       [{:type java.io.FileNotFoundException
         :message /home/fogetti/downloads/apache-storm-1.0.1/storm-local/workers-users (Is a directory)
         :at [java.io.FileInputStream open0 FileInputStream.java -2]}]
       :trace
       [[java.io.FileInputStream open0 FileInputStream.java -2]
        [java.io.FileInputStream open FileInputStream.java 195]
        [java.io.FileInputStream <init> FileInputStream.java 138]
        [clojure.java.io$fn__9189 invoke io.clj 229]
        [clojure.java.io$fn__9102$G__9095__9109 invoke io.clj 69]
        [clojure.java.io$fn__9201 invoke io.clj 258]
        [clojure.java.io$fn__9102$G__9095__9109 invoke io.clj 69]
        [clojure.java.io$fn__9163 invoke io.clj 165]
        [clojure.java.io$fn__9115$G__9091__9122 invoke io.clj 69]
        [clojure.java.io$reader doInvoke io.clj 102]
        [clojure.lang.RestFn invoke RestFn.java 410]
        [clojure.lang.AFn applyToHelper AFn.java 154]
        [clojure.lang.RestFn applyTo RestFn.java 132]
        [clojure.core$apply invoke core.clj 632]
        [clojure.core$slurp doInvoke core.clj 6653]
        [clojure.lang.RestFn invoke RestFn.java 410]
        [org.apache.storm.config$get_worker_user invoke config.clj 239]
        [org.apache.storm.daemon.supervisor$shutdown_worker invoke supervisor.clj 281]
        [org.apache.storm.daemon.supervisor$kill_existing_workers_with_change_in_components invoke supervisor.clj 536]
        [org.apache.storm.daemon.supervisor$mk_synchronize_supervisor$this__9078 invoke supervisor.clj 595]
        [org.apache.storm.event$event_manager$fn__8630 invoke event.clj 40]
        [clojure.lang.AFn run AFn.java 22]
        [java.lang.Thread run Thread.java 745]]}
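
      The trace shows `slurp` trying to read the `workers-users` directory itself, and the log message has an empty worker ID ("Failed to get worker user for ."). One plausible reading is that an empty worker ID was joined onto the directory path, so the lookup resolved to the directory rather than a file inside it. The sketch below reproduces only that JVM behaviour under this assumption. `WorkerUserLookupSketch`, `workerUserFile` and `openFailsWithFileNotFound` are hypothetical names for illustration, not Storm's API, and the temporary directory stands in for `storm-local/workers-users`.

```java
import java.io.File;
import java.io.FileInputStream;
import java.io.FileNotFoundException;
import java.io.IOException;
import java.nio.file.Files;

public class WorkerUserLookupSketch {

    // Hypothetical helper (not Storm's API): joins the workers-users
    // directory and a worker ID into a single path.
    static File workerUserFile(File workersUsersDir, String workerId) {
        return new File(workersUsersDir.getPath() + File.separator + workerId);
    }

    // Returns true if opening the path for reading fails with
    // FileNotFoundException, which FileInputStream throws when the
    // path is missing or is a directory rather than a regular file.
    static boolean openFailsWithFileNotFound(File path) throws IOException {
        try (FileInputStream in = new FileInputStream(path)) {
            return false;
        } catch (FileNotFoundException e) {
            System.out.println("Failed to open: " + e.getMessage());
            return true;
        }
    }

    public static void main(String[] args) throws IOException {
        // Stand-in for storm-local/workers-users (assumption for the demo).
        File dir = Files.createTempDirectory("workers-users").toFile();
        dir.deleteOnExit();

        // An empty worker ID makes the joined path point at the directory itself.
        File resolved = workerUserFile(dir, "");
        System.out.println("resolves to a directory: " + resolved.isDirectory());
        System.out.println("open fails: " + openFailsWithFileNotFound(resolved));
    }
}
```

      If this reading is right, a guard that skips the lookup when the worker ID is empty, or when the resolved path is a directory, would avoid the exception. Whether that matches the actual fix for this issue is not stated here.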
      

            People

              Assignee: Jungtaek Lim (kabhwan)
              Reporter: Gergely Nagy (fogetti)
              Votes: 3
              Watchers: 5
