Uploaded image for project: 'Apache Storm'
  1. Apache Storm
  2. STORM-3598

Storm UI visualization throws NullPointerException

    XMLWordPrintableJSON

Details

    • Bug
    • Status: Closed
    • Major
    • Resolution: Fixed
    • 2.0.0, 2.1.0
    • 2.2.0, 2.1.1
    • None

    Description

      We encountered an issue with visualization on UI. 

       

      2020-03-09 19:59:01.756 o.a.s.d.u.r.StormApiResource qtp1919834117-167291 [ERROR] Failure getting topology visualization
      java.lang.NullPointerException: null
              at org.apache.storm.stats.StatsUtil.mergeWithAddPair(StatsUtil.java:1855) ~[storm-server-2.2.0.y.jar:2.2.0.y]
              at org.apache.storm.stats.StatsUtil.expandAveragesSeq(StatsUtil.java:2308) ~[storm-server-2.2.0.y.jar:2.2.0.y]
              at org.apache.storm.stats.StatsUtil.aggregateAverages(StatsUtil.java:832) ~[storm-server-2.2.0.y.jar:2.2.0.y]
              at org.apache.storm.stats.StatsUtil.aggregateBoltStats(StatsUtil.java:731) ~[storm-server-2.2.0.y.jar:2.2.0.y]
              at org.apache.storm.stats.StatsUtil.boltStreamsStats(StatsUtil.java:900) ~[storm-server-2.2.0.y.jar:2.2.0.y]
              at org.apache.storm.daemon.ui.UIHelpers.getVisualizationData(UIHelpers.java:1939) ~[storm-webapp-2.2.0.y.jar:2.2.0.y]
              at org.apache.storm.daemon.ui.resources.StormApiResource.getTopologyVisualization(StormApiResource.java:423) ~[storm-webapp-2.2.0.y.jar:2.2.0.y]
      

      This is a bug in the code. https://github.com/apache/storm/blob/master/storm-server/src/main/java/org/apache/storm/stats/StatsUtil.java#L1846-L1858

      for (K kk : mm1.keySet()) {
                          List seq1 = mm1.get(kk);
                          List seq2 = mm2.get(kk);
                          List sums = new ArrayList();
                          for (int i = 0; i < seq1.size(); i++) {
                              if (seq1.get(i) instanceof Long) {
                                  sums.add(((Number) seq1.get(i)).longValue() + ((Number) seq2.get(i)).longValue());
                              } else {
                                  sums.add(((Number) seq1.get(i)).doubleValue() + ((Number) seq2.get(i)).doubleValue());
                              }
                          }
                          tmp.put(kk, sums);
                      }
      

      It assume mm1 and mm2 always have the same key, which is not true.

      And it can be reproduced by my example code: 

       

      public class  WordCountTopology extends ConfigurableTopology {
          private static final Logger LOG = LoggerFactory.getLogger(WordCountTopology.class);
      
          public static void main(String[] args) {
              ConfigurableTopology.start(new WordCountTopology(), args);
          }
      
          protected int run(String[] args) {
              TopologyBuilder builder = new TopologyBuilder();
      
              builder.setSpout("spout1", new RandomSpout(1), 1);
              builder.setSpout("spout2", new RandomSpout(2), 1);
              builder.setBolt("bolt", new RandomBolt(), 2).directGrouping("spout1", "stream1")
                      .directGrouping("spout2", "stream2");
      
              String topologyName = "word-count";
      
              conf.setNumWorkers(3);
      
              if (args != null && args.length > 0) {
                  topologyName = args[0];
              }
              return submit(topologyName, conf, builder);
          }
      
          static class RandomSpout extends BaseRichSpout {
              String stream;
              int id;
      
              public RandomSpout(int id) {
                  this.id = id;
                  stream = "stream" + id;
              }
      
              int taskId = 0;
              SpoutOutputCollector collector;
              public void open(Map<String, Object> conf, TopologyContext context, SpoutOutputCollector collector) {
                  taskId = context.getThisTaskId();
                  this.collector = collector;
              }
      
              /**
               * Different spout send tuples to different bolt via different stream.
               */
              public void nextTuple() {
                  LOG.info("emitting {}", id);
                  if (id == 1) {
                      Values val = new Values("test a sentence");
                      collector.emitDirect(2, stream, val, val);
                  } else {
                      Values val = new Values("test 2 sentence");
                      collector.emitDirect(3, stream, val, val);
                  }
                  try {
                      Thread.sleep(1000);
                  } catch (InterruptedException e) {
                      e.printStackTrace();
                  }
              }
      
              public void declareOutputFields(OutputFieldsDeclarer declarer) {
                  declarer.declareStream(stream, new Fields("word"));
              }
          }
      
          static class RandomBolt extends BaseBasicBolt {
      
              public void execute(Tuple input, BasicOutputCollector collector) {
                  LOG.info("executing:" + input.getSourceComponent());
              }
      
              public void declareOutputFields(OutputFieldsDeclarer declarer) {
      
              }
          }
      }
      

       In this example, one of the bolt will only receive data from stream1 and another bolt will only receive data from stream2. So in the map,

       

                          List seq1 = mm1.get(kk);
                          List seq2 = mm2.get(kk);
      

      seq1 is null if kk is stream1, seq2 is null if kk is stream2.

       

       We have other places aggregating executor stats without this problem because it's using different code https://github.com/apache/storm/blob/master/storm-server/src/main/java/org/apache/storm/stats/StatsUtil.java#L502-L513 and this problem has been taken cared of.

      Attachments

        Issue Links

          Activity

            People

              ethanli Ethan Li
              ethanli Ethan Li
              Votes:
              0 Vote for this issue
              Watchers:
              1 Start watching this issue

              Dates

                Created:
                Updated:
                Resolved:

                Time Tracking

                  Estimated:
                  Original Estimate - Not Specified
                  Not Specified
                  Remaining:
                  Remaining Estimate - 0h
                  0h
                  Logged:
                  Time Spent - 20m
                  20m