Uploaded image for project: 'River (Retired)'
  1. River (Retired)
  2. RIVER-230

(mux) SelectionManager catch Error block assumes getMessage() returns non-null

    XMLWordPrintableJSON

Details

    • Bug
    • Status: Closed
    • Minor
    • Resolution: Fixed
    • jtsk_2.0
    • River_2.1.2
    • net_jini_jeri
    • None
    • 6332547

    Description

      Bugtraq ID 6332547

      com.sun.jini.jeri.internal.runtime.SelectionManager's waitForReadyKey method has the following catch block in order to work around JDK bug 4458268:

      		} catch (Error e) {
      		    if (e.getMessage().startsWith("POLLNVAL")) {
      			Thread.yield();
      			continue;		// work around 4458268
      		    } else {
      			throw e;
      		    }
      

      The problem is that Error.getMessage() may return null (in cases other than those that this code was intended to handle), in which case a NullPointerException will occur. Such a NullPointerException has been observed in the output of tests that cause an out-of-memory condition due to thread creation (in particular, the regression test /vob/qa/jtreg/net/jini/jeri/tcp/outOfThreads/OutOfThreads.java on Solaris 9):

      Oct 3, 2005 2:00:52 AM com.sun.jini.jeri.internal.runtime.SelectionManager$SelectLoop run
      WARNING: select loop throws
      java.lang.NullPointerException
      	at com.sun.jini.jeri.internal.runtime.SelectionManager.waitForReadyKey(SelectionManager.java:369)
      	at com.sun.jini.jeri.internal.runtime.SelectionManager.access$600(SelectionManager.java:80)
      	at com.sun.jini.jeri.internal.runtime.SelectionManager$SelectLoop.run(SelectionManager.java:287)
      	at com.sun.jini.thread.ThreadPool$Worker.run(ThreadPool.java:136)
      	at java.lang.Thread.run(Thread.java:595)
      (repeated)
      

      With the fix for 6304782, the select loop will continue after such an exception, but the type and stack trace of the original exception does not get logged properly (only the NPE gets logged).

      -------------

      Two examples of this phenomenon in test failures: (Sun internal URLs removed).

      In the second case, the VM terminated abruptly because of the out-of-memory condition anyway (not much we can do about that test failure mode?).

      In the first case, the harness timed out the test. Was that because I/O could not make progress because of the recurring select loop failures (perhaps also slowed by the loop throttling after repeated failures)?

      Attachments

        1. RIVER-230.patch
          0.6 kB
          Peter Jones

        Issue Links

          Activity

            People

              peter.jones Peter Jones
              foliver Fred Oliver
              Votes:
              0 Vote for this issue
              Watchers:
              0 Start watching this issue

              Dates

                Created:
                Updated:
                Resolved: