Mahout / MAHOUT-1767

Unable to run tests on H2O engine in distributed mode

Details

    • Type: Bug
    • Status: Resolved
    • Priority: Major
    • Resolution: Won't Fix
    • Affects Version/s: 0.11.0
    • Fix Version/s: 1.0.0
    • Component/s: classic
    • Labels: None

    Description

      Following the instructions in the H2O module's README.md to run the tests in distributed mode results in the tests running only in local mode. The instructions consist of three steps:

      1. host-1:~/mahout$ ./bin/mahout h2o-node
        ...
        .. INFO: Cloud of size 1 formed [/W.X.Y.Z:54321]
        
      2. host-2:~/mahout$ ./bin/mahout h2o-node
        ...
        .. INFO: Cloud of size 2 formed [/A.B.C.D:54322]
        
      3. host-N:~/mahout/h2o$ mvn test
        ...
        .. INFO: Cloud of size 3 formed [/E.F.G.H:54323]
        ...
        All tests passed.
        ...
        host-N:~/mahout/h2o$
        

      The first two steps start worker nodes; the last one runs the tests. According to the instructions, launching the tests starts one more worker, which should join the same cloud that the other worker nodes form. However, it does not join them because it has a different cloud name (or masterURL in terms of the code). The code contains the following:

      DistributedH2OSuite.scala
      ...
      mahoutCtx = mahoutH2OContext("mah2out" + System.currentTimeMillis())
      ...
      

      After we removed the code that appends the current time to the cloud name, the tests started to run in distributed mode.
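
The mismatch can be illustrated with a minimal, self-contained Scala sketch. It does not call Mahout or H2O; it only reproduces the name computation from DistributedH2OSuite.scala. It assumes that the worker nodes were started under the plain cloud name "mah2out", which the workaround above suggests but the report does not state. The property name `test.h2o.cloud.name` and the helper `resolveCloudName` are hypothetical and exist only for illustration.

```scala
object CloudNameSketch {
  // Hypothetical property name, used only in this sketch; not a Mahout or H2O setting.
  val CloudNameProperty = "test.h2o.cloud.name"

  // Mirrors the expression in DistributedH2OSuite.scala: the name differs on every run.
  def timestampedName(nowMillis: Long): String = "mah2out" + nowMillis

  // Possible alternative: use a configured name when one is given,
  // otherwise fall back to the timestamped (isolated, local-only) name.
  def resolveCloudName(props: Map[String, String], nowMillis: Long): String =
    props.getOrElse(CloudNameProperty, timestampedName(nowMillis))

  def main(args: Array[String]): Unit = {
    // Assumption: the name the worker nodes were started with.
    val workerCloud = "mah2out"
    val now = System.currentTimeMillis()

    // The timestamped name never equals the workers' name, so the test node forms its own cloud.
    println(timestampedName(now) == workerCloud) // false

    // With an explicitly configured name, the test node would target the same cloud as the workers.
    println(resolveCloudName(Map(CloudNameProperty -> workerCloud), now) == workerCloud) // true
  }
}
```

Falling back to the timestamped name keeps local test runs isolated from any cloud that happens to be running, while a configured name would allow a deliberate distributed run; whether the project would accept such a switch is not covered by this report.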

            People

              Assignee: Andrew Palumbo (Andrew_Palumbo)
              Reporter: Dmitry Yaraev (dyaraev)
              Votes: 0
              Watchers: 3
