Uploaded image for project: 'Kylin'
  1. Kylin
  2. KYLIN-5595

[kylin 5.0] Launch Job Node not initialize spark session issue

    XMLWordPrintableJSON

Details

    • Improvement
    • Status: Resolved
    • Major
    • Resolution: Fixed
    • 5.0-beta
    • 5.0-beta
    • Job Engine
    • None

    Description

         Saving model will execute  'checkFlatTableSql'  method at job node, and it will not skip the step by default. When execute 'checkFlatTableSql' method, it will initialize spark session if the job node just started, the process of getting spark session costs too much time.

        The pic 87 shows that get spark session costs more than 63s, and the execution of checking sql costs more than 2s, it is not friendly for saving model first time after node launched, and it is also unreasonable.

        So I suggest that the job node's process of  spark session initialization should be consistent with the query node, it means that the spark session should be initialized as the node just started. And the spark session is a singleton model, just need once initialization, as the pic 88 display.

      Attachments

        1. image (87).png
          553 kB
          Li Can
        2. image (88).png
          678 kB
          Li Can

        Activity

          People

            cli2 Li Can
            cli2 Li Can
            Votes:
            0 Vote for this issue
            Watchers:
            3 Start watching this issue

            Dates

              Created:
              Updated:
              Resolved: