Uploaded image for project: 'Kylin'
  1. Kylin
  2. KYLIN-2764

Build the dict for UHC column with MR

    XMLWordPrintableJSON

Details

    • Improvement
    • Status: Closed
    • Major
    • Resolution: Fixed
    • v2.0.0
    • v2.3.0
    • Job Engine
    • None

    Description

      KYLIN-2217 has built dict for normal column with MR, but the UHC column still build dict in JobServer. Like KYLIN-2217, we also could use MR build dict for UHC column. which could thoroughly release the memory pressure and improve job concurrent for JobServer as well as speed up multi UHC columns procedure.

      The MR input is the output of "Extract Fact Table Distinct Columns", the MR output is the UHC column dict. Because it is very hard build global dict with multi reducers, I use one reducer handle one UHC column and allocate enough memory to the reducer. According to my test, 8G memory is enough.

      Attachments

        1. job-memory-after.png
          448 kB
          Kaisen Kang
        2. job-memory-before.png
          121 kB
          Kaisen Kang

        Issue Links

          Activity

            People

              kangkaisen Kaisen Kang
              kangkaisen Kaisen Kang
              Votes:
              0 Vote for this issue
              Watchers:
              4 Start watching this issue

              Dates

                Created:
                Updated:
                Resolved: