Details
-
Improvement
-
Status: Closed
-
Minor
-
Resolution: Fixed
-
None
-
None
Description
In dev mail list's discussion, I suggest to enable shrunken dictionary by default, and received some dev's aggrement.
When using bitmap measure on a large cardinality column(require global dictionaty), build base cuboid step need frequent cache swap so it cannot finished within a reasonable period.
When shrunken dictionary enabled, a new step will be added to build separated dictionary for each `InputSplit`, Mapper of *BuildBaseCuboid* step only has to fetch a smaller dictionary for itself, instead of a larger global dictionary. It will reduce cache swap and make *BuildBaseCuboid* step run as quicker as possible.
Attachments
Attachments
Issue Links
- is related to
-
KYLIN-4619 Make shrunken dict able to coexist with mr-hive global dict
- Closed
- links to