Kylin / KYLIN-2543

Still build dictionary for TopN group by column even using non-dict encoding

    Details

    • Type: Bug
    • Status: Closed
    • Priority: Major
    • Resolution: Fixed
    • Affects Version/s: v2.0.0
    • Fix Version/s: v2.0.0
    • Component/s: Job Engine
    • Labels:
      None

      Description

      Using the latest 2.0 beta, create a cube with a TopN measure and specify a non-dict encoding for its group-by column. The measure JSON looks like this:

      {
        "name": "TOP_CUSTOMER",
        "function": {
          "expression": "TOP_N",
          "parameter": {
            "type": "column",
            "value": "P_LINEORDER.V_REVENUE",
            "next_parameter": {
              "type": "column",
              "value": "P_LINEORDER.LO_CUSTKEY"
            }
          },
          "returntype": "topn(100,4)",
          "configuration": {
            "topn.encoding.P_LINEORDER.LO_CUSTKEY": "integer:4",
            "topn.encoding_version.P_LINEORDER.LO_CUSTKEY": "1"
          }
        }
      }

      However, in the third step, "Extract Fact Table Distinct Columns", a reducer still runs for "P_LINEORDER.LO_CUSTKEY". Because this column is ultra-high-cardinality (UHC), the reducer takes a long time and eventually fails.

      Expected behavior: no reducer is started for this column. 1.6.0 does not have this issue; it is a regression introduced in 2.0 during the upgrade to the snowflake model.
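
      The expected behavior can be illustrated with a minimal sketch. This is not Kylin's code: the function name `topn_columns_needing_dict` is hypothetical, and it assumes that the first parameter is the ranked numeric column, that chained `next_parameter` entries are the group-by columns, and that a missing `topn.encoding.<column>` entry means dictionary encoding. Only the key names are taken from the JSON above.

```python
import json

# Key prefix exactly as it appears in the measure JSON above.
TOPN_ENCODING_PREFIX = "topn.encoding."


def topn_columns_needing_dict(measure):
    """Return the TopN group-by columns that would still need a dictionary.

    Hypothetical helper for illustration only; not part of Kylin.
    """
    func = measure["function"]
    if func["expression"] != "TOP_N":
        return []
    config = func.get("configuration") or {}
    # Assumption: the first parameter is the numeric column being ranked;
    # the chained next_parameter entries are the group-by columns.
    param = func["parameter"].get("next_parameter")
    needing = []
    while param is not None:
        column = param["value"]
        # Assumption: a missing encoding entry means dictionary encoding.
        encoding = config.get(TOPN_ENCODING_PREFIX + column, "dict")
        # "integer:4" -> encoding name "integer"; only "dict" needs a dictionary.
        if encoding.split(":")[0] == "dict":
            needing.append(column)
        param = param.get("next_parameter")
    return needing


measure = json.loads("""
{
  "name": "TOP_CUSTOMER",
  "function": {
    "expression": "TOP_N",
    "parameter": {
      "type": "column",
      "value": "P_LINEORDER.V_REVENUE",
      "next_parameter": {
        "type": "column",
        "value": "P_LINEORDER.LO_CUSTKEY"
      }
    },
    "returntype": "topn(100,4)",
    "configuration": {
      "topn.encoding.P_LINEORDER.LO_CUSTKEY": "integer:4",
      "topn.encoding_version.P_LINEORDER.LO_CUSTKEY": "1"
    }
  }
}
""")

print(topn_columns_needing_dict(measure))  # prints []
```

      With the configuration from this report the result is empty, so under these assumptions no distinct-column reducer would be needed for "P_LINEORDER.LO_CUSTKEY".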

            People

            • Assignee: Shao Feng Shi (shaofengshi)
            • Reporter: Shao Feng Shi (shaofengshi)