Uploaded image for project: 'Kylin'
  1. Kylin
  2. KYLIN-1403

Kylin Hive Column Cardinality Job unable to read bucketed table

    XMLWordPrintableJSON

    Details

    • Type: Bug
    • Status: Closed
    • Priority: Major
    • Resolution: Not A Problem
    • Affects Version/s: v1.2, v1.3.0
    • Fix Version/s: None
    • Component/s: None
    • Labels:
    • Environment:
      - Tested against apache-kylin-1.2-HBase1.1-incubating-SNAPSHOT-bin and apache-kylin-1.3-HBase-1.1-SNAPSHOT-bin
      - Environment is HDP 2.3.4
      - Hive version: hive-1.2.1.2.3.4.0
      - HBase version: HBase 1.1.2.2.3.4.0-3485

      Description

      This issue is connected with https://issues.apache.org/jira/browse/KYLIN-1402 and states the findings while investigating on the StringIndexOutOfBoundsException.

      While trying to find out why the outputfile created in the cardinality job is empty, we discovered that the only difference between this non-working job and all our other jobs (which work without problems), is that the underlying table is bucketed.

      The data folder is dbfolder/db/table/partition/bucketfolder/file
      Kylin checks for data in dbfolder/db/table/partition and so is unable to find the data.

        Attachments

          Issue Links

            Activity

              People

              • Assignee:
                gwang3 Wang, Gang
                Reporter:
                hd2 Sebastian Zimmermann
              • Votes:
                0 Vote for this issue
                Watchers:
                4 Start watching this issue

                Dates

                • Created:
                  Updated:
                  Resolved: