Uploaded image for project: 'Kylin'
  1. Kylin
  2. KYLIN-3463

Improve optimize job by avoiding creating empty output files on HDFS

VotersWatch issueWatchersLinkCloneUpdate Comment AuthorReplace String in CommentUpdate Comment VisibilityDelete Comments
    XMLWordPrintableJSON

    Details

    • Type: Improvement
    • Status: Closed
    • Priority: Minor
    • Resolution: Fixed
    • Affects Version/s: None
    • Fix Version/s: v2.4.1, v2.5.0
    • Component/s: None
    • Labels:
      None

      Description

      For steps in an optimize job, like FilterRecommendCuboidDataJob & UpdateOldCuboidShardJob, MultipleOutputs is used to output data into different directories. Therefore, the default output file is empty. LazyOutputFormat should be used to prevent to create zero-sized default output.

        Attachments

          Activity

            People

            • Assignee:
              yaho Zhong Yanghong
              Reporter:
              yaho Zhong Yanghong

              Dates

              • Created:
                Updated:
                Resolved:

                Issue deployment