  1. Hadoop Common
  2. HADOOP-4462

Rolling mechanism for demux output

Details

    • Type: Improvement
    • Status: Resolved
    • Priority: Major
    • Resolution: Fixed

    Description

      To reduce the number of files on HDFS, we need a rolling mechanism for the demux output:

      • avoid immediate merging if a file already exists for the same time range; create a spill file instead
      • merge all raw files every hour
      • merge all hourly files every day
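
      The rolling steps above could be sketched roughly as follows. This is only an illustration, not the code attached to this issue: the class name DemuxRollingSketch, the ".spillN" suffix, and the helper methods are all hypothetical, and plain in-memory lists stand in for HDFS directory listings.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.Map;
import java.util.TreeMap;

// Hypothetical sketch of the rolling policy; no Hadoop APIs are used.
public class DemuxRollingSketch {
    static final long HOUR_MS = 60L * 60 * 1000;
    static final long DAY_MS = 24 * HOUR_MS;

    /** Start of the time bucket containing timestampMs (assumes timestampMs >= 0). */
    static long bucketStart(long timestampMs, long bucketMs) {
        return timestampMs - (timestampMs % bucketMs);
    }

    /**
     * Picks the name for new demux output. If no file exists yet for the
     * time range, the base name is used; otherwise the next free spill
     * name is returned so that no immediate merge is needed.
     * The ".spillN" naming is an assumption made for this sketch.
     */
    static String outputName(String baseName, List<String> existing) {
        if (!existing.contains(baseName)) {
            return baseName;
        }
        int n = 1;
        while (existing.contains(baseName + ".spill" + n)) {
            n++;
        }
        return baseName + ".spill" + n;
    }

    /**
     * Groups file timestamps by bucket. With HOUR_MS this yields the raw
     * files to merge per hour; with DAY_MS, the hourly files to merge per day.
     */
    static Map<Long, List<Long>> groupByBucket(List<Long> timestamps, long bucketMs) {
        Map<Long, List<Long>> groups = new TreeMap<>();
        for (long ts : timestamps) {
            groups.computeIfAbsent(bucketStart(ts, bucketMs), k -> new ArrayList<>()).add(ts);
        }
        return groups;
    }
}
```

      Under these assumptions, an hourly job would call groupByBucket with HOUR_MS and merge each group (including any spill files) into one hourly file, and a daily job would do the same over the hourly files with DAY_MS.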

            People

              Assignee: Jerome Boulon (jboulon)
              Reporter: Jerome Boulon (jboulon)