Flume / FLUME-2128

HDFS Sink rollSize is calculated from the uncompressed size of cumulative events

Details

    • Type: Bug
    • Status: Patch Available
    • Priority: Major
    • Resolution: Unresolved
    • Affects Version/s: 1.4.0, 1.3.1
    • Fix Version/s: None
    • Component/s: Sinks+Sources
    • I made the fix to trunk

    Description

      The HDFS sink's rollSize parameter is compared against the cumulative uncompressed size of the events written.
      The net effect is that if you use compression and expect files on HDFS to be rolled at the size set for rollSize, your files will be much smaller than that, because the threshold is reached on uncompressed bytes and the data shrinks when compressed.

      When compression is enabled, the sink should take it into account and roll based on the compressed size on HDFS.
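
The difference between the two measurements can be sketched in plain Java. This is only an illustration under stated assumptions, not the attached patch and not Flume's actual code: `CompressedRollSketch`, `CountingOutputStream`, `eventsUntilRoll` and `BATCH_SIZE` are hypothetical names, gzip from `java.util.zip` stands in for whatever codec the sink is configured with, and an in-memory buffer stands in for the HDFS file.

```java
import java.io.ByteArrayOutputStream;
import java.io.FilterOutputStream;
import java.io.IOException;
import java.io.OutputStream;
import java.nio.charset.StandardCharsets;
import java.util.zip.GZIPOutputStream;

public class CompressedRollSketch {

    /** Hypothetical helper: counts the bytes that actually reach the underlying file. */
    static final class CountingOutputStream extends FilterOutputStream {
        private long count;

        CountingOutputStream(OutputStream out) {
            super(out);
        }

        @Override
        public void write(int b) throws IOException {
            out.write(b);
            count++;
        }

        @Override
        public void write(byte[] b, int off, int len) throws IOException {
            out.write(b, off, len);
            count += len;
        }

        long getCount() {
            return count;
        }
    }

    // Assumed batch size; the roll check runs once per flushed batch.
    static final int BATCH_SIZE = 100;

    /**
     * Writes synthetic events until the measured size reaches rollSize and
     * returns how many events were written before the file would roll.
     */
    public static int eventsUntilRoll(long rollSize, boolean measureCompressed) throws IOException {
        // The in-memory buffer stands in for the file on HDFS.
        CountingOutputStream file = new CountingOutputStream(new ByteArrayOutputStream());
        long uncompressedBytes = 0;
        int events = 0;
        // syncFlush = true so that flush() pushes pending compressed bytes to the file.
        try (GZIPOutputStream gzip = new GZIPOutputStream(file, true)) {
            while (true) {
                for (int i = 0; i < BATCH_SIZE; i++) {
                    byte[] body = ("event " + events + " level=INFO msg=heartbeat\n")
                            .getBytes(StandardCharsets.UTF_8);
                    gzip.write(body);
                    uncompressedBytes += body.length; // the size the issue says is compared today
                    events++;
                }
                gzip.flush();
                long measured = measureCompressed ? file.getCount() : uncompressedBytes;
                if (measured >= rollSize) {
                    return events;
                }
            }
        }
    }

    public static void main(String[] args) throws IOException {
        long rollSize = 64 * 1024;
        System.out.println("events before roll (uncompressed count): " + eventsUntilRoll(rollSize, false));
        System.out.println("events before roll (compressed count):   " + eventsUntilRoll(rollSize, true));
    }
}
```

With repetitive event bodies like these, counting compressed bytes lets many more events into a file before it rolls, which is the behavior the description asks for. One caveat of this sketch: compressed bytes only become visible to the counter after a flush, so a real implementation would have to decide when it can afford to flush the codec.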

      Attachments

        1. FLUME-2128-0.patch
          16 kB
          Theodore michael Malaska
        2. FLUME-2128-1.patch
          17 kB
          Theodore michael Malaska
        3. FLUME-2128-2.patch
          20 kB
          Theodore michael Malaska

            People

              Assignee: ted.m Theodore michael Malaska
              Reporter: jlord Jeff Lord
              Votes: 2
              Watchers: 6

              Dates

                Created:
                Updated: