Uploaded image for project: 'Flume'
  1. Flume
  2. FLUME-1850

HDFSSink lower hdfs.maxOpenFiles to 100

    XMLWordPrintableJSON

Details

    • Bug
    • Status: Open
    • Major
    • Resolution: Unresolved
    • 1.3.0
    • None
    • Node
    • None
    • RHEL 6

    Description

      We are using flume-1.3.0. After flume is up for a while (30 days+) we get OutOfMemory error. Our heap is set to 2G and load on the system is very low. Around 50 request/minute. We use AvroClient and long lived connection.

      Below is the stack trace. I don't have the heap dump but I plan to enable that for next time.

      13/01/16 09:09:38 ERROR hdfs.HDFSEventSink: process failed
      java.lang.OutOfMemoryError: Java heap space
      at java.util.Arrays.copyOf(Arrays.java:2786)
      at java.io.ByteArrayOutputStream.write(ByteArrayOutputStream.java:94)
      at java.io.DataOutputStream.write(DataOutputStream.java:90)
      at org.apache.hadoop.io.Text.write(Text.java:282)
      at org.apache.hadoop.io.serializer.WritableSerialization$WritableSerializer.serialize(WritableSerialization.java:90)
      at org.apache.hadoop.io.serializer.WritableSerialization$WritableSerializer.serialize(WritableSerialization.java:77)
      at org.apache.hadoop.io.SequenceFile$BlockCompressWriter.append(SequenceFile.java:1320)
      at org.apache.flume.sink.hdfs.HDFSSequenceFile.append(HDFSSequenceFile.java:72)
      at org.apache.flume.sink.hdfs.BucketWriter.append(BucketWriter.java:376)
      at org.apache.flume.sink.hdfs.HDFSEventSink$2.call(HDFSEventSink.java:729)
      at org.apache.flume.sink.hdfs.HDFSEventSink$2.call(HDFSEventSink.java:727)
      at java.util.concurrent.FutureTask$Sync.innerRun(FutureTask.java:303)
      at java.util.concurrent.FutureTask.run(FutureTask.java:138)
      at java.util.concurrent.ThreadPoolExecutor$Worker.runTask(ThreadPoolExecutor.java:886)
      at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:908)
      at java.lang.Thread.run(Thread.java:662)
      Exception in thread "SinkRunner-PollingRunner-DefaultSinkProcessor" java.lang.OutOfMemoryError: Java heap space
      at java.util.Arrays.copyOf(Arrays.java:2786)
      at java.io.ByteArrayOutputStream.write(ByteArrayOutputStream.java:94)
      at java.io.DataOutputStream.write(DataOutputStream.java:90)
      at org.apache.hadoop.io.Text.write(Text.java:282)
      at org.apache.hadoop.io.serializer.WritableSerialization$WritableSerializer.serialize(WritableSerialization.java:90)
      at org.apache.hadoop.io.serializer.WritableSerialization$WritableSerializer.serialize(WritableSerialization.java:77)
      at org.apache.hadoop.io.SequenceFile$BlockCompressWriter.append(SequenceFile.java:1320)
      at org.apache.flume.sink.hdfs.HDFSSequenceFile.append(HDFSSequenceFile.java:72)
      at org.apache.flume.sink.hdfs.BucketWriter.append(BucketWriter.java:376)
      at org.apache.flume.sink.hdfs.HDFSEventSink$2.call(HDFSEventSink.java:729)
      at org.apache.flume.sink.hdfs.HDFSEventSink$2.call(HDFSEventSink.java:727)
      at java.util.concurrent.FutureTask$Sync.innerRun(FutureTask.java:303)
      at java.util.concurrent.FutureTask.run(FutureTask.java:138)
      at java.util.concurrent.ThreadPoolExecutor$Worker.runTask(ThreadPoolExecutor.java:886)
      at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:908)
      at java.lang.Thread.run(Thread.java:662)

      Attachments

        1. HA_result.jpg
          316 kB
          Xiang Li
        2. Screen Shot 2013-01-16 at 11.05.55 PM.png
          101 kB
          Brock Noland
        3. flume-oo.docx
          261 kB
          Mohit Anchlia

        Activity

          People

            Unassigned Unassigned
            mohitanchlia Mohit Anchlia
            Votes:
            0 Vote for this issue
            Watchers:
            6 Start watching this issue

            Dates

              Created:
              Updated: