Uploaded image for project: 'Tika'
  1. Tika
  2. TIKA-1187

java.lang.OutOfMemoryError: Java heap space

    XMLWordPrintableJSON

    Details

    • Type: Bug
    • Status: Closed
    • Priority: Critical
    • Resolution: Cannot Reproduce
    • Affects Version/s: 1.3
    • Fix Version/s: None
    • Component/s: general
    • Labels:
      None
    • Environment:

      Ubuntu

      Description

      Hi,

      While parsing the content we are getting below exception in parse method.
      The file which we are parsing is 1 mb.

      TIKA JAR: tika-core-1.3.jar
      File size: 1 MB.

      Parser parser = new AutoDetectParser();
      parser.parse(is, handler, metaData, new ParseContext());

      java.lang.OutOfMemoryError: Java heap space
      at java.util.Arrays.copyOf(Arrays.java:2734)
      at java.util.ArrayList.ensureCapacity(ArrayList.java:167)
      at java.util.ArrayList.add(ArrayList.java:351)
      at org.apache.fontbox.ttf.GlyfCompositeDescript.(GlyfCompositeDescript.java:60)
      at org.apache.fontbox.ttf.GlyphData.initData(GlyphData.java:63)
      at org.apache.fontbox.ttf.GlyphTable.initData(GlyphTable.java:71)
      at org.apache.fontbox.ttf.AbstractTTFParser.parseTables(AbstractTTFParser.java:163)
      at org.apache.fontbox.ttf.TTFParser.parseTables(TTFParser.java:61)
      at org.apache.fontbox.ttf.AbstractTTFParser.parseTTF(AbstractTTFParser.java:90)
      at org.apache.fontbox.ttf.TTFParser.parseTTF(TTFParser.java:26)
      at org.apache.fontbox.ttf.AbstractTTFParser.parseTTF(AbstractTTFParser.java:66)
      at org.apache.fontbox.ttf.TTFParser.parseTTF(TTFParser.java:26)
      at org.apache.tika.parser.font.TrueTypeParser.parse(TrueTypeParser.java:65)
      at org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:242)
      at org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:242)
      at org.apache.tika.parser.AutoDetectParser.parse(AutoDetectParser.java:120)
      at com.impetus.vajra.parser.tika.TikaParser.processContent(TikaParser.java:96)
      at com.impetus.vajra.storm.helper.TextAnalyserBoltHelper.execute(TextAnalyserBoltHelper.java:283)
      at com.impetus.vajra.storm.TextAnalyserBolt.execute(TextAnalyserBolt.java:182)
      at backtype.storm.daemon.executor$fn_4050$tuple_action_fn_4052.invoke(executor.clj:566)
      at backtype.storm.daemon.executor$mk_task_receiver$fn__3976.invoke(executor.clj:345)
      at backtype.storm.disruptor$clojure_handler$reify__1606.onEvent(disruptor.clj:43)
      at backtype.storm.utils.DisruptorQueue.consumeBatchToCursor(DisruptorQueue.java:84)
      at backtype.storm.utils.DisruptorQueue.consumeBatchWhenAvailable(DisruptorQueue.java:58)
      at backtype.storm.disruptor$consume_batch_when_available.invoke(disruptor.clj:62)
      at backtype.storm.daemon.executor$fn_4050$fn4059$fn_4106.invoke(executor.clj:658)
      at backtype.storm.util$async_loop$fn__465.invoke(util.clj:377)
      at clojure.lang.AFn.run(AFn.java:24)
      at java.lang.Thread.run(Thread.java:662)

        Attachments

          Activity

            People

            • Assignee:
              Unassigned
              Reporter:
              Guffi GURFAN
            • Votes:
              0 Vote for this issue
              Watchers:
              4 Start watching this issue

              Dates

              • Created:
                Updated:
                Resolved:

                Time Tracking

                Estimated:
                Original Estimate - 612h
                612h
                Remaining:
                Remaining Estimate - 612h
                612h
                Logged:
                Time Spent - Not Specified
                Not Specified