Details
-
Bug
-
Status: Closed
-
Critical
-
Resolution: Cannot Reproduce
-
1.3
-
None
-
None
-
Ubuntu
Description
Hi,
While parsing the content we are getting below exception in parse method.
The file which we are parsing is 1 mb.
TIKA JAR: tika-core-1.3.jar
File size: 1 MB.
Parser parser = new AutoDetectParser();
parser.parse(is, handler, metaData, new ParseContext());
java.lang.OutOfMemoryError: Java heap space
at java.util.Arrays.copyOf(Arrays.java:2734)
at java.util.ArrayList.ensureCapacity(ArrayList.java:167)
at java.util.ArrayList.add(ArrayList.java:351)
at org.apache.fontbox.ttf.GlyfCompositeDescript.(GlyfCompositeDescript.java:60)
at org.apache.fontbox.ttf.GlyphData.initData(GlyphData.java:63)
at org.apache.fontbox.ttf.GlyphTable.initData(GlyphTable.java:71)
at org.apache.fontbox.ttf.AbstractTTFParser.parseTables(AbstractTTFParser.java:163)
at org.apache.fontbox.ttf.TTFParser.parseTables(TTFParser.java:61)
at org.apache.fontbox.ttf.AbstractTTFParser.parseTTF(AbstractTTFParser.java:90)
at org.apache.fontbox.ttf.TTFParser.parseTTF(TTFParser.java:26)
at org.apache.fontbox.ttf.AbstractTTFParser.parseTTF(AbstractTTFParser.java:66)
at org.apache.fontbox.ttf.TTFParser.parseTTF(TTFParser.java:26)
at org.apache.tika.parser.font.TrueTypeParser.parse(TrueTypeParser.java:65)
at org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:242)
at org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:242)
at org.apache.tika.parser.AutoDetectParser.parse(AutoDetectParser.java:120)
at com.impetus.vajra.parser.tika.TikaParser.processContent(TikaParser.java:96)
at com.impetus.vajra.storm.helper.TextAnalyserBoltHelper.execute(TextAnalyserBoltHelper.java:283)
at com.impetus.vajra.storm.TextAnalyserBolt.execute(TextAnalyserBolt.java:182)
at backtype.storm.daemon.executor$fn_4050$tuple_action_fn_4052.invoke(executor.clj:566)
at backtype.storm.daemon.executor$mk_task_receiver$fn__3976.invoke(executor.clj:345)
at backtype.storm.disruptor$clojure_handler$reify__1606.onEvent(disruptor.clj:43)
at backtype.storm.utils.DisruptorQueue.consumeBatchToCursor(DisruptorQueue.java:84)
at backtype.storm.utils.DisruptorQueue.consumeBatchWhenAvailable(DisruptorQueue.java:58)
at backtype.storm.disruptor$consume_batch_when_available.invoke(disruptor.clj:62)
at backtype.storm.daemon.executor$fn_4050$fn4059$fn_4106.invoke(executor.clj:658)
at backtype.storm.util$async_loop$fn__465.invoke(util.clj:377)
at clojure.lang.AFn.run(AFn.java:24)
at java.lang.Thread.run(Thread.java:662)