Details
Type: Bug
Status: Resolved
Priority: Major
Resolution: Duplicate
Affects Version/s: 1.14
Fix Version/s: None
Component/s: None
Environment: Windows 7 x64, JVM 1.8.0_101
Description
The attached file, which opens in Excel, errors out in Tika:
java.lang.IllegalArgumentException: Cannot format given Object as a Number
at java.text.DecimalFormat.format:-1
at java.text.Format.format:-1
at org.apache.poi.ss.usermodel.DataFormatter.performDateFormatting:736
at org.apache.poi.ss.usermodel.DataFormatter.formatRawCellContents:804
at org.apache.poi.ss.usermodel.DataFormatter.formatRawCellContents:785
at org.apache.poi.hssf.eventusermodel.FormatTrackingHSSFListener.formatNumberDateCell:143
at org.apache.tika.parser.microsoft.ExcelExtractor$TikaHSSFListener$TikaFormatTrackingHSSFListener.formatNumberDateCell:633
at org.apache.tika.parser.microsoft.ExcelExtractor$TikaHSSFListener.internalProcessRecord:432
at org.apache.tika.parser.microsoft.ExcelExtractor$TikaHSSFListener.processRecord:336
at org.apache.poi.hssf.eventusermodel.FormatTrackingHSSFListener.processRecord:92
at org.apache.poi.hssf.eventusermodel.HSSFRequest.processRecord:109
at org.apache.poi.hssf.eventusermodel.HSSFEventFactory.genericProcessEvents:179
at org.apache.poi.hssf.eventusermodel.HSSFEventFactory.processEvents:136
at org.apache.tika.parser.microsoft.ExcelExtractor$TikaHSSFListener.processFile:312
at org.apache.tika.parser.microsoft.ExcelExtractor.parse:169
at org.apache.tika.parser.microsoft.OfficeParser.parse:177
at org.apache.tika.parser.microsoft.OfficeParser.parse:130
at gov.nih.niaid.fscanner.Extract.ExtractContents:69
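
The top two frames of the trace (java.text.Format.format calling into java.text.DecimalFormat.format) can be reproduced with the JDK alone. The following is a minimal sketch, not the Tika or POI code path: it assumes, without confirmation from this report, that a date value ends up being passed to a numeric format. The class name NumberFormatRepro and the helper failureMessage are hypothetical and exist only for illustration.

```java
import java.text.DecimalFormat;
import java.text.Format;
import java.util.Date;

public class NumberFormatRepro {
    // Formats the value through the generic Format.format(Object) entry point
    // and returns the exception message, or null if formatting succeeded.
    static String failureMessage(Format format, Object value) {
        try {
            format.format(value);
            return null;
        } catch (IllegalArgumentException e) {
            return e.getMessage();
        }
    }

    public static void main(String[] args) {
        // Assumption: a numeric format is used where a date format was expected.
        Format numeric = new DecimalFormat("0.00");

        // A Number is accepted, so no message is returned.
        System.out.println(failureMessage(numeric, 42.5));

        // A Date is not a Number, so DecimalFormat rejects it.
        System.out.println(failureMessage(numeric, new Date(0L)));
    }
}
```

If the assumption holds, the second call yields the same message as the report, which would point at a format/value type mismatch for this particular file rather than at a corrupt file.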
Attachments
Issue Links
- duplicates TIKA-2196 IllegalArgumentException on a valid Excel file (Open)