Tika
  1. Tika
  2. TIKA-521

OutOfMemoryError Parsing XSLX File

    Details

    • Type: Bug Bug
    • Status: Closed
    • Priority: Major Major
    • Resolution: Fixed
    • Affects Version/s: 0.7, 0.8
    • Fix Version/s: 0.10
    • Component/s: parser
    • Labels:
      None

      Description

      I have several XSLX files I'm trying to parse with Tika that are failing with an OutOfMemoryError even when using a large heap size. For instance the attached 1.26MB excel file fails using a 512MB heap.

      1. memory-test.xlsx
        1.27 MB
        Stephen Duncan Jr
      2. tika-new-files.tar.bz2
        5 kB
        Sjoerd Smeets
      3. tika-diff.txt
        2 kB
        Sjoerd Smeets
      4. TikaExcelEventBasedExtraction.diff
        21 kB
        Nick Burch
      5. Out of memory issue in 1.0.jpg
        229 kB
        samraj
      6. Out of memory issue in 1.0.jpg
        229 kB
        samraj

        Activity

        No work has yet been logged on this issue.

          People

          • Assignee:
            Nick Burch
            Reporter:
            Stephen Duncan Jr
          • Votes:
            1 Vote for this issue
            Watchers:
            2 Start watching this issue

            Dates

            • Created:
              Updated:
              Resolved:

              Development