Tika / TIKA-1100

Cannot extract text from text boxes in Excel 2007 files (.xlsx, .xlsm)

    Details

    • Type: Bug
    • Status: Closed
    • Priority: Major
    • Resolution: Fixed
    • Affects Version/s: 1.3
    • Fix Version/s: 1.5
    • Component/s: parser
    • Labels: None
    • Environment: Windows 7 64-bit

      Description

      When I launch the Tika GUI from the command line and drag and drop an .xlsx file that contains a text box, none of the text in the text box is extracted.

      When I drag and drop an .xls file, the text in the text box is extracted.
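
To confirm that a given .xlsx really carries text-box text before blaming the parser, the file can be inspected without Tika or POI. The sketch below is only a rough probe, not how Tika or POI extract text. It assumes that text boxes are stored as DrawingML text runs (`<a:t>` elements) in parts under `xl/drawings/`, and that the conventional `a:` namespace prefix is used. The class name `XlsxTextBoxProbe` and the method `textBoxRuns` are made up for this illustration. XML entities are not decoded.

```java
import java.io.IOException;
import java.io.InputStream;
import java.nio.charset.StandardCharsets;
import java.nio.file.Files;
import java.nio.file.Paths;
import java.util.ArrayList;
import java.util.List;
import java.util.regex.Matcher;
import java.util.regex.Pattern;
import java.util.zip.ZipEntry;
import java.util.zip.ZipInputStream;

// Hypothetical helper: lists raw text runs found in the drawing parts of an .xlsx package.
public class XlsxTextBoxProbe {

    // Assumption: DrawingML text runs use the conventional "a:" prefix, e.g. <a:t>text</a:t>.
    private static final Pattern TEXT_RUN =
            Pattern.compile("<a:t(?:\\s[^>]*)?>([^<]*)</a:t>");

    public static List<String> textBoxRuns(InputStream xlsx) throws IOException {
        List<String> runs = new ArrayList<>();
        try (ZipInputStream zip = new ZipInputStream(xlsx)) {
            ZipEntry entry;
            while ((entry = zip.getNextEntry()) != null) {
                String name = entry.getName();
                // Only look at drawing parts; worksheet and relationship parts are skipped.
                if (!name.startsWith("xl/drawings/") || !name.endsWith(".xml")) {
                    continue;
                }
                // readAllBytes() stops at the end of the current zip entry.
                String xml = new String(zip.readAllBytes(), StandardCharsets.UTF_8);
                Matcher m = TEXT_RUN.matcher(xml);
                while (m.find()) {
                    runs.add(m.group(1));
                }
            }
        }
        return runs;
    }

    public static void main(String[] args) throws IOException {
        if (args.length != 1) {
            System.err.println("usage: java XlsxTextBoxProbe <file.xlsx>");
            return;
        }
        try (InputStream in = Files.newInputStream(Paths.get(args[0]))) {
            textBoxRuns(in).forEach(System.out::println);
        }
    }
}
```

If this probe prints the text-box contents but the Tika GUI shows none of it, the text is present in the file and the gap is on the extraction side.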

        Activity

        Tim Allison added a comment -

        Waiting for improvements in POI-55292. Will make Tika-side upgrades when the next version of POI is released.

        Reference: http://issues.apache.org/bugzilla/show_bug.cgi?id=55292

        Tim Allison added a comment -

        Simple example file attached for now. Will fill out with test cases when POI is ready.

        Kazuaki Matsuba added a comment -

        Thanks, Tim

        I'll wait until the next version of POI is bundled in Tika.

        Tim Allison added a comment -

        Updated XSSFExcelExtractorDecorator and added test as of r1526489.

        Tim Allison added a comment -

        r1526498


          People

          • Assignee: Unassigned
          • Reporter: Kazuaki Matsuba
          • Votes: 0
          • Watchers: 2