TIKA-816

(XLS/XLSX) Improperly formatted date/time in text content.

    Details

    • Type: Bug
    • Status: Resolved
    • Priority: Major
    • Resolution: Fixed
    • Affects Version/s: 1.0
    • Fix Version/s: 1.2
    • Component/s: general
    • Labels:
      None
    • Environment:

      Win7-64 + java version "1.6.0_26"

      Description

      Improperly formatted text content for XLS and XLSX files.

      Dates and times are extracted as floating-point numbers rather than as formatted date/time values. This occurs for cells whose content is "=now()" or "=today()".
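
For context, the floating-point values in the extracted text are most likely Excel serial dates: the integer part counts days and the fraction is the time of day. The following is a minimal sketch of that mapping, assuming the workbook uses the 1900 date system. The class and method names are hypothetical and are not part of Tika or POI, and it uses java.time, which requires a newer JDK than the one listed under Environment.

```java
import java.time.LocalDate;
import java.time.LocalDateTime;

// Hypothetical helper for illustration only; not Tika or POI code.
final class ExcelSerialDate {
    // Day zero that makes serials >= 61 line up with real calendar dates.
    // It sits one day before 1899-12-31 to absorb Excel's fictitious 1900-02-29,
    // so serials below 61 would come out one day early with this sketch.
    private static final LocalDate DAY_ZERO = LocalDate.of(1899, 12, 30);

    // Converts a serial such as 40909.5 into a date/time value.
    static LocalDateTime toDateTime(double serial) {
        long days = (long) Math.floor(serial);
        // The fractional part is the elapsed share of a 24-hour day.
        long seconds = Math.round((serial - days) * 86_400);
        return DAY_ZERO.atStartOfDay().plusDays(days).plusSeconds(seconds);
    }
}
```

With this sketch, `ExcelSerialDate.toDateTime(40909.5)` yields 2012-01-01T12:00, which is the kind of value a reader would expect in the text content in place of the raw number.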

          Activity

          Albert L. added a comment -

          XLS files seem to work when calling text extraction via HSSF from POI v3.8 beta 5.

          XLSX files seem to still FAIL when calling text extraction via XSSF from POI v3.8 beta 5.

          Albert L. added a comment -

          Bug 52369 - XLSX: text extraction malformed "=NOW()" and "=TODAY()" cells
          https://issues.apache.org/bugzilla/show_bug.cgi?id=52369

          Nick Burch added a comment -

          Now that POI bug #52369 is fixed, we should get the XLSX fix with the next POI upgrade.

          For the XLS side, we weren't formatting formula cells. I've fixed this in r1221119.

          Chris A. Mattmann added a comment -
          • push out to 1.2
          Nick Burch added a comment -

          As of r1309005 we've upgraded to POI 3.8 Final, which includes the required fixes.


            People

            • Assignee:
              Unassigned
              Reporter:
              Albert L.