Tika
  1. Tika
  2. TIKA-656

Outlook dates using the wrong metadata key

    Details

    • Type: Bug Bug
    • Status: Closed
    • Priority: Major Major
    • Resolution: Fixed
    • Affects Version/s: 0.9
    • Fix Version/s: 0.10
    • Component/s: parser
    • Labels:
      None

      Description

      Currently, the Outlook extractor fetches the "Accepted By Mail Server" date from POI, and then saves this into Metadata.EDIT_TIME and Metadata.LAST_SAVED, neither of which look right, and neither of which are date properties.

      The rfc822 parser uses Metadata.CREATION_DATE, which is a Date property. The mbox parser uses Metadata.DATE, another (but different) Date property

      All three should probably use the same. I'd suggest that for now, they all output the same value to both CREATION_DATE and DATE

        Activity

        Jukka Zitting made changes -
        Status Resolved [ 5 ] Closed [ 6 ]
        Nick Burch made changes -
        Field Original Value New Value
        Status Open [ 1 ] Resolved [ 5 ]
        Fix Version/s 1.0 [ 12313535 ]
        Resolution Fixed [ 1 ]
        Hide
        Nick Burch added a comment -

        Fixed - the three mail parsers now all output their dates as proper ISO8601 formatted, as Metadata.DATE and Metadata.CREATION_DATE

        Also fixed a poifs date extraction as iso8601 issue too

        Show
        Nick Burch added a comment - Fixed - the three mail parsers now all output their dates as proper ISO8601 formatted, as Metadata.DATE and Metadata.CREATION_DATE Also fixed a poifs date extraction as iso8601 issue too
        Nick Burch created issue -

          People

          • Assignee:
            Nick Burch
            Reporter:
            Nick Burch
          • Votes:
            0 Vote for this issue
            Watchers:
            0 Start watching this issue

            Dates

            • Created:
              Updated:
              Resolved:

              Development