Tika
  1. Tika
  2. TIKA-737

Use (Incubating) ODFToolkit to improve ODF file format processing

    Details

    • Type: Improvement Improvement
    • Status: Open
    • Priority: Major Major
    • Resolution: Unresolved
    • Affects Version/s: 0.10
    • Fix Version/s: None
    • Component/s: parser
    • Labels:
      None

      Description

      Currently, we have our own ODF Parser code, which is based on SAX parsing of the content and meta parts. It covers all the common parts, but is by no means complete

      The ODF Toolkit project has recently joined the Apache Incubator, and is working towards its first release. Once there's an incubating version, we should re-write the parser to delegate most of the work to ODF Toolkit.

        Issue Links

          Activity

          Hide
          Chris A. Mattmann added a comment -

          Agreed, Mike and Nick, this is great!

          Show
          Chris A. Mattmann added a comment - Agreed, Mike and Nick, this is great!
          Hide
          Michael McCandless added a comment -

          +1, sounds great!

          Show
          Michael McCandless added a comment - +1, sounds great!

            People

            • Assignee:
              Unassigned
              Reporter:
              Nick Burch
            • Votes:
              2 Vote for this issue
              Watchers:
              0 Start watching this issue

              Dates

              • Created:
                Updated:

                Development