Uploaded image for project: 'Tika'
  1. Tika
  2. TIKA-2224

Mime magic for OneNote formats

    XMLWordPrintableJSON

    Details

    • Type: Improvement
    • Status: Open
    • Priority: Major
    • Resolution: Unresolved
    • Affects Version/s: 1.14
    • Fix Version/s: None
    • Component/s: mime
    • Labels:
      None

      Description

      As raised at http://stackoverflow.com/questions/41272195/onenote-support-for-apache-tika-parsers, we don't have any magic for the OneNote formats. Several years ago we dug out the file format specs (see http://lucene.472066.n3.nabble.com/Tika-OneNote-Support-td4020393.html), but didn't have volunteer energy to implement a parser. However, armed with those specs, we should be able to come up with some mime magic for detection

        Attachments

        1. Sample1.one
          352 kB
          Krishnan Narayan
        2. Sample1.json
          501 kB
          Nicholas DiPiazza
        3. note-ssn-test-mmmm.one
          30 kB
          Krishnan Narayan

          Activity

            People

            • Assignee:
              Unassigned
              Reporter:
              gagravarr Nick Burch
            • Votes:
              1 Vote for this issue
              Watchers:
              7 Start watching this issue

              Dates

              • Created:
                Updated: