Uploaded image for project: 'Tika'
  1. Tika
  2. TIKA-3077

OneNote parser - very inefficient when parsing OneNote <= 2007 files

Agile BoardAttach filesAttach ScreenshotVotersStop watchingWatchersCreate sub-taskLinkCloneUpdate Comment AuthorReplace String in CommentUpdate Comment VisibilityDelete Comments
    XMLWordPrintableJSON

Details

    • Improvement
    • Status: Resolved
    • Major
    • Resolution: Fixed
    • None
    • 1.24.1
    • core
    • None

    Description

      The code I put in place for OneNote 2007 files is horribly inefficient. I hadn't realized that the OneNoteDirectFileResource that I extracted from another parser was not buffering the bytes. So every time I did a set position, it was very expensive.

      The fix is to buffer the bytes into chunks and operate them instead.

      Attachments

        Activity

          This comment will be Viewable by All Users Viewable by All Users
          Cancel

          People

            Unassigned Unassigned
            ndipiazza_gmail Nicholas DiPiazza
            Votes:
            0 Vote for this issue
            Watchers:
            3 Stop watching this issue

            Dates

              Created:
              Updated:
              Resolved:

              Slack

                Issue deployment