Tika
  1. Tika
  2. TIKA-923

iWork keynote content on master slides are not being parsed

    Details

    • Type: Bug Bug
    • Status: Resolved
    • Priority: Critical Critical
    • Resolution: Fixed
    • Affects Version/s: 1.0
    • Fix Version/s: 1.2
    • Component/s: parser
    • Labels:
    • Environment:

      Windows 7, 64 bit

      Description

      iWork Keynote slides can contain tables, however these are being dropped entirely by the Tika parser.

      1. TIKA-923.patch
        4 kB
        Michael McCandless
      2. testKeynoteTemplateTable.key
        5.40 MB
        Erik Peterson
      3. testTables.key
        211 kB
        Michael McCandless

        Activity

        Hide
        Michael McCandless added a comment -

        I created this simple test case (attached) but I see the text inside the table cells being correctly extracted on Tika's current trunk (rev 1340046). I'll commit this as a test case...

        Erik can you attach an example Keynote table that doesn't extract correctly? Thanks.

        Show
        Michael McCandless added a comment - I created this simple test case (attached) but I see the text inside the table cells being correctly extracted on Tika's current trunk (rev 1340046). I'll commit this as a test case... Erik can you attach an example Keynote table that doesn't extract correctly? Thanks.
        Hide
        Erik Peterson added a comment -

        A table on slides 5,6, & 7 are not parsing

        Show
        Erik Peterson added a comment - A table on slides 5,6, & 7 are not parsing
        Hide
        Michael McCandless added a comment -

        OK, I see: the table is defined on the master slide ... so we are not extracting user-created items from master slides ... I'll dig.

        Show
        Michael McCandless added a comment - OK, I see: the table is defined on the master slide ... so we are not extracting user-created items from master slides ... I'll dig.
        Hide
        Michael McCandless added a comment -

        Erik it looks like you forgot to check the "grant ASF license" box... can you do that, so I can use this as a test file? Thanks.

        Show
        Michael McCandless added a comment - Erik it looks like you forgot to check the "grant ASF license" box... can you do that, so I can use this as a test file? Thanks.
        Hide
        Erik Peterson added a comment -

        I forgot to check the license box, so I now grant ASF License for the attachment 'testKeynoteTemplateTable.key

        Show
        Erik Peterson added a comment - I forgot to check the license box, so I now grant ASF License for the attachment 'testKeynoteTemplateTable.key
        Hide
        Michael McCandless added a comment -

        Patch, also visiting master slides to extract content...

        Show
        Michael McCandless added a comment - Patch, also visiting master slides to extract content...

          People

          • Assignee:
            Michael McCandless
            Reporter:
            Erik Peterson
          • Votes:
            0 Vote for this issue
            Watchers:
            2 Start watching this issue

            Dates

            • Created:
              Updated:
              Resolved:

              Development