Tika
  1. Tika
  2. TIKA-923

iWork keynote content on master slides are not being parsed

    Details

    • Type: Bug Bug
    • Status: Resolved
    • Priority: Critical Critical
    • Resolution: Fixed
    • Affects Version/s: 1.0
    • Fix Version/s: 1.2
    • Component/s: parser
    • Labels:
    • Environment:

      Windows 7, 64 bit

      Description

      iWork Keynote slides can contain tables, however these are being dropped entirely by the Tika parser.

      1. testTables.key
        211 kB
        Michael McCandless
      2. testKeynoteTemplateTable.key
        5.40 MB
        Erik Peterson
      3. TIKA-923.patch
        4 kB
        Michael McCandless

        Activity

        Hide
        Michael McCandless added a comment -

        I created this simple test case (attached) but I see the text inside the table cells being correctly extracted on Tika's current trunk (rev 1340046). I'll commit this as a test case...

        Erik can you attach an example Keynote table that doesn't extract correctly? Thanks.

        Show
        Michael McCandless added a comment - I created this simple test case (attached) but I see the text inside the table cells being correctly extracted on Tika's current trunk (rev 1340046). I'll commit this as a test case... Erik can you attach an example Keynote table that doesn't extract correctly? Thanks.
        Hide
        Erik Peterson added a comment -

        A table on slides 5,6, & 7 are not parsing

        Show
        Erik Peterson added a comment - A table on slides 5,6, & 7 are not parsing
        Hide
        Michael McCandless added a comment -

        OK, I see: the table is defined on the master slide ... so we are not extracting user-created items from master slides ... I'll dig.

        Show
        Michael McCandless added a comment - OK, I see: the table is defined on the master slide ... so we are not extracting user-created items from master slides ... I'll dig.
        Hide
        Michael McCandless added a comment -

        Erik it looks like you forgot to check the "grant ASF license" box... can you do that, so I can use this as a test file? Thanks.

        Show
        Michael McCandless added a comment - Erik it looks like you forgot to check the "grant ASF license" box... can you do that, so I can use this as a test file? Thanks.
        Hide
        Erik Peterson added a comment -

        I forgot to check the license box, so I now grant ASF License for the attachment 'testKeynoteTemplateTable.key

        Show
        Erik Peterson added a comment - I forgot to check the license box, so I now grant ASF License for the attachment 'testKeynoteTemplateTable.key
        Hide
        Michael McCandless added a comment -

        Patch, also visiting master slides to extract content...

        Show
        Michael McCandless added a comment - Patch, also visiting master slides to extract content...

          People

          • Assignee:
            Michael McCandless
            Reporter:
            Erik Peterson
          • Votes:
            0 Vote for this issue
            Watchers:
            2 Start watching this issue

            Dates

            • Created:
              Updated:
              Resolved:

              Development