Uploaded image for project: 'Tika'
  1. Tika
  2. TIKA-775

Embed Capabilities

Attach filesAttach ScreenshotVotersWatch issueWatchersCreate sub-taskLinkCloneUpdate Comment AuthorReplace String in CommentUpdate Comment VisibilityDelete Comments
    XMLWordPrintableJSON

Details

    • Improvement
    • Status: Resolved
    • Major
    • Resolution: Fixed
    • 1.0
    • 1.3
    • general, metadata
    • The default ExternalEmbedder requires that sed be installed.

    Description

      This patch defines and implements the concept of embedding tika metadata into a file stream, the reverse of extraction.

      In the tika-core project an interface defining an Embedder and a generic sed ExternalEmbedder implementation meant to be extended or configured are added. These classes are essentially a reverse flow of the existing Parser and ExternalParser classes.

      In the tika-parsers project an ExternalEmbedderTest unit test is added which uses the default ExternalEmbedder (calls sed) to embed a value placed in Metadata.DESCRIPTION then verify the operation by parsing the resulting stream.

      Attachments

        1. embed_20121029.diff
          36 kB
          Ray Gauss II
        2. embed.diff
          33 kB
          Ray Gauss II
        3. tika-core-embed-patch.txt
          19 kB
          Ray Gauss II
        4. tika-parsers-embed-patch.txt
          8 kB
          Ray Gauss II

        Issue Links

        Activity

          This comment will be Viewable by All Users Viewable by All Users
          Cancel

          People

            rgauss Ray Gauss II
            rgauss Ray Gauss II
            Votes:
            0 Vote for this issue
            Watchers:
            5 Start watching this issue

            Dates

              Created:
              Updated:
              Resolved:

              Slack

                Issue deployment