TIKA-593

Tika network server

Details

    • Type: New Feature
    • Status: Resolved
    • Priority: Major
    • Resolution: Fixed
    • Affects Version/s: 0.10
    • Fix Version/s: 1.2
    • Component/s: general
    • Labels: None

    Description

      It would be cool to be able to run Tika as a network service that accepts a binary document as input and produces the extracted content (as XHTML, text, or just metadata) as output. This would be a bit like TIKA-169, but without the dependency on a servlet container.

      I'd like to be able to set up and run such a server like this:

      $ java -jar tika-app.jar --port 1234
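A minimal sketch of what such a standalone service could look like, using only JDK sockets. Everything here is an assumption for illustration: the class name ExtractionServerSketch, the one-document-per-connection protocol (the client half-closes the socket to mark the end of the document), and the pluggable extractor function that stands in for the actual Tika parsing call.

```java
import java.io.IOException;
import java.io.InputStream;
import java.io.OutputStream;
import java.net.ServerSocket;
import java.net.Socket;
import java.nio.charset.StandardCharsets;
import java.util.function.Function;

/** Hypothetical sketch: one document per connection, raw bytes in, UTF-8 text out. */
final class ExtractionServerSketch implements AutoCloseable {
    private final ServerSocket serverSocket;
    // Stand-in for the real parsing step (assumption, not Tika's API).
    private final Function<InputStream, String> extractor;

    ExtractionServerSketch(int port, Function<InputStream, String> extractor) throws IOException {
        this.serverSocket = new ServerSocket(port); // port 0 lets the OS pick a free port
        this.extractor = extractor;
    }

    int getPort() {
        return serverSocket.getLocalPort();
    }

    /** Serves connections one at a time on a daemon thread until close() is called. */
    void start() {
        Thread acceptor = new Thread(() -> {
            while (!serverSocket.isClosed()) {
                try (Socket socket = serverSocket.accept()) {
                    handle(socket);
                } catch (IOException | RuntimeException e) {
                    // accept() fails once the server socket is closed;
                    // any other failure just drops the current connection.
                }
            }
        });
        acceptor.setDaemon(true);
        acceptor.start();
    }

    private void handle(Socket socket) throws IOException {
        // The extractor is expected to read the stream to EOF, which arrives
        // when the client shuts down its output side.
        String text = extractor.apply(socket.getInputStream());
        OutputStream out = socket.getOutputStream();
        out.write(text.getBytes(StandardCharsets.UTF_8));
        out.flush();
    }

    @Override
    public void close() throws IOException {
        serverSocket.close();
    }
}
```

With a design like this there is no servlet container involved: the process only needs a port number, which is what the --port option above would supply.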

      We should also add a NetworkParser class that acts as a local client for such a service. This way a lightweight client could use the full set of Tika parsing functionality even with just the tika-core jar on its classpath.
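The client side could be sketched roughly as follows. This is not the proposed NetworkParser itself: the class name NetworkClientSketch, its extract method, and the wire protocol (send the document, half-close the socket, read UTF-8 text until the server closes) are all assumptions made for illustration. It needs nothing beyond the JDK (Java 9 or later for transferTo and readAllBytes).

```java
import java.io.IOException;
import java.io.InputStream;
import java.io.OutputStream;
import java.net.Socket;
import java.nio.charset.StandardCharsets;

/** Hypothetical sketch of a thin client that delegates extraction to a remote service. */
final class NetworkClientSketch {
    private final String host;
    private final int port;

    NetworkClientSketch(String host, int port) {
        this.host = host;
        this.port = port;
    }

    /** Sends the document bytes and returns whatever text the server replies with. */
    String extract(InputStream document) throws IOException {
        try (Socket socket = new Socket(host, port)) {
            OutputStream out = socket.getOutputStream();
            document.transferTo(out); // stream the whole document to the server
            out.flush();
            socket.shutdownOutput();  // half-close: tells the server the document is complete

            // Read the reply until the server closes the connection.
            // Limitation of this sketch: the request is fully written before the
            // reply is read, so a server that streams output while still reading
            // input could stall on large documents.
            byte[] reply = socket.getInputStream().readAllBytes();
            return new String(reply, StandardCharsets.UTF_8);
        }
    }
}
```

A caller would use it along the lines of new NetworkClientSketch("localhost", 1234).extract(stream), with all parser dependencies living on the server side only.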

      Attachments

        1. TIKA-593.Mattmann.032712.patch.2.txt
          38 kB
          Chris A. Mattmann
        2. TIKA-593.Mattmann.032712.patch.txt
          38 kB
          Chris A. Mattmann
        3. TIKA-593.Mattmann.032612.patch.2.txt
          19 kB
          Chris A. Mattmann
        4. TIKA-593.Mattmann.032612.patch.txt
          13 kB
          Chris A. Mattmann
        5. TIKA-593_pom.diff
          1 kB
          Ingo Renner

          People

            Assignee: Chris A. Mattmann (chrismattmann)
            Reporter: Jukka Zitting (jukkaz)
            Votes: 2
            Watchers: 8
