Details

    • Type: New Feature
    • Status: Resolved
    • Priority: Major
    • Resolution: Fixed
    • Affects Version/s: 0.10
    • Fix Version/s: 1.2
    • Component/s: general
    • Labels:
      None

      Description

      It would be cool to be able to run Tika as a network service that accepts a binary document as input and produces the extracted content (as XHTML, text, or just metadata) as output. A bit like TIKA-169, but without the dependency on a servlet container.

      I'd like to be able to set up and run such a server like this:

      $ java -jar tika-app.jar --port 1234

      We should also add a NetworkParser class that acts as a local client for such a service. This way a lightweight client could use the full set of Tika parsing functionality even with just the tika-core jar on its classpath.
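
      To make the proposed exchange concrete, here is a minimal sketch using plain JDK sockets (Java 9+ for readAllBytes). It is only an illustration of the idea, not the code from the attached patches: the names NetworkExtractionSketch, serveOnce and extractRemotely are hypothetical, and the extractor function is a placeholder for real Tika parsing. The assumed protocol is that the client sends the document bytes, half-closes the connection to signal the end of input, and then reads the extracted output until the server closes the socket.

```java
import java.io.IOException;
import java.io.OutputStream;
import java.net.InetAddress;
import java.net.ServerSocket;
import java.net.Socket;
import java.nio.charset.StandardCharsets;
import java.util.function.Function;

public class NetworkExtractionSketch {

    /**
     * Server side: handles exactly one connection. Reads the whole document
     * (until the client half-closes), runs the extractor, writes the result.
     * The extractor is a stand-in for real parsing logic (assumption).
     */
    public static void serveOnce(ServerSocket server,
                                 Function<byte[], String> extractor) throws IOException {
        try (Socket socket = server.accept()) {
            byte[] document = socket.getInputStream().readAllBytes();
            String output = extractor.apply(document);
            OutputStream out = socket.getOutputStream();
            out.write(output.getBytes(StandardCharsets.UTF_8));
            out.flush();
        } // closing the socket tells the client the output is complete
    }

    /**
     * Client side: sends the document, signals end of input with
     * shutdownOutput(), then reads the reply until the server closes.
     */
    public static String extractRemotely(String host, int port,
                                         byte[] document) throws IOException {
        try (Socket socket = new Socket(host, port)) {
            socket.getOutputStream().write(document);
            socket.shutdownOutput();
            byte[] reply = socket.getInputStream().readAllBytes();
            return new String(reply, StandardCharsets.UTF_8);
        }
    }

    public static void main(String[] args) throws Exception {
        InetAddress loopback = InetAddress.getLoopbackAddress();
        // Port 0 asks the OS for any free port; a real server would use --port.
        try (ServerSocket server = new ServerSocket(0, 1, loopback)) {
            Thread serverThread = new Thread(() -> {
                try {
                    // Placeholder extractor: reports the size instead of parsing.
                    serveOnce(server, doc -> "received " + doc.length + " bytes");
                } catch (IOException e) {
                    throw new RuntimeException(e);
                }
            });
            serverThread.start();

            byte[] document = "hello, tika".getBytes(StandardCharsets.UTF_8);
            String result = extractRemotely(loopback.getHostAddress(),
                                            server.getLocalPort(), document);
            serverThread.join();
            System.out.println(result);
        }
    }
}
```

      With this shape the client needs nothing beyond socket I/O, which is the point of keeping it usable with only tika-core on the classpath; all parser dependencies stay on the server side.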

        Attachments

        1. TIKA-593.Mattmann.032712.patch.txt
          38 kB
          Chris A. Mattmann
        2. TIKA-593.Mattmann.032712.patch.2.txt
          38 kB
          Chris A. Mattmann
        3. TIKA-593.Mattmann.032612.patch.txt
          13 kB
          Chris A. Mattmann
        4. TIKA-593.Mattmann.032612.patch.2.txt
          19 kB
          Chris A. Mattmann
        5. TIKA-593_pom.diff
          1 kB
          Ingo Renner

            People

            • Assignee:
              chrismattmann Chris A. Mattmann
              Reporter:
              jukkaz Jukka Zitting
            • Votes:
              2
              Watchers:
              8
