Details

    • Type: New Feature
    • Status: Resolved
    • Priority: Major
    • Resolution: Fixed
    • Affects Version/s: 0.10
    • Fix Version/s: 1.2
    • Component/s: general
    • Labels:
      None

      Description

      It would be cool to be able to run Tika as a network service that accepts a binary document as input and produces the extracted content (as XHTML, text, or just metadata) as output. This would be a bit like TIKA-169, but without the dependency on a servlet container.

      I'd like to be able to set up and run such a server like this:

      $ java -jar tika-app.jar --port 1234

      We should also add a NetworkParser class that acts as a local client for such a service. This way, a lightweight client could use the full set of Tika parsing functionality even with just the tika-core jar on its classpath.
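
The request/response exchange described above could look roughly like the following sketch. It uses only JDK sockets and no Tika classes. The class name NetworkExtractSketch, its methods, and the wire convention (the client sends the raw document bytes, half-closes the connection, then reads the reply until end of stream) are assumptions made for illustration, not the protocol of any attached patch. The extract method is a hypothetical stand-in for real parsing and only reports the byte count.

```java
import java.io.ByteArrayOutputStream;
import java.io.IOException;
import java.io.InputStream;
import java.io.OutputStream;
import java.net.InetAddress;
import java.net.ServerSocket;
import java.net.Socket;
import java.nio.charset.StandardCharsets;

public class NetworkExtractSketch {

    /** Hypothetical stand-in for real content extraction: reports the byte count. */
    public static String extract(byte[] document) {
        return "bytes=" + document.length;
    }

    /** Reads a stream until end of stream and returns everything that was read. */
    public static byte[] readAll(InputStream in) throws IOException {
        ByteArrayOutputStream buf = new ByteArrayOutputStream();
        byte[] chunk = new byte[8192];
        int n;
        while ((n = in.read(chunk)) != -1) {
            buf.write(chunk, 0, n);
        }
        return buf.toByteArray();
    }

    /** Server side: accept one connection, read the document, write the result. */
    public static void serveOnce(ServerSocket server) throws IOException {
        try (Socket socket = server.accept()) {
            byte[] document = readAll(socket.getInputStream());
            OutputStream out = socket.getOutputStream();
            out.write(extract(document).getBytes(StandardCharsets.UTF_8));
            out.flush();
        }
    }

    /** Client side: send the document, half-close, then read the extracted text. */
    public static String parseRemotely(InetAddress host, int port, byte[] document)
            throws IOException {
        try (Socket socket = new Socket(host, port)) {
            socket.getOutputStream().write(document);
            socket.shutdownOutput(); // half-close tells the server the document is complete
            return new String(readAll(socket.getInputStream()), StandardCharsets.UTF_8);
        }
    }

    public static void main(String[] args) throws Exception {
        // Port 0 asks the OS for any free port; bind to loopback only.
        InetAddress loopback = InetAddress.getLoopbackAddress();
        try (ServerSocket server = new ServerSocket(0, 1, loopback)) {
            Thread serverThread = new Thread(() -> {
                try {
                    serveOnce(server);
                } catch (IOException e) {
                    throw new RuntimeException(e);
                }
            });
            serverThread.start();

            byte[] document = "hello".getBytes(StandardCharsets.UTF_8);
            String result = parseRemotely(loopback, server.getLocalPort(), document);
            serverThread.join();
            System.out.println(result);
        }
    }
}
```

The half-close keeps the sketch free of any length prefix or framing: the server knows the document is complete when it sees end of stream, and the client knows the reply is complete when the server closes the connection. A real service would also need to handle concurrent connections, timeouts, and parse errors.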

      Attachments

      1. TIKA-593_pom.diff
        1 kB
        Ingo Renner
      2. TIKA-593.Mattmann.032612.patch.2.txt
        19 kB
        Chris A. Mattmann
      3. TIKA-593.Mattmann.032612.patch.txt
        13 kB
        Chris A. Mattmann
      4. TIKA-593.Mattmann.032712.patch.2.txt
        38 kB
        Chris A. Mattmann
      5. TIKA-593.Mattmann.032712.patch.txt
        38 kB
        Chris A. Mattmann

          People

          • Assignee:
            Chris A. Mattmann
            Reporter:
            Jukka Zitting
          • Votes:
            2
            Watchers:
            8