Tika / TIKA-1047

Provide a JAX-RS endpoint to detect only the mediatype

Details

    • Type: Wish
    • Status: Resolved
    • Priority: Critical
    • Resolution: Fixed
    • Affects Version/s: None
    • Fix Version/s: 1.4
    • Component/s: server
    • Labels: None

    Description

      Currently, I can detect the mediatype through the JAX-RS server's meta endpoint. The problem is that the meta endpoint needs the entire document, because it returns all of the metadata.

      To detect the mediatype, often only a few bytes are necessary, so I'd like to send only, say, 8K or so to the server and let it tell me the mediatype.
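
      As a rough client-side sketch of this idea (not part of the attached patches), the snippet below reads only the first 8K of a document and prepares a request for the proposed /meta/mediatype endpoint. The helper names, the use of PUT, and the base URL http://localhost:9998 are assumptions made for illustration.

```python
import io
import urllib.request

PREFIX_BYTES = 8 * 1024  # "8K or so", as suggested in the description


def read_prefix(stream, limit=PREFIX_BYTES):
    """Return at most `limit` bytes from the start of a binary stream."""
    return stream.read(limit)


def build_detect_request(prefix, base_url="http://localhost:9998"):
    """Prepare (but do not send) a request carrying only the prefix.

    Assumptions: the base URL and the PUT method are illustrative;
    /meta/mediatype is the endpoint proposed in this issue.
    """
    return urllib.request.Request(
        base_url + "/meta/mediatype", data=prefix, method="PUT"
    )


if __name__ == "__main__":
    document = io.BytesIO(b"%PDF-1.4 " + b"\0" * 100_000)  # stand-in document
    request = build_detect_request(read_prefix(document))
    print(request.get_method(), request.full_url, len(request.data))
```

      Sending the request (for example with urllib.request.urlopen) is left out so that the sketch does not depend on a running server.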

      To accomplish this, it would be good to modify the /meta endpoint so that the individual fields that might be returned can be addressed directly:

      /meta/mediatype
      /meta/author
      /meta/lastModified
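
      One way to picture this addressing scheme is a small helper that maps a request path to the requested field. This is only an illustrative sketch; the function name field_from_path is hypothetical and does not come from the Tika server code.

```python
def field_from_path(path):
    """Return the field named in '/meta/<field>', or None for plain '/meta'.

    Hypothetical helper: raises ValueError for paths outside /meta.
    """
    parts = [p for p in path.split("/") if p]
    if not parts or parts[0] != "meta" or len(parts) > 2:
        raise ValueError("not a /meta path: " + path)
    return parts[1] if len(parts) == 2 else None


if __name__ == "__main__":
    for path in ("/meta", "/meta/mediatype", "/meta/author", "/meta/lastModified"):
        print(path, "->", field_from_path(path))
```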

      The parts currently following the path could be turned into a query parameter, which I think is more appropriate anyway (and also easier to manipulate with tools like jQuery).

      If the data sent is not sufficient, I'd just return BAD_REQUEST (HTTP 400).
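
      The error behavior could look roughly like the following sketch, which assumes the extracted metadata is available as a plain dictionary and treats a missing field as "not enough data". The function name field_response and the dictionary keys are hypothetical.

```python
from http import HTTPStatus


def field_response(metadata, field):
    """Return (status, body) for a single requested metadata field.

    Assumption: a field that could not be determined from the data sent
    is simply absent from `metadata`, which maps to BAD_REQUEST.
    """
    value = metadata.get(field)
    if value is None:
        return HTTPStatus.BAD_REQUEST, ""
    return HTTPStatus.OK, str(value)


if __name__ == "__main__":
    partial = {"mediatype": "application/pdf"}  # e.g. detected from a prefix
    print(field_response(partial, "mediatype"))
    print(field_response(partial, "author"))
```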

      If this is of interest to Tika, I think I could implement it.

      Attachments

        1. patch
          24 kB
          Raimund Merkert
        2. TIKA-1047.rmerkert.012713.patch.txt
          4 kB
          Chris A. Mattmann

            People

              Assignee: Chris A. Mattmann (chrismattmann)
              Reporter: Raimund Merkert (spamdaemon)
