Tika / TIKA-834

Server mode: only the first result is correct; later runs include data from earlier runs


Details

    • Type: Bug
    • Status: Resolved
    • Priority: Major
    • Resolution: Fixed
    • Affects Version/s: 1.0
    • Fix Version/s: 1.2
    • Component/s: cli
    • Labels: None
    • Environment: java version "1.6.0_23"
      OpenJDK Runtime Environment (IcedTea6 1.11pre) (6b23~pre11-0ubuntu1.11.10)
      OpenJDK Server VM (build 20.0-b11, mixed mode)

    Description

      The steps below show the behavior with JSON output (-j). Plain-text metadata output (-m) is also affected: the server keeps returning the first result, even when different PDF files are sent afterwards.

      1. Run the server to get metadata:
        java -jar tika-app-1.0.jar -m -j --server 9000
      2. Send a PDF document:
        nc localhost 9000 < test.pdf
      3. Get a good result:
        { "Author":"unknown",
        "Content-Type":"application/pdf",
        "Creation-Date":"2011-12-27T18:21:59Z",
        "Last-Modified":"2011-12-27T18:21:59Z",
        "created":"Tue Dec 27 12:21:59 CST 2011",
        "creator":"PScript5.dll Version 5.2.2",
        "producer":"GPL Ghostscript 9.04",
        "title":"Aaron Rodgers the clear-cut MVP after dismantling Chicago Bears - Peter King - SI.com",
        "xmpTPg:NPages":3 }
      4. Send the same PDF document again:
        nc localhost 9000 < test.pdf
      5. Get a bad result with extra values from the previous run:
        { "Author":["unknown", "unknown"],
        "Content-Type":"application/pdf",
        "Creation-Date":"2011-12-27T18:21:59Z",
        "Last-Modified":"2011-12-27T18:21:59Z",
        "created":"Tue Dec 27 12:21:59 CST 2011",
        "creator":["PScript5.dll Version 5.2.2", "PScript5.dll Version 5.2.2"],
        "producer":["GPL Ghostscript 9.04", "GPL Ghostscript 9.04"],
        "title":["Aaron Rodgers the clear-cut MVP after dismantling Chicago Bears - Peter King - SI.com", "Aaron Rodgers the clear-cut MVP after dismantling Chicago Bears - Peter King - SI.com"],
        "xmpTPg:NPages":3 }
      6. Send another PDF document:
        nc localhost 9000 < ctypes.pdf
      7. Get a bad result with extra values from the previous two runs:
        { "Author":["unknown", "unknown", "unknown"],
        "Content-Type":"application/pdf",
        "Creation-Date":"2011-12-27T18:03:18Z",
        "Last-Modified":"2011-12-27T18:03:18Z",
        "created":"Tue Dec 27 12:03:18 CST 2011",
        "creator":["PScript5.dll Version 5.2.2", "PScript5.dll Version 5.2.2", "PScript5.dll Version 5.2.2"],
        "producer":["GPL Ghostscript 9.04", "GPL Ghostscript 9.04", "GPL Ghostscript 9.04"],
        "title":["Aaron Rodgers the clear-cut MVP after dismantling Chicago Bears - Peter King - SI.com", "Aaron Rodgers the clear-cut MVP after dismantling Chicago Bears - Peter King - SI.com", "(15.17. ctypes \\227 A foreign function library for Python \\227 Python v2.7.2 documentation)"],

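      The reproduction steps above can be scripted. The following is a minimal sketch, not part of the original report: the helper names `send_file` and `repeated_values` are hypothetical, and it assumes the server started in step 1 replies once the client closes its sending side of the connection, as appears to happen with `nc` reading from a file. `repeated_values` flags metadata keys whose list contains the same entry more than once, which is the symptom shown in steps 5 and 7.

```python
import json
import socket
import sys


def send_file(path, host="localhost", port=9000):
    """Send one file to the server and return the decoded response text."""
    with open(path, "rb") as f:
        payload = f.read()
    with socket.create_connection((host, port)) as sock:
        sock.sendall(payload)
        # Half-close the connection so the server sees end of input
        # (assumption: the server answers after the client stops sending).
        sock.shutdown(socket.SHUT_WR)
        chunks = []
        while True:
            chunk = sock.recv(4096)
            if not chunk:
                break
            chunks.append(chunk)
    return b"".join(chunks).decode("utf-8", errors="replace")


def repeated_values(metadata_json):
    """Return the sorted keys whose value is a list with a repeated entry."""
    metadata = json.loads(metadata_json)
    return sorted(
        key
        for key, value in metadata.items()
        if isinstance(value, list) and len(set(map(str, value))) < len(value)
    )


if __name__ == "__main__":
    # Send the same file twice, as in steps 2 and 4, and report any keys
    # whose values were duplicated in the response.
    for run in (1, 2):
        response = send_file(sys.argv[1])
        print(run, repeated_values(response))
```

      Applied to the step 3 output, `repeated_values` returns an empty list; applied to the step 5 output, it returns `Author`, `creator`, `producer`, and `title`. A field that legitimately holds several distinct values is not flagged, so the check only catches values accumulated from repeated runs.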

          People

            Assignee: Jukka Zitting (jukkaz)
            Reporter: George Kappel (stilldodge)