Uploaded image for project: 'Tika'
  1. Tika
  2. TIKA-2322

Video labeling using existing ObjectRecognition

    Details

    • Type: Improvement
    • Status: Resolved
    • Priority: Major
    • Resolution: Fixed
    • Affects Version/s: None
    • Fix Version/s: 1.15
    • Component/s: parser
    • Labels:

      Description

      Currently TIKA supports ObjectRecognition in Images. I am proposing to extend this to support videos.

      Idea is -
      1. Extract frames from video and run IncV3 to get labels for these frames.
      2. We average confidence scores of same labels for each frame.
      3. Return results in sorted order of confidence score.

      I am writing code for different modes of frame extractions -
      1. Extract center image.
      2. Extract frames after every fixed interval.
      3. Extract N frames equally divided across video.

      We used this approach in [0]. Code in [1]

      [0] https://github.com/USCDataScience/hadoop-pot
      [1] https://github.com/USCDataScience/video-recognition

        Activity

        Hide
        msharan@usc.edu Madhav Sharan added a comment -

        I am working on a PR that adds this functionality through image recognition REST API. Once we are more comfortable with pure java implementation we can add this there too.

        Are there any concerns in adding OpenCV as maven dependency in Tika?

        Show
        msharan@usc.edu Madhav Sharan added a comment - I am working on a PR that adds this functionality through image recognition REST API. Once we are more comfortable with pure java implementation we can add this there too. Are there any concerns in adding OpenCV as maven dependency in Tika?
        Hide
        chrismattmann Chris A. Mattmann added a comment -

        Thanks Madhav Sharan please check the license for OpenCV, and make sure it's available in the Central Repository.

        Show
        chrismattmann Chris A. Mattmann added a comment - Thanks Madhav Sharan please check the license for OpenCV, and make sure it's available in the Central Repository.
        Hide
        kranthigv Kranthi Kiran GV added a comment -

        OpenCV is available on the Central repository (https://search.maven.org/#artifactdetails%7Corg.bytedeco.javacpp-presets%7Copencv%7C3.2.0-1.3%7Cjar).
        I believe the license of OpenCV would allow us to use it in the current proposal. (http://opencv.org/license.html)
        They want us to include the copyright notice.

        Show
        kranthigv Kranthi Kiran GV added a comment - OpenCV is available on the Central repository ( https://search.maven.org/#artifactdetails%7Corg.bytedeco.javacpp-presets%7Copencv%7C3.2.0-1.3%7Cjar ). I believe the license of OpenCV would allow us to use it in the current proposal. ( http://opencv.org/license.html ) They want us to include the copyright notice.
        Hide
        msharan@usc.edu Madhav Sharan added a comment -

        Cool, thanks. As of now I am adding a new API in existing inception REST API.

        Will try OpenCV java and DL4J and raise a PR if things look good

        Show
        msharan@usc.edu Madhav Sharan added a comment - Cool, thanks. As of now I am adding a new API in existing inception REST API. Will try OpenCV java and DL4J and raise a PR if things look good
        Hide
        kranthigv Kranthi Kiran GV added a comment -

        Looking forward to your PR. I would be very interested to collaborate on the tika-dl version of the API.

        Show
        kranthigv Kranthi Kiran GV added a comment - Looking forward to your PR. I would be very interested to collaborate on the tika-dl version of the API.
        Hide
        githubbot ASF GitHub Bot added a comment -

        smadha opened a new pull request #168: fix for TIKA-2322 contributed by msharan@usc.edu
        URL: https://github.com/apache/tika/pull/168

        Instructions

        • Install OpneCV with python support. on mac it can be done by `brew
          install opencv` and adding cv2 to python path
        • Bring up inception server as per Tika and Vision wiki
        • java -jar tika-app/target/tika-app-1.14.jar
          --config=tika-parsers/src/test/resources/org/apache/tika/parser/recognit
          ion/tika-config-tflow-video-rest.xml
          ./tika-parsers/src/test/resources/test-documents/testVideoMp4.mp4

        output from Tika
        ```
        $ java -jar tika-app/target/tika-app-1.14.jar
        --config=tika-parsers/src/test/resources/org/apache/tika/parser/recognit
        ion/tika-config-tflow-video-rest.xml
        ./tika-parsesources/test-documents/testVideoMp4.mp4
        INFO Available = true, API Status = HTTP/1.0 200 OK
        INFO minConfidence = 0.015, topN=4
        INFO Recogniser =
        org.apache.tika.parser.recognition.tf.TensorflowRESTVideoRecogniser
        INFO Recogniser Available = true
        <?xml version="1.0" encoding="UTF-8"?><html
        xmlns="http://www.w3.org/1999/xhtml">
        <head>
        <meta name="org.apache.tika.parser.recognition.object.rec.impl"
        content="org.apache.tika.parser.recognition.tf.TensorflowRESTVideoRecogn
        iser"/>
        <meta name="X-Parsed-By"
        content="org.apache.tika.parser.CompositeParser"/>
        <meta name="X-Parsed-By"
        content="org.apache.tika.parser.recognition.ObjectRecognitionParser"/>
        <meta name="resourceName" content="testVideoMp4.mp4"/>
        <meta name="Content-Length" content="1053651"/>
        <meta name="OBJECT" content="dugong, Dugong dugon (0.15187)"/>
        <meta name="OBJECT" content="television, television system (0.08859)"/>
        <meta name="OBJECT" content="ice bear, polar bear, Ursus Maritimus,
        Thalarctos maritimus (0.06996)"/>
        <meta name="OBJECT" content="triceratops (0.05711)"/>
        <meta name="Content-Type" content="video/mp4"/>
        <title/>
        </head>
        <body><ol id="objects"> <li id="dugong, Dugong dugon"> dugong, Dugong
        dugon [en](confidence = 0.151870 )</li>
        <li id="television, television system"> television, television system
        [en](confidence = 0.088589 )</li>
        <li id="ice bear, polar bear, Ursus Maritimus, Thalarctos maritimus">
        ice bear, polar bear, Ursus Maritimus, Thalarctos maritimus
        [en](confidence = 0.069961 )</li>
        <li id="triceratops"> triceratops [en](confidence = 0.057112 )</li>
        </ol>
        </body></html>LMC-032212:tika sharan$

        ```

        ----------------------------------------------------------------
        This is an automated message from the Apache Git Service.
        To respond to the message, please log on GitHub and use the
        URL above to go to the specific comment.

        For queries about this service, please contact Infrastructure at:
        users@infra.apache.org

        Show
        githubbot ASF GitHub Bot added a comment - smadha opened a new pull request #168: fix for TIKA-2322 contributed by msharan@usc.edu URL: https://github.com/apache/tika/pull/168 Instructions Install OpneCV with python support. on mac it can be done by `brew install opencv` and adding cv2 to python path Bring up inception server as per Tika and Vision wiki java -jar tika-app/target/tika-app-1.14.jar --config=tika-parsers/src/test/resources/org/apache/tika/parser/recognit ion/tika-config-tflow-video-rest.xml ./tika-parsers/src/test/resources/test-documents/testVideoMp4.mp4 Sample video downloaded from http://www.sample-videos.com/ output from Tika ``` $ java -jar tika-app/target/tika-app-1.14.jar --config=tika-parsers/src/test/resources/org/apache/tika/parser/recognit ion/tika-config-tflow-video-rest.xml ./tika-parsesources/test-documents/testVideoMp4.mp4 INFO Available = true, API Status = HTTP/1.0 200 OK INFO minConfidence = 0.015, topN=4 INFO Recogniser = org.apache.tika.parser.recognition.tf.TensorflowRESTVideoRecogniser INFO Recogniser Available = true <?xml version="1.0" encoding="UTF-8"?><html xmlns="http://www.w3.org/1999/xhtml"> <head> <meta name="org.apache.tika.parser.recognition.object.rec.impl" content="org.apache.tika.parser.recognition.tf.TensorflowRESTVideoRecogn iser"/> <meta name="X-Parsed-By" content="org.apache.tika.parser.CompositeParser"/> <meta name="X-Parsed-By" content="org.apache.tika.parser.recognition.ObjectRecognitionParser"/> <meta name="resourceName" content="testVideoMp4.mp4"/> <meta name="Content-Length" content="1053651"/> <meta name="OBJECT" content="dugong, Dugong dugon (0.15187)"/> <meta name="OBJECT" content="television, television system (0.08859)"/> <meta name="OBJECT" content="ice bear, polar bear, Ursus Maritimus, Thalarctos maritimus (0.06996)"/> <meta name="OBJECT" content="triceratops (0.05711)"/> <meta name="Content-Type" content="video/mp4"/> <title/> </head> <body><ol id="objects"> <li id="dugong, Dugong dugon"> dugong, Dugong dugon [en] (confidence = 0.151870 )</li> <li id="television, television system"> television, television system [en] (confidence = 0.088589 )</li> <li id="ice bear, polar bear, Ursus Maritimus, Thalarctos maritimus"> ice bear, polar bear, Ursus Maritimus, Thalarctos maritimus [en] (confidence = 0.069961 )</li> <li id="triceratops"> triceratops [en] (confidence = 0.057112 )</li> </ol> </body></html>LMC-032212:tika sharan$ ``` ---------------------------------------------------------------- This is an automated message from the Apache Git Service. To respond to the message, please log on GitHub and use the URL above to go to the specific comment. For queries about this service, please contact Infrastructure at: users@infra.apache.org
        Hide
        msharan@usc.edu Madhav Sharan added a comment -

        Thanks for your interest Kranthi. It will be great working this out in tika-dl. Let fix this issue and then we can start investiagting more for adding it in tika-dl, it will be a good addition there. We need to confirm few things -

        • I came to know that OpenCV is pulled internally by DL4J we need to confirm this and if that's the case it will be super simple for us to use.
        • We need to choose best way of using OpenCV I have used below [0], [1], [2]. I liked [2] the most because of easy integration but last time I used it I faced lot of memory management issues. We need to double check if we can use these integrations with tika-server with atleast 1000 videos without server restart.

        [0] https://github.com/USCDataScience/hadoop-pot/blob/master/hadoop-pot-video/pom.xml#L10-L14
        [1] https://opencv-java-tutorials.readthedocs.io/en/latest/index.html
        [2] https://github.com/bytedeco/javacv

        Show
        msharan@usc.edu Madhav Sharan added a comment - Thanks for your interest Kranthi. It will be great working this out in tika-dl. Let fix this issue and then we can start investiagting more for adding it in tika-dl, it will be a good addition there. We need to confirm few things - I came to know that OpenCV is pulled internally by DL4J we need to confirm this and if that's the case it will be super simple for us to use. We need to choose best way of using OpenCV I have used below [0] , [1] , [2] . I liked [2] the most because of easy integration but last time I used it I faced lot of memory management issues. We need to double check if we can use these integrations with tika-server with atleast 1000 videos without server restart. [0] https://github.com/USCDataScience/hadoop-pot/blob/master/hadoop-pot-video/pom.xml#L10-L14 [1] https://opencv-java-tutorials.readthedocs.io/en/latest/index.html [2] https://github.com/bytedeco/javacv
        Hide
        githubbot ASF GitHub Bot added a comment -

        chrismattmann commented on issue #168: fix for TIKA-2322 contributed by msharan@usc.edu
        URL: https://github.com/apache/tika/pull/168#issuecomment-293767158

        amazing work @smadha ! I will review this now.

        ----------------------------------------------------------------
        This is an automated message from the Apache Git Service.
        To respond to the message, please log on GitHub and use the
        URL above to go to the specific comment.

        For queries about this service, please contact Infrastructure at:
        users@infra.apache.org

        Show
        githubbot ASF GitHub Bot added a comment - chrismattmann commented on issue #168: fix for TIKA-2322 contributed by msharan@usc.edu URL: https://github.com/apache/tika/pull/168#issuecomment-293767158 amazing work @smadha ! I will review this now. ---------------------------------------------------------------- This is an automated message from the Apache Git Service. To respond to the message, please log on GitHub and use the URL above to go to the specific comment. For queries about this service, please contact Infrastructure at: users@infra.apache.org
        Hide
        githubbot ASF GitHub Bot added a comment -

        smadha commented on issue #168: fix for TIKA-2322 contributed by msharan@usc.edu
        URL: https://github.com/apache/tika/pull/168#issuecomment-294090011

        @chrismattmann - Let me know if you have comments. I'll put up a wiki on Monday

        ----------------------------------------------------------------
        This is an automated message from the Apache Git Service.
        To respond to the message, please log on GitHub and use the
        URL above to go to the specific comment.

        For queries about this service, please contact Infrastructure at:
        users@infra.apache.org

        Show
        githubbot ASF GitHub Bot added a comment - smadha commented on issue #168: fix for TIKA-2322 contributed by msharan@usc.edu URL: https://github.com/apache/tika/pull/168#issuecomment-294090011 @chrismattmann - Let me know if you have comments. I'll put up a wiki on Monday ---------------------------------------------------------------- This is an automated message from the Apache Git Service. To respond to the message, please log on GitHub and use the URL above to go to the specific comment. For queries about this service, please contact Infrastructure at: users@infra.apache.org
        Hide
        githubbot ASF GitHub Bot added a comment -

        thammegowda commented on a change in pull request #168: fix for TIKA-2322 contributed by msharan@usc.edu
        URL: https://github.com/apache/tika/pull/168#discussion_r111529768

        ##########
        File path: tika-parsers/src/test/java/org/apache/tika/parser/recognition/tf/TensorflowVideoRecParserTest.java
        ##########
        @@ -0,0 +1,56 @@
        +/*
        + * Licensed to the Apache Software Foundation (ASF) under one or more
        + * contributor license agreements. See the NOTICE file distributed with
        + * this work for additional information regarding copyright ownership.
        + * The ASF licenses this file to You under the Apache License, Version 2.0
        + * (the "License"); you may not use this file except in compliance with
        + * the License. You may obtain a copy of the License at
        + *
        + * http://www.apache.org/licenses/LICENSE-2.0
        + *
        + * Unless required by applicable law or agreed to in writing, software
        + * distributed under the License is distributed on an "AS IS" BASIS,
        + * WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
        + * See the License for the specific language governing permissions and
        + * limitations under the License.
        + */
        +
        +package org.apache.tika.parser.recognition.tf;
        +
        +import org.apache.tika.config.Param;
        +import org.apache.tika.metadata.Metadata;
        +import org.apache.tika.parser.ParseContext;
        +import org.apache.tika.parser.recognition.RecognisedObject;
        +import org.junit.Assert;
        +import org.junit.Ignore;
        +import org.junit.Test;
        +import org.xml.sax.helpers.DefaultHandler;
        +
        +import java.io.InputStream;
        +import java.util.HashMap;
        +import java.util.HashSet;
        +import java.util.List;
        +import java.util.Set;
        +
        +
        +@Ignore
        +public class TensorflowVideoRecParserTest {
        +
        + @Test
        + public void recognise() throws Exception {
        + TensorflowRESTVideoRecogniser recogniser = new TensorflowRESTVideoRecogniser();
        + recogniser.initialize(new HashMap<String, Param>());
        + try (InputStream stream = getClass().getClassLoader().getResourceAsStream("test-documents/testVideoMp4.mp4")) {
        + List<RecognisedObject> objects = recogniser.recognise(stream, new DefaultHandler(), new Metadata(), new ParseContext());
        +
        + Assert.assertTrue(objects.size() > 0);
        + Set<String> objectLabels = new HashSet<>();
        + for (RecognisedObject object : objects) {
        + objectLabels.add(object.getLabel());
        + System.out.println(object.getLabel());

        Review comment:
        Please use Logs instead of std out prints. Maybe this print is not necessary, instead one or more asserts expected

        ----------------------------------------------------------------
        This is an automated message from the Apache Git Service.
        To respond to the message, please log on GitHub and use the
        URL above to go to the specific comment.

        For queries about this service, please contact Infrastructure at:
        users@infra.apache.org

        Show
        githubbot ASF GitHub Bot added a comment - thammegowda commented on a change in pull request #168: fix for TIKA-2322 contributed by msharan@usc.edu URL: https://github.com/apache/tika/pull/168#discussion_r111529768 ########## File path: tika-parsers/src/test/java/org/apache/tika/parser/recognition/tf/TensorflowVideoRecParserTest.java ########## @@ -0,0 +1,56 @@ +/* + * Licensed to the Apache Software Foundation (ASF) under one or more + * contributor license agreements. See the NOTICE file distributed with + * this work for additional information regarding copyright ownership. + * The ASF licenses this file to You under the Apache License, Version 2.0 + * (the "License"); you may not use this file except in compliance with + * the License. You may obtain a copy of the License at + * + * http://www.apache.org/licenses/LICENSE-2.0 + * + * Unless required by applicable law or agreed to in writing, software + * distributed under the License is distributed on an "AS IS" BASIS, + * WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied. + * See the License for the specific language governing permissions and + * limitations under the License. + */ + +package org.apache.tika.parser.recognition.tf; + +import org.apache.tika.config.Param; +import org.apache.tika.metadata.Metadata; +import org.apache.tika.parser.ParseContext; +import org.apache.tika.parser.recognition.RecognisedObject; +import org.junit.Assert; +import org.junit.Ignore; +import org.junit.Test; +import org.xml.sax.helpers.DefaultHandler; + +import java.io.InputStream; +import java.util.HashMap; +import java.util.HashSet; +import java.util.List; +import java.util.Set; + + +@Ignore +public class TensorflowVideoRecParserTest { + + @Test + public void recognise() throws Exception { + TensorflowRESTVideoRecogniser recogniser = new TensorflowRESTVideoRecogniser(); + recogniser.initialize(new HashMap<String, Param>()); + try (InputStream stream = getClass().getClassLoader().getResourceAsStream("test-documents/testVideoMp4.mp4")) { + List<RecognisedObject> objects = recogniser.recognise(stream, new DefaultHandler(), new Metadata(), new ParseContext()); + + Assert.assertTrue(objects.size() > 0); + Set<String> objectLabels = new HashSet<>(); + for (RecognisedObject object : objects) { + objectLabels.add(object.getLabel()); + System.out.println(object.getLabel()); Review comment: Please use Logs instead of std out prints. Maybe this print is not necessary, instead one or more asserts expected ---------------------------------------------------------------- This is an automated message from the Apache Git Service. To respond to the message, please log on GitHub and use the URL above to go to the specific comment. For queries about this service, please contact Infrastructure at: users@infra.apache.org
        Hide
        githubbot ASF GitHub Bot added a comment -

        thammegowda commented on a change in pull request #168: fix for TIKA-2322 contributed by msharan@usc.edu
        URL: https://github.com/apache/tika/pull/168#discussion_r111529580

        ##########
        File path: tika-parsers/src/main/java/org/apache/tika/parser/recognition/tf/TensorflowRESTRecogniser.java
        ##########
        @@ -70,7 +70,11 @@
        private URI healthUri = URI.create("http://localhost:8764/inception/v3/ping");

        private boolean available;
        -
        +
        + protected URI getApiUri(Metadata metadata){
        + return apiUri;

        Review comment:
        Curious : why pass `metadata` to this getter method, any plans to use it in future?

        ----------------------------------------------------------------
        This is an automated message from the Apache Git Service.
        To respond to the message, please log on GitHub and use the
        URL above to go to the specific comment.

        For queries about this service, please contact Infrastructure at:
        users@infra.apache.org

        Show
        githubbot ASF GitHub Bot added a comment - thammegowda commented on a change in pull request #168: fix for TIKA-2322 contributed by msharan@usc.edu URL: https://github.com/apache/tika/pull/168#discussion_r111529580 ########## File path: tika-parsers/src/main/java/org/apache/tika/parser/recognition/tf/TensorflowRESTRecogniser.java ########## @@ -70,7 +70,11 @@ private URI healthUri = URI.create("http://localhost:8764/inception/v3/ping"); private boolean available; - + + protected URI getApiUri(Metadata metadata){ + return apiUri; Review comment: Curious : why pass `metadata` to this getter method, any plans to use it in future? ---------------------------------------------------------------- This is an automated message from the Apache Git Service. To respond to the message, please log on GitHub and use the URL above to go to the specific comment. For queries about this service, please contact Infrastructure at: users@infra.apache.org
        Hide
        githubbot ASF GitHub Bot added a comment -

        smadha commented on a change in pull request #168: fix for TIKA-2322 contributed by msharan@usc.edu
        URL: https://github.com/apache/tika/pull/168#discussion_r111529989

        ##########
        File path: tika-parsers/src/main/java/org/apache/tika/parser/recognition/tf/TensorflowRESTRecogniser.java
        ##########
        @@ -70,7 +70,11 @@
        private URI healthUri = URI.create("http://localhost:8764/inception/v3/ping");

        private boolean available;
        -
        +
        + protected URI getApiUri(Metadata metadata){
        + return apiUri;

        Review comment:
        I have overriden this method

        ----------------------------------------------------------------
        This is an automated message from the Apache Git Service.
        To respond to the message, please log on GitHub and use the
        URL above to go to the specific comment.

        For queries about this service, please contact Infrastructure at:
        users@infra.apache.org

        Show
        githubbot ASF GitHub Bot added a comment - smadha commented on a change in pull request #168: fix for TIKA-2322 contributed by msharan@usc.edu URL: https://github.com/apache/tika/pull/168#discussion_r111529989 ########## File path: tika-parsers/src/main/java/org/apache/tika/parser/recognition/tf/TensorflowRESTRecogniser.java ########## @@ -70,7 +70,11 @@ private URI healthUri = URI.create("http://localhost:8764/inception/v3/ping"); private boolean available; - + + protected URI getApiUri(Metadata metadata){ + return apiUri; Review comment: I have overriden this method ---------------------------------------------------------------- This is an automated message from the Apache Git Service. To respond to the message, please log on GitHub and use the URL above to go to the specific comment. For queries about this service, please contact Infrastructure at: users@infra.apache.org
        Hide
        githubbot ASF GitHub Bot added a comment -

        smadha commented on a change in pull request #168: fix for TIKA-2322 contributed by msharan@usc.edu
        URL: https://github.com/apache/tika/pull/168#discussion_r111530043

        ##########
        File path: tika-parsers/src/main/resources/org/apache/tika/parser/recognition/tf/video_util.py
        ##########
        @@ -0,0 +1,112 @@
        +#!/usr/bin/env python
        +#
        +# Licensed to the Apache Software Foundation (ASF) under one or more
        +# contributor license agreements. See the NOTICE file distributed with
        +# this work for additional information regarding copyright ownership.
        +# The ASF licenses this file to You under the Apache License, Version 2.0
        +# (the "License"); you may not use this file except in compliance with
        +# the License. You may obtain a copy of the License at
        +#
        +# http://www.apache.org/licenses/LICENSE-2.0
        +#
        +# Unless required by applicable law or agreed to in writing, software
        +# distributed under the License is distributed on an "AS IS" BASIS,
        +# WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
        +# See the License for the specific language governing permissions and
        +# limitations under the License.
        +
        +import cv2
        +import os
        +import sys
        +import ntpath
        +import numpy as np
        +
        +def _get_image_from_array(image_array):
        + #JPG to support tensorflow
        + byte_arr = cv2.imencode(".jpg", image_array )[1]
        + return "".join(map(chr, byte_arr))
        +
        +def _path_leaf(path):
        + """
        + Returns file name from path. Path should not end with slash
        + """
        + head, tail = ntpath.split(path)
        + return tail or ntpath.basename(head)
        +
        +def get_center_frame(video_path):
        + """
        + Traverse till half of video and saves center snapshot
        + @param video_path: Path to video file on system
        + """
        + cap = cv2.VideoCapture(video_path)
        +
        + length = int(cap.get(cv2.cv.CV_CAP_PROP_FRAME_COUNT))
        +
        + success,image = cap.read()
        + count = 0
        +
        + while(success and count < length/2):
        + success,image = cap.read()
        + count += 1
        +
        + return _get_image_from_array(image)
        +
        +def get_frames_interval(video_path, frame_interval):
        + """
        + Selects one frames after every frame_interval
        + @param video_path: Path to video file on system
        + @param frame_interval: Interval after which frame should be picked. If frame_interval=10 then every 10th frame will be extracted
        + """
        + cap = cv2.VideoCapture(video_path)
        +
        + length = int(cap.get(cv2.cv.CV_CAP_PROP_FRAME_COUNT))
        +
        + success,image = cap.read()
        + count = 0
        +
        + image_arr = []
        + while(success and count < length):
        + success,image = cap.read()
        + if count % frame_interval == 0:
        + image = _get_image_from_array(image )
        + image_arr.append(image)
        +
        + count += 1
        +
        + return image_arr
        +
        +def get_n_frames(video_path, num_frame):
        + """
        + Get N frames equidistant to each other in a video
        + @param video_path: Path to video file on system
        + @param num_frame: Number of frames to be extracted from video. If num_frame=10 then 10 frames equally distant from each other will be extracted
        + """
        + cap = cv2.VideoCapture(video_path)
        +
        + length = int(cap.get(cv2.cv.CV_CAP_PROP_FRAME_COUNT))
        +
        + op_frame_idx = set(np.linspace(0, length - 2, num_frame, dtype=int))
        +
        + success, image = cap.read()
        + count = 0
        +
        + image_arr = []
        + while(success and count < length):
        + success, image = cap.read()
        + if success and count in op_frame_idx:
        +
        + image = _get_image_from_array(image )
        + image_arr.append(image)
        +
        + count += 1
        +
        + return image_arr
        +
        +
        +
        +if _name_ == '_main_':
        + img_data = get_center_frame("/Users/sharan/Documents/workspace/data/prep_13-28-video.mp4")

        Review comment:
        Oh yeah, good catch buddy. This was for testing I can remove it all together

        ----------------------------------------------------------------
        This is an automated message from the Apache Git Service.
        To respond to the message, please log on GitHub and use the
        URL above to go to the specific comment.

        For queries about this service, please contact Infrastructure at:
        users@infra.apache.org

        Show
        githubbot ASF GitHub Bot added a comment - smadha commented on a change in pull request #168: fix for TIKA-2322 contributed by msharan@usc.edu URL: https://github.com/apache/tika/pull/168#discussion_r111530043 ########## File path: tika-parsers/src/main/resources/org/apache/tika/parser/recognition/tf/video_util.py ########## @@ -0,0 +1,112 @@ +#!/usr/bin/env python +# +# Licensed to the Apache Software Foundation (ASF) under one or more +# contributor license agreements. See the NOTICE file distributed with +# this work for additional information regarding copyright ownership. +# The ASF licenses this file to You under the Apache License, Version 2.0 +# (the "License"); you may not use this file except in compliance with +# the License. You may obtain a copy of the License at +# +# http://www.apache.org/licenses/LICENSE-2.0 +# +# Unless required by applicable law or agreed to in writing, software +# distributed under the License is distributed on an "AS IS" BASIS, +# WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied. +# See the License for the specific language governing permissions and +# limitations under the License. + +import cv2 +import os +import sys +import ntpath +import numpy as np + +def _get_image_from_array(image_array): + #JPG to support tensorflow + byte_arr = cv2.imencode(".jpg", image_array ) [1] + return "".join(map(chr, byte_arr)) + +def _path_leaf(path): + """ + Returns file name from path. Path should not end with slash + """ + head, tail = ntpath.split(path) + return tail or ntpath.basename(head) + +def get_center_frame(video_path): + """ + Traverse till half of video and saves center snapshot + @param video_path: Path to video file on system + """ + cap = cv2.VideoCapture(video_path) + + length = int(cap.get(cv2.cv.CV_CAP_PROP_FRAME_COUNT)) + + success,image = cap.read() + count = 0 + + while(success and count < length/2): + success,image = cap.read() + count += 1 + + return _get_image_from_array(image) + +def get_frames_interval(video_path, frame_interval): + """ + Selects one frames after every frame_interval + @param video_path: Path to video file on system + @param frame_interval: Interval after which frame should be picked. If frame_interval=10 then every 10th frame will be extracted + """ + cap = cv2.VideoCapture(video_path) + + length = int(cap.get(cv2.cv.CV_CAP_PROP_FRAME_COUNT)) + + success,image = cap.read() + count = 0 + + image_arr = [] + while(success and count < length): + success,image = cap.read() + if count % frame_interval == 0: + image = _get_image_from_array(image ) + image_arr.append(image) + + count += 1 + + return image_arr + +def get_n_frames(video_path, num_frame): + """ + Get N frames equidistant to each other in a video + @param video_path: Path to video file on system + @param num_frame: Number of frames to be extracted from video. If num_frame=10 then 10 frames equally distant from each other will be extracted + """ + cap = cv2.VideoCapture(video_path) + + length = int(cap.get(cv2.cv.CV_CAP_PROP_FRAME_COUNT)) + + op_frame_idx = set(np.linspace(0, length - 2, num_frame, dtype=int)) + + success, image = cap.read() + count = 0 + + image_arr = [] + while(success and count < length): + success, image = cap.read() + if success and count in op_frame_idx: + + image = _get_image_from_array(image ) + image_arr.append(image) + + count += 1 + + return image_arr + + + +if _ name _ == '_ main _': + img_data = get_center_frame("/Users/sharan/Documents/workspace/data/prep_13-28-video.mp4") Review comment: Oh yeah, good catch buddy. This was for testing I can remove it all together ---------------------------------------------------------------- This is an automated message from the Apache Git Service. To respond to the message, please log on GitHub and use the URL above to go to the specific comment. For queries about this service, please contact Infrastructure at: users@infra.apache.org
        Hide
        githubbot ASF GitHub Bot added a comment -

        smadha commented on a change in pull request #168: fix for TIKA-2322 contributed by msharan@usc.edu
        URL: https://github.com/apache/tika/pull/168#discussion_r111530110

        ##########
        File path: tika-parsers/src/test/java/org/apache/tika/parser/recognition/tf/TensorflowVideoRecParserTest.java
        ##########
        @@ -0,0 +1,56 @@
        +/*
        + * Licensed to the Apache Software Foundation (ASF) under one or more
        + * contributor license agreements. See the NOTICE file distributed with
        + * this work for additional information regarding copyright ownership.
        + * The ASF licenses this file to You under the Apache License, Version 2.0
        + * (the "License"); you may not use this file except in compliance with
        + * the License. You may obtain a copy of the License at
        + *
        + * http://www.apache.org/licenses/LICENSE-2.0
        + *
        + * Unless required by applicable law or agreed to in writing, software
        + * distributed under the License is distributed on an "AS IS" BASIS,
        + * WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
        + * See the License for the specific language governing permissions and
        + * limitations under the License.
        + */
        +
        +package org.apache.tika.parser.recognition.tf;
        +
        +import org.apache.tika.config.Param;
        +import org.apache.tika.metadata.Metadata;
        +import org.apache.tika.parser.ParseContext;
        +import org.apache.tika.parser.recognition.RecognisedObject;
        +import org.junit.Assert;
        +import org.junit.Ignore;
        +import org.junit.Test;
        +import org.xml.sax.helpers.DefaultHandler;
        +
        +import java.io.InputStream;
        +import java.util.HashMap;
        +import java.util.HashSet;
        +import java.util.List;
        +import java.util.Set;
        +
        +
        +@Ignore
        +public class TensorflowVideoRecParserTest {
        +
        + @Test
        + public void recognise() throws Exception {
        + TensorflowRESTVideoRecogniser recogniser = new TensorflowRESTVideoRecogniser();
        + recogniser.initialize(new HashMap<String, Param>());
        + try (InputStream stream = getClass().getClassLoader().getResourceAsStream("test-documents/testVideoMp4.mp4")) {
        + List<RecognisedObject> objects = recogniser.recognise(stream, new DefaultHandler(), new Metadata(), new ParseContext());
        +
        + Assert.assertTrue(objects.size() > 0);
        + Set<String> objectLabels = new HashSet<>();
        + for (RecognisedObject object : objects) {
        + objectLabels.add(object.getLabel());
        + System.out.println(object.getLabel());

        Review comment:
        Good catch again. This should be removed

        ----------------------------------------------------------------
        This is an automated message from the Apache Git Service.
        To respond to the message, please log on GitHub and use the
        URL above to go to the specific comment.

        For queries about this service, please contact Infrastructure at:
        users@infra.apache.org

        Show
        githubbot ASF GitHub Bot added a comment - smadha commented on a change in pull request #168: fix for TIKA-2322 contributed by msharan@usc.edu URL: https://github.com/apache/tika/pull/168#discussion_r111530110 ########## File path: tika-parsers/src/test/java/org/apache/tika/parser/recognition/tf/TensorflowVideoRecParserTest.java ########## @@ -0,0 +1,56 @@ +/* + * Licensed to the Apache Software Foundation (ASF) under one or more + * contributor license agreements. See the NOTICE file distributed with + * this work for additional information regarding copyright ownership. + * The ASF licenses this file to You under the Apache License, Version 2.0 + * (the "License"); you may not use this file except in compliance with + * the License. You may obtain a copy of the License at + * + * http://www.apache.org/licenses/LICENSE-2.0 + * + * Unless required by applicable law or agreed to in writing, software + * distributed under the License is distributed on an "AS IS" BASIS, + * WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied. + * See the License for the specific language governing permissions and + * limitations under the License. + */ + +package org.apache.tika.parser.recognition.tf; + +import org.apache.tika.config.Param; +import org.apache.tika.metadata.Metadata; +import org.apache.tika.parser.ParseContext; +import org.apache.tika.parser.recognition.RecognisedObject; +import org.junit.Assert; +import org.junit.Ignore; +import org.junit.Test; +import org.xml.sax.helpers.DefaultHandler; + +import java.io.InputStream; +import java.util.HashMap; +import java.util.HashSet; +import java.util.List; +import java.util.Set; + + +@Ignore +public class TensorflowVideoRecParserTest { + + @Test + public void recognise() throws Exception { + TensorflowRESTVideoRecogniser recogniser = new TensorflowRESTVideoRecogniser(); + recogniser.initialize(new HashMap<String, Param>()); + try (InputStream stream = getClass().getClassLoader().getResourceAsStream("test-documents/testVideoMp4.mp4")) { + List<RecognisedObject> objects = recogniser.recognise(stream, new DefaultHandler(), new Metadata(), new ParseContext()); + + Assert.assertTrue(objects.size() > 0); + Set<String> objectLabels = new HashSet<>(); + for (RecognisedObject object : objects) { + objectLabels.add(object.getLabel()); + System.out.println(object.getLabel()); Review comment: Good catch again. This should be removed ---------------------------------------------------------------- This is an automated message from the Apache Git Service. To respond to the message, please log on GitHub and use the URL above to go to the specific comment. For queries about this service, please contact Infrastructure at: users@infra.apache.org
        Hide
        githubbot ASF GitHub Bot added a comment -

        smadha commented on issue #168: fix for TIKA-2322 contributed by msharan@usc.edu
        URL: https://github.com/apache/tika/pull/168#issuecomment-294923610

        @chrismattmann - ping. Any comment?

        ----------------------------------------------------------------
        This is an automated message from the Apache Git Service.
        To respond to the message, please log on GitHub and use the
        URL above to go to the specific comment.

        For queries about this service, please contact Infrastructure at:
        users@infra.apache.org

        Show
        githubbot ASF GitHub Bot added a comment - smadha commented on issue #168: fix for TIKA-2322 contributed by msharan@usc.edu URL: https://github.com/apache/tika/pull/168#issuecomment-294923610 @chrismattmann - ping. Any comment? ---------------------------------------------------------------- This is an automated message from the Apache Git Service. To respond to the message, please log on GitHub and use the URL above to go to the specific comment. For queries about this service, please contact Infrastructure at: users@infra.apache.org
        Hide
        githubbot ASF GitHub Bot added a comment -

        ThejanW commented on issue #168: fix for TIKA-2322 contributed by msharan@usc.edu
        URL: https://github.com/apache/tika/pull/168#issuecomment-294943747

        Hi @smadha, this is interesting. Since you are proposing this as an improvement to existing ObjectRecognition parser, why not update the InceptionRestDockerfile[1] to install OpenCV within the container, otherwise one should have Tensorflow, Flask, Requests, OpenCV all installed in his system. Needless to say that there can be version conflicts of these dependencies with the versions the user has.
        [1] https://github.com/apache/tika/blob/master/tika-parsers/src/main/resources/org/apache/tika/parser/recognition/tf/InceptionRestDockerfile

        ----------------------------------------------------------------
        This is an automated message from the Apache Git Service.
        To respond to the message, please log on GitHub and use the
        URL above to go to the specific comment.

        For queries about this service, please contact Infrastructure at:
        users@infra.apache.org

        Show
        githubbot ASF GitHub Bot added a comment - ThejanW commented on issue #168: fix for TIKA-2322 contributed by msharan@usc.edu URL: https://github.com/apache/tika/pull/168#issuecomment-294943747 Hi @smadha, this is interesting. Since you are proposing this as an improvement to existing ObjectRecognition parser, why not update the InceptionRestDockerfile [1] to install OpenCV within the container, otherwise one should have Tensorflow, Flask, Requests, OpenCV all installed in his system. Needless to say that there can be version conflicts of these dependencies with the versions the user has. [1] https://github.com/apache/tika/blob/master/tika-parsers/src/main/resources/org/apache/tika/parser/recognition/tf/InceptionRestDockerfile ---------------------------------------------------------------- This is an automated message from the Apache Git Service. To respond to the message, please log on GitHub and use the URL above to go to the specific comment. For queries about this service, please contact Infrastructure at: users@infra.apache.org
        Hide
        githubbot ASF GitHub Bot added a comment -

        smadha commented on issue #168: fix for TIKA-2322 contributed by msharan@usc.edu
        URL: https://github.com/apache/tika/pull/168#issuecomment-294956981

        I like your idea @ThejanW. I will look into it but I will like to create a separate PR for this and I will add another Dockerfile making sure that it does not affect current inception server. I will also make sure that OpenCV is not required for Image recognition services

        ----------------------------------------------------------------
        This is an automated message from the Apache Git Service.
        To respond to the message, please log on GitHub and use the
        URL above to go to the specific comment.

        For queries about this service, please contact Infrastructure at:
        users@infra.apache.org

        Show
        githubbot ASF GitHub Bot added a comment - smadha commented on issue #168: fix for TIKA-2322 contributed by msharan@usc.edu URL: https://github.com/apache/tika/pull/168#issuecomment-294956981 I like your idea @ThejanW. I will look into it but I will like to create a separate PR for this and I will add another Dockerfile making sure that it does not affect current inception server. I will also make sure that OpenCV is not required for Image recognition services ---------------------------------------------------------------- This is an automated message from the Apache Git Service. To respond to the message, please log on GitHub and use the URL above to go to the specific comment. For queries about this service, please contact Infrastructure at: users@infra.apache.org
        Hide
        githubbot ASF GitHub Bot added a comment -

        chrismattmann commented on issue #168: fix for TIKA-2322 contributed by msharan@usc.edu
        URL: https://github.com/apache/tika/pull/168#issuecomment-297598025

        quick note, I had to:
        ```
        Error: No available formula with the name "opencv"
        It was migrated from homebrew/core to homebrew/science.
        You can access it again by running:
        brew tap homebrew/science
        ```

        To get opencv on Mac.

        ----------------------------------------------------------------
        This is an automated message from the Apache Git Service.
        To respond to the message, please log on GitHub and use the
        URL above to go to the specific comment.

        For queries about this service, please contact Infrastructure at:
        users@infra.apache.org

        Show
        githubbot ASF GitHub Bot added a comment - chrismattmann commented on issue #168: fix for TIKA-2322 contributed by msharan@usc.edu URL: https://github.com/apache/tika/pull/168#issuecomment-297598025 quick note, I had to: ``` Error: No available formula with the name "opencv" It was migrated from homebrew/core to homebrew/science. You can access it again by running: brew tap homebrew/science ``` To get opencv on Mac. ---------------------------------------------------------------- This is an automated message from the Apache Git Service. To respond to the message, please log on GitHub and use the URL above to go to the specific comment. For queries about this service, please contact Infrastructure at: users@infra.apache.org
        Hide
        githubbot ASF GitHub Bot added a comment -

        chrismattmann commented on issue #168: fix for TIKA-2322 contributed by msharan@usc.edu
        URL: https://github.com/apache/tika/pull/168#issuecomment-297601661

        also had to fix the URL to inception API in https://github.com/apache/tika/pull/168/commits/10529eba2bda73c577bedba3c053a01953911922

        ----------------------------------------------------------------
        This is an automated message from the Apache Git Service.
        To respond to the message, please log on GitHub and use the
        URL above to go to the specific comment.

        For queries about this service, please contact Infrastructure at:
        users@infra.apache.org

        Show
        githubbot ASF GitHub Bot added a comment - chrismattmann commented on issue #168: fix for TIKA-2322 contributed by msharan@usc.edu URL: https://github.com/apache/tika/pull/168#issuecomment-297601661 also had to fix the URL to inception API in https://github.com/apache/tika/pull/168/commits/10529eba2bda73c577bedba3c053a01953911922 ---------------------------------------------------------------- This is an automated message from the Apache Git Service. To respond to the message, please log on GitHub and use the URL above to go to the specific comment. For queries about this service, please contact Infrastructure at: users@infra.apache.org
        Hide
        githubbot ASF GitHub Bot added a comment -

        chrismattmann commented on issue #168: fix for TIKA-2322 contributed by msharan@usc.edu
        URL: https://github.com/apache/tika/pull/168#issuecomment-297601693

        am now getting a weird error where the Inception server isn't showing on 8764.

        ----------------------------------------------------------------
        This is an automated message from the Apache Git Service.
        To respond to the message, please log on GitHub and use the
        URL above to go to the specific comment.

        For queries about this service, please contact Infrastructure at:
        users@infra.apache.org

        Show
        githubbot ASF GitHub Bot added a comment - chrismattmann commented on issue #168: fix for TIKA-2322 contributed by msharan@usc.edu URL: https://github.com/apache/tika/pull/168#issuecomment-297601693 am now getting a weird error where the Inception server isn't showing on 8764. ---------------------------------------------------------------- This is an automated message from the Apache Git Service. To respond to the message, please log on GitHub and use the URL above to go to the specific comment. For queries about this service, please contact Infrastructure at: users@infra.apache.org
        Hide
        githubbot ASF GitHub Bot added a comment -

        chrismattmann commented on issue #168: fix for TIKA-2322 contributed by msharan@usc.edu
        URL: https://github.com/apache/tika/pull/168#issuecomment-297602511

        @thammegowda can you take a look here. I can't access the server on 8764:

        Docker quick start terminal

        ```
        LMC-053601:tf mattmann$ docker run -p 8764:8764 -it inception-rest-tika
        >> Downloading inception_v4_2016_09_09.tar.gz 100.0%
        Successfully downloaded inception_v4_2016_09_09.tar.gz 171177982 bytes.
        >> Downloading imagenet_lsvrc_2015_synsets.txt 163.8%
        Successfully downloaded imagenet_lsvrc_2015_synsets.txt 10000 bytes.
        >> Downloading imagenet_metadata.txt 100.5%
        Successfully downloaded imagenet_metadata.txt 741401 bytes.
        W tensorflow/core/platform/cpu_feature_guard.cc:45] The TensorFlow library wasn't compiled to use SSE3 instructions, but these are available on your machine and could speed up CPU computations.
        W tensorflow/core/platform/cpu_feature_guard.cc:45] The TensorFlow library wasn't compiled to use SSE4.1 instructions, but these are available on your machine and could speed up CPU computations.
        W tensorflow/core/platform/cpu_feature_guard.cc:45] The TensorFlow library wasn't compiled to use SSE4.2 instructions, but these are available on your machine and could speed up CPU computations.
        W tensorflow/core/platform/cpu_feature_guard.cc:45] The TensorFlow library wasn't compiled to use AVX instructions, but these are available on your machine and could speed up CPU computations.
        Serving on port 8764

        Docker client (separate terminal)

        ```
        LMC-053601:tf mattmann$ curl "http://localhost:8764/inception/v4/classify?topk=2&url=https://upload.wikimedia.org/wikipedia/commons/f/f6/Working_Dogs%2C_Handlers_Share_Special_Bond_DVIDS124942.jpg"
        curl: (7) Failed to connect to localhost port 8764: Connection refused
        LMC-053601:tf mattmann$
        ```

        ----------------------------------------------------------------
        This is an automated message from the Apache Git Service.
        To respond to the message, please log on GitHub and use the
        URL above to go to the specific comment.

        For queries about this service, please contact Infrastructure at:
        users@infra.apache.org

        Show
        githubbot ASF GitHub Bot added a comment - chrismattmann commented on issue #168: fix for TIKA-2322 contributed by msharan@usc.edu URL: https://github.com/apache/tika/pull/168#issuecomment-297602511 @thammegowda can you take a look here. I can't access the server on 8764: Docker quick start terminal ``` LMC-053601:tf mattmann$ docker run -p 8764:8764 -it inception-rest-tika >> Downloading inception_v4_2016_09_09.tar.gz 100.0% Successfully downloaded inception_v4_2016_09_09.tar.gz 171177982 bytes. >> Downloading imagenet_lsvrc_2015_synsets.txt 163.8% Successfully downloaded imagenet_lsvrc_2015_synsets.txt 10000 bytes. >> Downloading imagenet_metadata.txt 100.5% Successfully downloaded imagenet_metadata.txt 741401 bytes. W tensorflow/core/platform/cpu_feature_guard.cc:45] The TensorFlow library wasn't compiled to use SSE3 instructions, but these are available on your machine and could speed up CPU computations. W tensorflow/core/platform/cpu_feature_guard.cc:45] The TensorFlow library wasn't compiled to use SSE4.1 instructions, but these are available on your machine and could speed up CPU computations. W tensorflow/core/platform/cpu_feature_guard.cc:45] The TensorFlow library wasn't compiled to use SSE4.2 instructions, but these are available on your machine and could speed up CPU computations. W tensorflow/core/platform/cpu_feature_guard.cc:45] The TensorFlow library wasn't compiled to use AVX instructions, but these are available on your machine and could speed up CPU computations. Serving on port 8764 Running on http://0.0.0.0:8764/ (Press CTRL+C to quit) ``` Docker client (separate terminal) ``` LMC-053601:tf mattmann$ curl "http://localhost:8764/inception/v4/classify?topk=2&url= https://upload.wikimedia.org/wikipedia/commons/f/f6/Working_Dogs%2C_Handlers_Share_Special_Bond_DVIDS124942.jpg " curl: (7) Failed to connect to localhost port 8764: Connection refused LMC-053601:tf mattmann$ ``` ---------------------------------------------------------------- This is an automated message from the Apache Git Service. To respond to the message, please log on GitHub and use the URL above to go to the specific comment. For queries about this service, please contact Infrastructure at: users@infra.apache.org
        Hide
        githubbot ASF GitHub Bot added a comment -

        chrismattmann commented on issue #168: fix for TIKA-2322 contributed by msharan@usc.edu
        URL: https://github.com/apache/tika/pull/168#issuecomment-297602511

        @thammegowda can you take a look here. I can't access the server on 8764:

        Docker quick start terminal

        ```
        LMC-053601:tf mattmann$ docker run -p 8764:8764 -it inception-rest-tika
        >> Downloading inception_v4_2016_09_09.tar.gz 100.0%
        Successfully downloaded inception_v4_2016_09_09.tar.gz 171177982 bytes.
        >> Downloading imagenet_lsvrc_2015_synsets.txt 163.8%
        Successfully downloaded imagenet_lsvrc_2015_synsets.txt 10000 bytes.
        >> Downloading imagenet_metadata.txt 100.5%
        Successfully downloaded imagenet_metadata.txt 741401 bytes.
        W tensorflow/core/platform/cpu_feature_guard.cc:45] The TensorFlow library wasn't compiled to use SSE3 instructions, but these are available on your machine and could speed up CPU computations.
        W tensorflow/core/platform/cpu_feature_guard.cc:45] The TensorFlow library wasn't compiled to use SSE4.1 instructions, but these are available on your machine and could speed up CPU computations.
        W tensorflow/core/platform/cpu_feature_guard.cc:45] The TensorFlow library wasn't compiled to use SSE4.2 instructions, but these are available on your machine and could speed up CPU computations.
        W tensorflow/core/platform/cpu_feature_guard.cc:45] The TensorFlow library wasn't compiled to use AVX instructions, but these are available on your machine and could speed up CPU computations.
        Serving on port 8764

        Docker client (separate terminal)

        ```
        LMC-053601:tf mattmann$ curl "http://localhost:8764/inception/v4/classify?topk=2&url=https://upload.wikimedia.org/wikipedia/commons/f/f6/Working_Dogs%2C_Handlers_Share_Special_Bond_DVIDS124942.jpg"
        curl: (7) Failed to connect to localhost port 8764: Connection refused
        LMC-053601:tf mattmann$
        ```

        ----------------------------------------------------------------
        This is an automated message from the Apache Git Service.
        To respond to the message, please log on GitHub and use the
        URL above to go to the specific comment.

        For queries about this service, please contact Infrastructure at:
        users@infra.apache.org

        Show
        githubbot ASF GitHub Bot added a comment - chrismattmann commented on issue #168: fix for TIKA-2322 contributed by msharan@usc.edu URL: https://github.com/apache/tika/pull/168#issuecomment-297602511 @thammegowda can you take a look here. I can't access the server on 8764: Docker quick start terminal ``` LMC-053601:tf mattmann$ docker run -p 8764:8764 -it inception-rest-tika >> Downloading inception_v4_2016_09_09.tar.gz 100.0% Successfully downloaded inception_v4_2016_09_09.tar.gz 171177982 bytes. >> Downloading imagenet_lsvrc_2015_synsets.txt 163.8% Successfully downloaded imagenet_lsvrc_2015_synsets.txt 10000 bytes. >> Downloading imagenet_metadata.txt 100.5% Successfully downloaded imagenet_metadata.txt 741401 bytes. W tensorflow/core/platform/cpu_feature_guard.cc:45] The TensorFlow library wasn't compiled to use SSE3 instructions, but these are available on your machine and could speed up CPU computations. W tensorflow/core/platform/cpu_feature_guard.cc:45] The TensorFlow library wasn't compiled to use SSE4.1 instructions, but these are available on your machine and could speed up CPU computations. W tensorflow/core/platform/cpu_feature_guard.cc:45] The TensorFlow library wasn't compiled to use SSE4.2 instructions, but these are available on your machine and could speed up CPU computations. W tensorflow/core/platform/cpu_feature_guard.cc:45] The TensorFlow library wasn't compiled to use AVX instructions, but these are available on your machine and could speed up CPU computations. Serving on port 8764 Running on http://0.0.0.0:8764/ (Press CTRL+C to quit) ``` Docker client (separate terminal) ``` LMC-053601:tf mattmann$ curl "http://localhost:8764/inception/v4/classify?topk=2&url= https://upload.wikimedia.org/wikipedia/commons/f/f6/Working_Dogs%2C_Handlers_Share_Special_Bond_DVIDS124942.jpg " curl: (7) Failed to connect to localhost port 8764: Connection refused LMC-053601:tf mattmann$ ``` ---------------------------------------------------------------- This is an automated message from the Apache Git Service. To respond to the message, please log on GitHub and use the URL above to go to the specific comment. For queries about this service, please contact Infrastructure at: users@infra.apache.org
        Hide
        githubbot ASF GitHub Bot added a comment -

        chrismattmann commented on issue #168: fix for TIKA-2322 contributed by msharan@usc.edu
        URL: https://github.com/apache/tika/pull/168#issuecomment-297603308

        So I was able to access this by directly using the docker IP:

        ```
        http://192.168.99.100:8764/inception/v4/classify?topk=2&url=https://upload.wikimedia.org/wikipedia/commons/f/f6/Working_Dogs%2C_Handlers_Share_Special_Bond_DVIDS124942.jpg
        ```

        returns:

        ```json
        {
        "confidence": [
        0.7843596339225769,
        0.06694009155035019
        ],
        "classnames": [
        "German shepherd, German shepherd dog, German police dog, alsatian",
        "military uniform"
        ],
        "classids": [
        236,
        653
        ],
        "time":

        { "read": 380, "units": "ms", "classification": 886 }

        }
        ```

        ----------------------------------------------------------------
        This is an automated message from the Apache Git Service.
        To respond to the message, please log on GitHub and use the
        URL above to go to the specific comment.

        For queries about this service, please contact Infrastructure at:
        users@infra.apache.org

        Show
        githubbot ASF GitHub Bot added a comment - chrismattmann commented on issue #168: fix for TIKA-2322 contributed by msharan@usc.edu URL: https://github.com/apache/tika/pull/168#issuecomment-297603308 So I was able to access this by directly using the docker IP: ``` http://192.168.99.100:8764/inception/v4/classify?topk=2&url=https://upload.wikimedia.org/wikipedia/commons/f/f6/Working_Dogs%2C_Handlers_Share_Special_Bond_DVIDS124942.jpg ``` returns: ```json { "confidence": [ 0.7843596339225769, 0.06694009155035019 ], "classnames": [ "German shepherd, German shepherd dog, German police dog, alsatian", "military uniform" ], "classids": [ 236, 653 ], "time": { "read": 380, "units": "ms", "classification": 886 } } ``` ---------------------------------------------------------------- This is an automated message from the Apache Git Service. To respond to the message, please log on GitHub and use the URL above to go to the specific comment. For queries about this service, please contact Infrastructure at: users@infra.apache.org
        Hide
        githubbot ASF GitHub Bot added a comment -

        chrismattmann commented on issue #168: fix for TIKA-2322 contributed by msharan@usc.edu
        URL: https://github.com/apache/tika/pull/168#issuecomment-297603344

        anyone have any idea why my localhost port forward not working?

        ----------------------------------------------------------------
        This is an automated message from the Apache Git Service.
        To respond to the message, please log on GitHub and use the
        URL above to go to the specific comment.

        For queries about this service, please contact Infrastructure at:
        users@infra.apache.org

        Show
        githubbot ASF GitHub Bot added a comment - chrismattmann commented on issue #168: fix for TIKA-2322 contributed by msharan@usc.edu URL: https://github.com/apache/tika/pull/168#issuecomment-297603344 anyone have any idea why my localhost port forward not working? ---------------------------------------------------------------- This is an automated message from the Apache Git Service. To respond to the message, please log on GitHub and use the URL above to go to the specific comment. For queries about this service, please contact Infrastructure at: users@infra.apache.org
        Hide
        githubbot ASF GitHub Bot added a comment -

        chrismattmann commented on issue #168: fix for TIKA-2322 contributed by msharan@usc.edu
        URL: https://github.com/apache/tika/pull/168#issuecomment-297611010

        OK, here are two things we need to put in the wiki guide for this (and also we need to update the TikaAndVision page with this):

        ----------------------------------------------------------------
        This is an automated message from the Apache Git Service.
        To respond to the message, please log on GitHub and use the
        URL above to go to the specific comment.

        For queries about this service, please contact Infrastructure at:
        users@infra.apache.org

        Show
        githubbot ASF GitHub Bot added a comment - chrismattmann commented on issue #168: fix for TIKA-2322 contributed by msharan@usc.edu URL: https://github.com/apache/tika/pull/168#issuecomment-297611010 OK, here are two things we need to put in the wiki guide for this (and also we need to update the TikaAndVision page with this): On Mac, [Docker Containers Don't Run on the Host Machine itself] ( http://stackoverflow.com/questions/35878297/cant-connect-to-docker-containers-on-osx ). This means that in order to use `localhost` like we specify in our instructions, you have to fiddle with Virtual Box (which is the VM server that Docker, and Docker-Machine work with). To fiddle with Virtual Box, and get it to expose our port, 8764, you need to [enable and add the specified port forwarding rules to your Virtual Box default machine] ( https://jhipster.github.io/tips/020_tip_using_docker_containers_as_localhost_on_mac_and_windows.html ). You can find instructions on how to do that in the previous link, and you should add a rule for 8764 and 8764, and map it to 127.0.0.1. After that you're set! ---------------------------------------------------------------- This is an automated message from the Apache Git Service. To respond to the message, please log on GitHub and use the URL above to go to the specific comment. For queries about this service, please contact Infrastructure at: users@infra.apache.org
        Hide
        githubbot ASF GitHub Bot added a comment -

        chrismattmann commented on issue #168: fix for TIKA-2322 contributed by msharan@usc.edu
        URL: https://github.com/apache/tika/pull/168#issuecomment-297611248

        OK now I am stuck on this error (running `mvn clean install` so that I can get a fresh tika-app to test, using `TIKA-2322` branch on smada/tika):

        ```
        [ERROR] COMPILATION ERROR :
        [INFO] -------------------------------------------------------------
        [ERROR] /Users/mattmann/git/smadha-tika/tika-parsers/src/test/java/org/apache/tika/parser/pkg/PackageParserTest.java:[40,55] incompatible types: java.lang.String cannot be converted to org.apache.commons.compress.archivers.ArchiveInputStream
        [INFO] 1 error
        [INFO] -------------------------------------------------------------
        [INFO] ------------------------------------------------------------------------
        [INFO] Reactor Summary:
        [INFO]
        [INFO] Apache Tika parent ................................. SUCCESS [ 1.087 s]
        [INFO] Apache Tika core ................................... SUCCESS [ 22.586 s]
        [INFO] Apache Tika parsers ................................ FAILURE [ 5.888 s]
        [INFO] Apache Tika XMP .................................... SKIPPED
        [INFO] Apache Tika serialization .......................... SKIPPED
        [INFO] Apache Tika batch .................................. SKIPPED
        [INFO] Apache Tika language detection ..................... SKIPPED
        [INFO] Apache Tika application ............................ SKIPPED
        [INFO] Apache Tika OSGi bundle ............................ SKIPPED
        [INFO] Apache Tika translate .............................. SKIPPED
        [INFO] Apache Tika server ................................. SKIPPED
        [INFO] Apache Tika examples ............................... SKIPPED
        [INFO] Apache Tika Java-7 Components ...................... SKIPPED
        [INFO] Apache Tika eval ................................... SKIPPED
        [INFO] Apache Tika ........................................ SKIPPED
        [INFO] ------------------------------------------------------------------------
        [INFO] BUILD FAILURE
        [INFO] ------------------------------------------------------------------------
        [INFO] Total time: 29.993 s
        [INFO] Finished at: 2017-04-26T21:48:03-07:00
        [INFO] Final Memory: 66M/1014M
        [INFO] ------------------------------------------------------------------------
        [ERROR] Failed to execute goal org.apache.maven.plugins:maven-compiler-plugin:3.2:testCompile (default-testCompile) on project tika-parsers: Compilation failure
        [ERROR] /Users/mattmann/git/smadha-tika/tika-parsers/src/test/java/org/apache/tika/parser/pkg/PackageParserTest.java:[40,55] incompatible types: java.lang.String cannot be converted to org.apache.commons.compress.archivers.ArchiveInputStream
        [ERROR] -> [Help 1]
        [ERROR]
        [ERROR] To see the full stack trace of the errors, re-run Maven with the -e switch.
        [ERROR] Re-run Maven using the -X switch to enable full debug logging.
        [ERROR]
        [ERROR] For more information about the errors and possible solutions, please read the following articles:
        [ERROR] [Help 1] http://cwiki.apache.org/confluence/display/MAVEN/MojoFailureException
        [ERROR]
        [ERROR] After correcting the problems, you can resume the build with the command
        [ERROR] mvn <goals> -rf :tika-parsers
        LMC-053601:smadha-tika mattmann$
        ```

        I tried with both Java7, and Java8, and get the same error, any ideas?

        ```
        LMC-053601:smadha-tika mattmann$ env | grep JAVA
        JAVA_HOME=/Library/Java/JavaVirtualMachines/jdk1.8.0_91.jdk/Contents/Home
        LMC-053601:smadha-tika mattmann$ echo $JAVA_HOME
        /Library/Java/JavaVirtualMachines/jdk1.8.0_91.jdk/Contents/Home
        LMC-053601:smadha-tika mattmann$ java -version
        java version "1.8.0_91"
        Java(TM) SE Runtime Environment (build 1.8.0_91-b14)
        Java HotSpot(TM) 64-Bit Server VM (build 25.91-b14, mixed mode)
        LMC-053601:smadha-tika mattmann$ mvn -version
        Apache Maven 3.3.9 (bb52d8502b132ec0a5a3f4c09453c07478323dc5; 2015-11-10T08:41:47-08:00)
        Maven home: /usr/local/Cellar/maven/3.3.9/libexec
        Java version: 1.8.0_91, vendor: Oracle Corporation
        Java home: /Library/Java/JavaVirtualMachines/jdk1.8.0_91.jdk/Contents/Home/jre
        Default locale: en_US, platform encoding: UTF-8
        OS name: "mac os x", version: "10.11.3", arch: "x86_64", family: "mac"
        LMC-053601:smadha-tika mattmann$
        ```

        ----------------------------------------------------------------
        This is an automated message from the Apache Git Service.
        To respond to the message, please log on GitHub and use the
        URL above to go to the specific comment.

        For queries about this service, please contact Infrastructure at:
        users@infra.apache.org

        Show
        githubbot ASF GitHub Bot added a comment - chrismattmann commented on issue #168: fix for TIKA-2322 contributed by msharan@usc.edu URL: https://github.com/apache/tika/pull/168#issuecomment-297611248 OK now I am stuck on this error (running `mvn clean install` so that I can get a fresh tika-app to test, using ` TIKA-2322 ` branch on smada/tika): ``` [ERROR] COMPILATION ERROR : [INFO] ------------------------------------------------------------- [ERROR] /Users/mattmann/git/smadha-tika/tika-parsers/src/test/java/org/apache/tika/parser/pkg/PackageParserTest.java: [40,55] incompatible types: java.lang.String cannot be converted to org.apache.commons.compress.archivers.ArchiveInputStream [INFO] 1 error [INFO] ------------------------------------------------------------- [INFO] ------------------------------------------------------------------------ [INFO] Reactor Summary: [INFO] [INFO] Apache Tika parent ................................. SUCCESS [ 1.087 s] [INFO] Apache Tika core ................................... SUCCESS [ 22.586 s] [INFO] Apache Tika parsers ................................ FAILURE [ 5.888 s] [INFO] Apache Tika XMP .................................... SKIPPED [INFO] Apache Tika serialization .......................... SKIPPED [INFO] Apache Tika batch .................................. SKIPPED [INFO] Apache Tika language detection ..................... SKIPPED [INFO] Apache Tika application ............................ SKIPPED [INFO] Apache Tika OSGi bundle ............................ SKIPPED [INFO] Apache Tika translate .............................. SKIPPED [INFO] Apache Tika server ................................. SKIPPED [INFO] Apache Tika examples ............................... SKIPPED [INFO] Apache Tika Java-7 Components ...................... SKIPPED [INFO] Apache Tika eval ................................... SKIPPED [INFO] Apache Tika ........................................ SKIPPED [INFO] ------------------------------------------------------------------------ [INFO] BUILD FAILURE [INFO] ------------------------------------------------------------------------ [INFO] Total time: 29.993 s [INFO] Finished at: 2017-04-26T21:48:03-07:00 [INFO] Final Memory: 66M/1014M [INFO] ------------------------------------------------------------------------ [ERROR] Failed to execute goal org.apache.maven.plugins:maven-compiler-plugin:3.2:testCompile (default-testCompile) on project tika-parsers: Compilation failure [ERROR] /Users/mattmann/git/smadha-tika/tika-parsers/src/test/java/org/apache/tika/parser/pkg/PackageParserTest.java: [40,55] incompatible types: java.lang.String cannot be converted to org.apache.commons.compress.archivers.ArchiveInputStream [ERROR] -> [Help 1] [ERROR] [ERROR] To see the full stack trace of the errors, re-run Maven with the -e switch. [ERROR] Re-run Maven using the -X switch to enable full debug logging. [ERROR] [ERROR] For more information about the errors and possible solutions, please read the following articles: [ERROR] [Help 1] http://cwiki.apache.org/confluence/display/MAVEN/MojoFailureException [ERROR] [ERROR] After correcting the problems, you can resume the build with the command [ERROR] mvn <goals> -rf :tika-parsers LMC-053601:smadha-tika mattmann$ ``` I tried with both Java7, and Java8, and get the same error, any ideas? ``` LMC-053601:smadha-tika mattmann$ env | grep JAVA JAVA_HOME=/Library/Java/JavaVirtualMachines/jdk1.8.0_91.jdk/Contents/Home LMC-053601:smadha-tika mattmann$ echo $JAVA_HOME /Library/Java/JavaVirtualMachines/jdk1.8.0_91.jdk/Contents/Home LMC-053601:smadha-tika mattmann$ java -version java version "1.8.0_91" Java(TM) SE Runtime Environment (build 1.8.0_91-b14) Java HotSpot(TM) 64-Bit Server VM (build 25.91-b14, mixed mode) LMC-053601:smadha-tika mattmann$ mvn -version Apache Maven 3.3.9 (bb52d8502b132ec0a5a3f4c09453c07478323dc5; 2015-11-10T08:41:47-08:00) Maven home: /usr/local/Cellar/maven/3.3.9/libexec Java version: 1.8.0_91, vendor: Oracle Corporation Java home: /Library/Java/JavaVirtualMachines/jdk1.8.0_91.jdk/Contents/Home/jre Default locale: en_US, platform encoding: UTF-8 OS name: "mac os x", version: "10.11.3", arch: "x86_64", family: "mac" LMC-053601:smadha-tika mattmann$ ``` ---------------------------------------------------------------- This is an automated message from the Apache Git Service. To respond to the message, please log on GitHub and use the URL above to go to the specific comment. For queries about this service, please contact Infrastructure at: users@infra.apache.org
        Hide
        githubbot ASF GitHub Bot added a comment -

        chrismattmann commented on issue #168: fix for TIKA-2322 contributed by msharan@usc.edu
        URL: https://github.com/apache/tika/pull/168#issuecomment-297611984

        OK, nevermind, those were leftover files untracked by git, after I git reset --hard this branch to remove the unwanted rebased commits from @smadha. I removed them and the build is progressing.

        ----------------------------------------------------------------
        This is an automated message from the Apache Git Service.
        To respond to the message, please log on GitHub and use the
        URL above to go to the specific comment.

        For queries about this service, please contact Infrastructure at:
        users@infra.apache.org

        Show
        githubbot ASF GitHub Bot added a comment - chrismattmann commented on issue #168: fix for TIKA-2322 contributed by msharan@usc.edu URL: https://github.com/apache/tika/pull/168#issuecomment-297611984 OK, nevermind, those were leftover files untracked by git, after I git reset --hard this branch to remove the unwanted rebased commits from @smadha. I removed them and the build is progressing. ---------------------------------------------------------------- This is an automated message from the Apache Git Service. To respond to the message, please log on GitHub and use the URL above to go to the specific comment. For queries about this service, please contact Infrastructure at: users@infra.apache.org
        Hide
        githubbot ASF GitHub Bot added a comment -

        thammegowda commented on issue #168: fix for TIKA-2322 contributed by msharan@usc.edu
        URL: https://github.com/apache/tika/pull/168#issuecomment-297612783

        @chrismattmann Glad, you fixed it!. Wish I was online to help.

        But I see your docker setup needs revision.
        Please use "Docker for Mac" instead of "Docker toolbox" on mac.
        Docker for mac is the new+efficient way to run docker on OSX.
        More here https://blog.docker.com/2016/03/docker-for-mac-windows-beta/

        ----------------------------------------------------------------
        This is an automated message from the Apache Git Service.
        To respond to the message, please log on GitHub and use the
        URL above to go to the specific comment.

        For queries about this service, please contact Infrastructure at:
        users@infra.apache.org

        Show
        githubbot ASF GitHub Bot added a comment - thammegowda commented on issue #168: fix for TIKA-2322 contributed by msharan@usc.edu URL: https://github.com/apache/tika/pull/168#issuecomment-297612783 @chrismattmann Glad, you fixed it!. Wish I was online to help. But I see your docker setup needs revision. Please use "Docker for Mac" instead of "Docker toolbox" on mac. Docker for mac is the new+efficient way to run docker on OSX. More here https://blog.docker.com/2016/03/docker-for-mac-windows-beta/ ---------------------------------------------------------------- This is an automated message from the Apache Git Service. To respond to the message, please log on GitHub and use the URL above to go to the specific comment. For queries about this service, please contact Infrastructure at: users@infra.apache.org
        Hide
        githubbot ASF GitHub Bot added a comment -

        thammegowda commented on issue #168: fix for TIKA-2322 contributed by msharan@usc.edu
        URL: https://github.com/apache/tika/pull/168#issuecomment-297612783

        @chrismattmann Glad, you fixed it!. Wish I was online to help.

        But I see your docker setup needs revision.
        Please use "Docker for Mac" instead of "Docker toolbox" on mac.
        Docker for mac is the new+efficient way to run docker on OSX.
        More here https://blog.docker.com/2016/03/docker-for-mac-windows-beta/

        ----------------------------------------------------------------
        This is an automated message from the Apache Git Service.
        To respond to the message, please log on GitHub and use the
        URL above to go to the specific comment.

        For queries about this service, please contact Infrastructure at:
        users@infra.apache.org

        Show
        githubbot ASF GitHub Bot added a comment - thammegowda commented on issue #168: fix for TIKA-2322 contributed by msharan@usc.edu URL: https://github.com/apache/tika/pull/168#issuecomment-297612783 @chrismattmann Glad, you fixed it!. Wish I was online to help. But I see your docker setup needs revision. Please use "Docker for Mac" instead of "Docker toolbox" on mac. Docker for mac is the new+efficient way to run docker on OSX. More here https://blog.docker.com/2016/03/docker-for-mac-windows-beta/ ---------------------------------------------------------------- This is an automated message from the Apache Git Service. To respond to the message, please log on GitHub and use the URL above to go to the specific comment. For queries about this service, please contact Infrastructure at: users@infra.apache.org
        Hide
        githubbot ASF GitHub Bot added a comment -

        chrismattmann commented on issue #168: fix for TIKA-2322 contributed by msharan@usc.edu
        URL: https://github.com/apache/tika/pull/168#issuecomment-297612977

        thanks for the tip, @thammegowda. I still think we should include the above notes I put in our Wiki pages for old guys like me who haven't upgraded yet

        ----------------------------------------------------------------
        This is an automated message from the Apache Git Service.
        To respond to the message, please log on GitHub and use the
        URL above to go to the specific comment.

        For queries about this service, please contact Infrastructure at:
        users@infra.apache.org

        Show
        githubbot ASF GitHub Bot added a comment - chrismattmann commented on issue #168: fix for TIKA-2322 contributed by msharan@usc.edu URL: https://github.com/apache/tika/pull/168#issuecomment-297612977 thanks for the tip, @thammegowda. I still think we should include the above notes I put in our Wiki pages for old guys like me who haven't upgraded yet ---------------------------------------------------------------- This is an automated message from the Apache Git Service. To respond to the message, please log on GitHub and use the URL above to go to the specific comment. For queries about this service, please contact Infrastructure at: users@infra.apache.org
        Hide
        githubbot ASF GitHub Bot added a comment -

        chrismattmann commented on issue #168: fix for TIKA-2322 contributed by msharan@usc.edu
        URL: https://github.com/apache/tika/pull/168#issuecomment-297612994

        OK got this built:

        ```
        [INFO] — forbiddenapis:2.2:testCheck (default) @ tika —
        [INFO] Skipping execution for packaging "pom"
        [INFO]
        [INFO] — maven-install-plugin:2.5.2:install (default-install) @ tika —
        [INFO] Installing /Users/mattmann/git/smadha-tika/pom.xml to /Users/mattmann/.m2/repository/org/apache/tika/tika/1.15-SNAPSHOT/tika-1.15-SNAPSHOT.pom
        [INFO] ------------------------------------------------------------------------
        [INFO] Reactor Summary:
        [INFO]
        [INFO] Apache Tika parent ................................. SUCCESS [ 1.104 s]
        [INFO] Apache Tika core ................................... SUCCESS [ 22.681 s]
        [INFO] Apache Tika parsers ................................ SUCCESS [03:32 min]
        [INFO] Apache Tika XMP .................................... SUCCESS [ 1.492 s]
        [INFO] Apache Tika serialization .......................... SUCCESS [ 1.392 s]
        [INFO] Apache Tika batch .................................. SUCCESS [01:47 min]
        [INFO] Apache Tika language detection ..................... SUCCESS [ 3.170 s]
        [INFO] Apache Tika application ............................ SUCCESS [ 29.415 s]
        [INFO] Apache Tika OSGi bundle ............................ SUCCESS [ 22.927 s]
        [INFO] Apache Tika translate .............................. SUCCESS [ 2.137 s]
        [INFO] Apache Tika server ................................. SUCCESS [ 33.826 s]
        [INFO] Apache Tika examples ............................... SUCCESS [ 7.159 s]
        [INFO] Apache Tika Java-7 Components ...................... SUCCESS [ 1.874 s]
        [INFO] Apache Tika eval ................................... SUCCESS [ 24.681 s]
        [INFO] Apache Tika ........................................ SUCCESS [ 0.019 s]
        [INFO] ------------------------------------------------------------------------
        [INFO] BUILD SUCCESS
        [INFO] ------------------------------------------------------------------------
        [INFO] Total time: 07:51 min
        [INFO] Finished at: 2017-04-26T22:05:39-07:00
        [INFO] Final Memory: 178M/1570M
        [INFO] ------------------------------------------------------------------------
        LMC-053601:smadha-tika mattmann$
        ```

        now time to test!

        ----------------------------------------------------------------
        This is an automated message from the Apache Git Service.
        To respond to the message, please log on GitHub and use the
        URL above to go to the specific comment.

        For queries about this service, please contact Infrastructure at:
        users@infra.apache.org

        Show
        githubbot ASF GitHub Bot added a comment - chrismattmann commented on issue #168: fix for TIKA-2322 contributed by msharan@usc.edu URL: https://github.com/apache/tika/pull/168#issuecomment-297612994 OK got this built: ``` [INFO] — forbiddenapis:2.2:testCheck (default) @ tika — [INFO] Skipping execution for packaging "pom" [INFO] [INFO] — maven-install-plugin:2.5.2:install (default-install) @ tika — [INFO] Installing /Users/mattmann/git/smadha-tika/pom.xml to /Users/mattmann/.m2/repository/org/apache/tika/tika/1.15-SNAPSHOT/tika-1.15-SNAPSHOT.pom [INFO] ------------------------------------------------------------------------ [INFO] Reactor Summary: [INFO] [INFO] Apache Tika parent ................................. SUCCESS [ 1.104 s] [INFO] Apache Tika core ................................... SUCCESS [ 22.681 s] [INFO] Apache Tika parsers ................................ SUCCESS [03:32 min] [INFO] Apache Tika XMP .................................... SUCCESS [ 1.492 s] [INFO] Apache Tika serialization .......................... SUCCESS [ 1.392 s] [INFO] Apache Tika batch .................................. SUCCESS [01:47 min] [INFO] Apache Tika language detection ..................... SUCCESS [ 3.170 s] [INFO] Apache Tika application ............................ SUCCESS [ 29.415 s] [INFO] Apache Tika OSGi bundle ............................ SUCCESS [ 22.927 s] [INFO] Apache Tika translate .............................. SUCCESS [ 2.137 s] [INFO] Apache Tika server ................................. SUCCESS [ 33.826 s] [INFO] Apache Tika examples ............................... SUCCESS [ 7.159 s] [INFO] Apache Tika Java-7 Components ...................... SUCCESS [ 1.874 s] [INFO] Apache Tika eval ................................... SUCCESS [ 24.681 s] [INFO] Apache Tika ........................................ SUCCESS [ 0.019 s] [INFO] ------------------------------------------------------------------------ [INFO] BUILD SUCCESS [INFO] ------------------------------------------------------------------------ [INFO] Total time: 07:51 min [INFO] Finished at: 2017-04-26T22:05:39-07:00 [INFO] Final Memory: 178M/1570M [INFO] ------------------------------------------------------------------------ LMC-053601:smadha-tika mattmann$ ``` now time to test! ---------------------------------------------------------------- This is an automated message from the Apache Git Service. To respond to the message, please log on GitHub and use the URL above to go to the specific comment. For queries about this service, please contact Infrastructure at: users@infra.apache.org
        Hide
        githubbot ASF GitHub Bot added a comment -

        chrismattmann commented on issue #168: fix for TIKA-2322 contributed by msharan@usc.edu
        URL: https://github.com/apache/tika/pull/168#issuecomment-297613266

        Welp, first test failed:

        ```
        LMC-053601:smadha-tika mattmann$ java -jar tika-app/target/tika-app-1.15-SNAPSHOT.jar --config=tika-parsers/src/test/resources/org/apache/tika/parser/recognition/tika-config-tflow-video-rest.xml ./tika-parsers/src/test/resources/test-documents/testVideoMp4.mp4
        WARN JBIG2ImageReader not loaded. jbig2 files will be ignored
        INFO Available = false, API Status = HTTP/1.0 404 NOT FOUND
        INFO minConfidence = 0.015, topN=4
        INFO Recogniser = org.apache.tika.parser.recognition.tf.TensorflowRESTVideoRecogniser
        INFO Recogniser Available = false
        WARN class org.apache.tika.parser.recognition.tf.TensorflowRESTVideoRecogniser is not available for service
        LMC-053601:smadha-tika mattmann$
        ```

        ----------------------------------------------------------------
        This is an automated message from the Apache Git Service.
        To respond to the message, please log on GitHub and use the
        URL above to go to the specific comment.

        For queries about this service, please contact Infrastructure at:
        users@infra.apache.org

        Show
        githubbot ASF GitHub Bot added a comment - chrismattmann commented on issue #168: fix for TIKA-2322 contributed by msharan@usc.edu URL: https://github.com/apache/tika/pull/168#issuecomment-297613266 Welp, first test failed: ``` LMC-053601:smadha-tika mattmann$ java -jar tika-app/target/tika-app-1.15-SNAPSHOT.jar --config=tika-parsers/src/test/resources/org/apache/tika/parser/recognition/tika-config-tflow-video-rest.xml ./tika-parsers/src/test/resources/test-documents/testVideoMp4.mp4 WARN JBIG2ImageReader not loaded. jbig2 files will be ignored INFO Available = false, API Status = HTTP/1.0 404 NOT FOUND INFO minConfidence = 0.015, topN=4 INFO Recogniser = org.apache.tika.parser.recognition.tf.TensorflowRESTVideoRecogniser INFO Recogniser Available = false WARN class org.apache.tika.parser.recognition.tf.TensorflowRESTVideoRecogniser is not available for service LMC-053601:smadha-tika mattmann$ ``` ---------------------------------------------------------------- This is an automated message from the Apache Git Service. To respond to the message, please log on GitHub and use the URL above to go to the specific comment. For queries about this service, please contact Infrastructure at: users@infra.apache.org
        Hide
        githubbot ASF GitHub Bot added a comment -

        chrismattmann commented on issue #168: fix for TIKA-2322 contributed by msharan@usc.edu
        URL: https://github.com/apache/tika/pull/168#issuecomment-297613477

        So, the URLs were old in the config. I updated them to the v4, URLs, like so:

        ```diff
        diff --git a/tika-parsers/src/test/resources/org/apache/tika/parser/recognition/tika-config-tflow-video-rest.xml b/tika-parsers/src/test/resources/org/apache/tika/parser/recognition/tika-config-tflow-video-rest.xml
        index f8037a7..c6c50ed 100644
        — a/tika-parsers/src/test/resources/org/apache/tika/parser/recognition/tika-config-tflow-video-rest.xml
        +++ b/tika-parsers/src/test/resources/org/apache/tika/parser/recognition/tika-config-tflow-video-rest.xml
        @@ -24,8 +24,8 @@
        <param name="topN" type="int">4</param>
        <param name="minConfidence" type="double">0.015</param>
        <param name="class" type="string">org.apache.tika.parser.recognition.tf.TensorflowRESTVideoRecogniser</param>

        Still broke:

        ```
        MC-053601:smadha-tika mattmann$ java -jar tika-app/target/tika-app-1.15-SNAPSHOT.jar --config=tika-parsers/src/test/resources/org/apache/tika/parser/recognition/tika-config-tflow-video-rest.xml ./tika-parsers/src/test/resources/test-documents/testVideoMp4.mp4
        WARN JBIG2ImageReader not loaded. jbig2 files will be ignored
        INFO Available = true, API Status = HTTP/1.0 200 OK
        INFO minConfidence = 0.015, topN=4
        INFO Recogniser = org.apache.tika.parser.recognition.tf.TensorflowRESTVideoRecogniser
        INFO Recogniser Available = true
        INFO I/O exception (java.net.SocketException) caught when processing request: Broken pipe
        INFO Retrying request
        INFO I/O exception (java.net.SocketException) caught when processing request: Broken pipe
        INFO Retrying request
        INFO I/O exception (java.net.SocketException) caught when processing request: Broken pipe
        INFO Retrying request
        WARN Broken pipe
        java.net.SocketException: Broken pipe
        at java.net.SocketOutputStream.socketWrite0(Native Method)
        at java.net.SocketOutputStream.socketWrite(SocketOutputStream.java:109)
        at java.net.SocketOutputStream.write(SocketOutputStream.java:153)
        at org.apache.http.impl.io.AbstractSessionOutputBuffer.write(AbstractSessionOutputBuffer.java:169)
        at org.apache.http.impl.io.ContentLengthOutputStream.write(ContentLengthOutputStream.java:119)
        at org.apache.http.entity.ByteArrayEntity.writeTo(ByteArrayEntity.java:115)
        at org.apache.http.entity.HttpEntityWrapper.writeTo(HttpEntityWrapper.java:98)
        at org.apache.http.impl.client.EntityEnclosingRequestWrapper$EntityWrapper.writeTo(EntityEnclosingRequestWrapper.java:108)
        at org.apache.http.impl.entity.EntitySerializer.serialize(EntitySerializer.java:122)
        at org.apache.http.impl.AbstractHttpClientConnection.sendRequestEntity(AbstractHttpClientConnection.java:271)
        at org.apache.http.impl.conn.ManagedClientConnectionImpl.sendRequestEntity(ManagedClientConnectionImpl.java:197)
        at org.apache.http.protocol.HttpRequestExecutor.doSendRequest(HttpRequestExecutor.java:257)
        at org.apache.http.protocol.HttpRequestExecutor.execute(HttpRequestExecutor.java:125)
        at org.apache.http.impl.client.DefaultRequestDirector.tryExecute(DefaultRequestDirector.java:715)
        at org.apache.http.impl.client.DefaultRequestDirector.execute(DefaultRequestDirector.java:520)
        at org.apache.http.impl.client.AbstractHttpClient.execute(AbstractHttpClient.java:906)
        at org.apache.http.impl.client.AbstractHttpClient.execute(AbstractHttpClient.java:805)
        at org.apache.http.impl.client.AbstractHttpClient.execute(AbstractHttpClient.java:784)
        at org.apache.tika.parser.recognition.tf.TensorflowRESTRecogniser.recognise(TensorflowRESTRecogniser.java:119)
        at org.apache.tika.parser.recognition.ObjectRecognitionParser.parse(ObjectRecognitionParser.java:120)
        at org.apache.tika.parser.ParserDecorator.parse(ParserDecorator.java:188)
        at org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:280)
        at org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:280)
        at org.apache.tika.parser.AutoDetectParser.parse(AutoDetectParser.java:135)
        at org.apache.tika.cli.TikaCLI$OutputType.process(TikaCLI.java:189)
        at org.apache.tika.cli.TikaCLI.process(TikaCLI.java:481)
        at org.apache.tika.cli.TikaCLI.main(TikaCLI.java:143)
        WARN NO objects
        LMC-053601:smadha-tika mattmann$
        ```

        ----------------------------------------------------------------
        This is an automated message from the Apache Git Service.
        To respond to the message, please log on GitHub and use the
        URL above to go to the specific comment.

        For queries about this service, please contact Infrastructure at:
        users@infra.apache.org

        Show
        githubbot ASF GitHub Bot added a comment - chrismattmann commented on issue #168: fix for TIKA-2322 contributed by msharan@usc.edu URL: https://github.com/apache/tika/pull/168#issuecomment-297613477 So, the URLs were old in the config. I updated them to the v4, URLs, like so: ```diff diff --git a/tika-parsers/src/test/resources/org/apache/tika/parser/recognition/tika-config-tflow-video-rest.xml b/tika-parsers/src/test/resources/org/apache/tika/parser/recognition/tika-config-tflow-video-rest.xml index f8037a7..c6c50ed 100644 — a/tika-parsers/src/test/resources/org/apache/tika/parser/recognition/tika-config-tflow-video-rest.xml +++ b/tika-parsers/src/test/resources/org/apache/tika/parser/recognition/tika-config-tflow-video-rest.xml @@ -24,8 +24,8 @@ <param name="topN" type="int">4</param> <param name="minConfidence" type="double">0.015</param> <param name="class" type="string">org.apache.tika.parser.recognition.tf.TensorflowRESTVideoRecogniser</param> <param name="healthUri" type="uri"> http://localhost:8764/inception/v3/ping </param> <param name="apiUri" type="uri"> http://localhost:8764/inception/v3/classify/video?mode=fixed </param> + <param name="healthUri" type="uri"> http://localhost:8764/inception/v4/ping </param> + <param name="apiUri" type="uri"> http://localhost:8764/inception/v4/classify/video?mode=fixed </param> </params> </parser> </parsers> ``` Still broke: ``` MC-053601:smadha-tika mattmann$ java -jar tika-app/target/tika-app-1.15-SNAPSHOT.jar --config=tika-parsers/src/test/resources/org/apache/tika/parser/recognition/tika-config-tflow-video-rest.xml ./tika-parsers/src/test/resources/test-documents/testVideoMp4.mp4 WARN JBIG2ImageReader not loaded. jbig2 files will be ignored INFO Available = true, API Status = HTTP/1.0 200 OK INFO minConfidence = 0.015, topN=4 INFO Recogniser = org.apache.tika.parser.recognition.tf.TensorflowRESTVideoRecogniser INFO Recogniser Available = true INFO I/O exception (java.net.SocketException) caught when processing request: Broken pipe INFO Retrying request INFO I/O exception (java.net.SocketException) caught when processing request: Broken pipe INFO Retrying request INFO I/O exception (java.net.SocketException) caught when processing request: Broken pipe INFO Retrying request WARN Broken pipe java.net.SocketException: Broken pipe at java.net.SocketOutputStream.socketWrite0(Native Method) at java.net.SocketOutputStream.socketWrite(SocketOutputStream.java:109) at java.net.SocketOutputStream.write(SocketOutputStream.java:153) at org.apache.http.impl.io.AbstractSessionOutputBuffer.write(AbstractSessionOutputBuffer.java:169) at org.apache.http.impl.io.ContentLengthOutputStream.write(ContentLengthOutputStream.java:119) at org.apache.http.entity.ByteArrayEntity.writeTo(ByteArrayEntity.java:115) at org.apache.http.entity.HttpEntityWrapper.writeTo(HttpEntityWrapper.java:98) at org.apache.http.impl.client.EntityEnclosingRequestWrapper$EntityWrapper.writeTo(EntityEnclosingRequestWrapper.java:108) at org.apache.http.impl.entity.EntitySerializer.serialize(EntitySerializer.java:122) at org.apache.http.impl.AbstractHttpClientConnection.sendRequestEntity(AbstractHttpClientConnection.java:271) at org.apache.http.impl.conn.ManagedClientConnectionImpl.sendRequestEntity(ManagedClientConnectionImpl.java:197) at org.apache.http.protocol.HttpRequestExecutor.doSendRequest(HttpRequestExecutor.java:257) at org.apache.http.protocol.HttpRequestExecutor.execute(HttpRequestExecutor.java:125) at org.apache.http.impl.client.DefaultRequestDirector.tryExecute(DefaultRequestDirector.java:715) at org.apache.http.impl.client.DefaultRequestDirector.execute(DefaultRequestDirector.java:520) at org.apache.http.impl.client.AbstractHttpClient.execute(AbstractHttpClient.java:906) at org.apache.http.impl.client.AbstractHttpClient.execute(AbstractHttpClient.java:805) at org.apache.http.impl.client.AbstractHttpClient.execute(AbstractHttpClient.java:784) at org.apache.tika.parser.recognition.tf.TensorflowRESTRecogniser.recognise(TensorflowRESTRecogniser.java:119) at org.apache.tika.parser.recognition.ObjectRecognitionParser.parse(ObjectRecognitionParser.java:120) at org.apache.tika.parser.ParserDecorator.parse(ParserDecorator.java:188) at org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:280) at org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:280) at org.apache.tika.parser.AutoDetectParser.parse(AutoDetectParser.java:135) at org.apache.tika.cli.TikaCLI$OutputType.process(TikaCLI.java:189) at org.apache.tika.cli.TikaCLI.process(TikaCLI.java:481) at org.apache.tika.cli.TikaCLI.main(TikaCLI.java:143) WARN NO objects LMC-053601:smadha-tika mattmann$ ``` ---------------------------------------------------------------- This is an automated message from the Apache Git Service. To respond to the message, please log on GitHub and use the URL above to go to the specific comment. For queries about this service, please contact Infrastructure at: users@infra.apache.org
        Hide
        githubbot ASF GitHub Bot added a comment -

        chrismattmann commented on issue #168: fix for TIKA-2322 contributed by msharan@usc.edu
        URL: https://github.com/apache/tika/pull/168#issuecomment-297614197

        OK so found the next problem:

        ```
        "POST /inception/v3/classify/video?mode=fixed&ext=.mp4 HTTP/1.1" 404 -
        ```
        It's looking for a v3 video method on POST. I think it's set up to use GET.

        ----------------------------------------------------------------
        This is an automated message from the Apache Git Service.
        To respond to the message, please log on GitHub and use the
        URL above to go to the specific comment.

        For queries about this service, please contact Infrastructure at:
        users@infra.apache.org

        Show
        githubbot ASF GitHub Bot added a comment - chrismattmann commented on issue #168: fix for TIKA-2322 contributed by msharan@usc.edu URL: https://github.com/apache/tika/pull/168#issuecomment-297614197 OK so found the next problem: ``` "POST /inception/v3/classify/video?mode=fixed&ext=.mp4 HTTP/1.1" 404 - ``` It's looking for a v3 video method on POST. I think it's set up to use GET. ---------------------------------------------------------------- This is an automated message from the Apache Git Service. To respond to the message, please log on GitHub and use the URL above to go to the specific comment. For queries about this service, please contact Infrastructure at: users@infra.apache.org
        Hide
        githubbot ASF GitHub Bot added a comment -

        thammegowda commented on issue #168: fix for TIKA-2322 contributed by msharan@usc.edu
        URL: https://github.com/apache/tika/pull/168#issuecomment-297614466

        > POST /inception/v3/classify/video?mode=fixed&ext=.mp4 HTTP/1.1"

        I think it should be going to `/inception/v4/classify/video?mode=fixed&ext=.mp4 `

        ----------------------------------------------------------------
        This is an automated message from the Apache Git Service.
        To respond to the message, please log on GitHub and use the
        URL above to go to the specific comment.

        For queries about this service, please contact Infrastructure at:
        users@infra.apache.org

        Show
        githubbot ASF GitHub Bot added a comment - thammegowda commented on issue #168: fix for TIKA-2322 contributed by msharan@usc.edu URL: https://github.com/apache/tika/pull/168#issuecomment-297614466 > POST /inception/v3/classify/video?mode=fixed&ext=.mp4 HTTP/1.1" I think it should be going to `/inception/v4/classify/video?mode=fixed&ext=.mp4 ` ---------------------------------------------------------------- This is an automated message from the Apache Git Service. To respond to the message, please log on GitHub and use the URL above to go to the specific comment. For queries about this service, please contact Infrastructure at: users@infra.apache.org
        Hide
        githubbot ASF GitHub Bot added a comment -

        chrismattmann commented on issue #168: fix for TIKA-2322 contributed by msharan@usc.edu
        URL: https://github.com/apache/tika/pull/168#issuecomment-297616815

        @smadha so I believe the error is that the Docker I am using for InceptionDockerRest doesn't include OpenCV. I found [this](http://stackoverflow.com/questions/36862589/install-opencv-in-a-docker-container) that you pointed me to that talks about installing Anaconda/OpenCV. Can you suggest changes to the Docker file that make it work? cc / @thammegowda once that happens I can move forward and test.

        ----------------------------------------------------------------
        This is an automated message from the Apache Git Service.
        To respond to the message, please log on GitHub and use the
        URL above to go to the specific comment.

        For queries about this service, please contact Infrastructure at:
        users@infra.apache.org

        Show
        githubbot ASF GitHub Bot added a comment - chrismattmann commented on issue #168: fix for TIKA-2322 contributed by msharan@usc.edu URL: https://github.com/apache/tika/pull/168#issuecomment-297616815 @smadha so I believe the error is that the Docker I am using for InceptionDockerRest doesn't include OpenCV. I found [this] ( http://stackoverflow.com/questions/36862589/install-opencv-in-a-docker-container ) that you pointed me to that talks about installing Anaconda/OpenCV. Can you suggest changes to the Docker file that make it work? cc / @thammegowda once that happens I can move forward and test. ---------------------------------------------------------------- This is an automated message from the Apache Git Service. To respond to the message, please log on GitHub and use the URL above to go to the specific comment. For queries about this service, please contact Infrastructure at: users@infra.apache.org
        Hide
        githubbot ASF GitHub Bot added a comment -

        thammegowda commented on issue #168: fix for TIKA-2322 contributed by msharan@usc.edu
        URL: https://github.com/apache/tika/pull/168#issuecomment-297619073

        I tried to test it too.. Couldn't get it working.
        @smadha Let's test it properly tomorrow with docker and inceptionv4.
        A lot has been changed in the last 2 weeks and hence this happened

        ----------------------------------------------------------------
        This is an automated message from the Apache Git Service.
        To respond to the message, please log on GitHub and use the
        URL above to go to the specific comment.

        For queries about this service, please contact Infrastructure at:
        users@infra.apache.org

        Show
        githubbot ASF GitHub Bot added a comment - thammegowda commented on issue #168: fix for TIKA-2322 contributed by msharan@usc.edu URL: https://github.com/apache/tika/pull/168#issuecomment-297619073 I tried to test it too.. Couldn't get it working. @smadha Let's test it properly tomorrow with docker and inceptionv4. A lot has been changed in the last 2 weeks and hence this happened ---------------------------------------------------------------- This is an automated message from the Apache Git Service. To respond to the message, please log on GitHub and use the URL above to go to the specific comment. For queries about this service, please contact Infrastructure at: users@infra.apache.org
        Hide
        githubbot ASF GitHub Bot added a comment -

        thammegowda commented on issue #168: fix for TIKA-2322 contributed by msharan@usc.edu
        URL: https://github.com/apache/tika/pull/168#issuecomment-297619325

        @chrismattmann
        I tried to edit the wiki page, but unfortunately, I can't edit it!
        The wiki page is shown as "Immutable page" and hence no edit link is available for my account.

        ----------------------------------------------------------------
        This is an automated message from the Apache Git Service.
        To respond to the message, please log on GitHub and use the
        URL above to go to the specific comment.

        For queries about this service, please contact Infrastructure at:
        users@infra.apache.org

        Show
        githubbot ASF GitHub Bot added a comment - thammegowda commented on issue #168: fix for TIKA-2322 contributed by msharan@usc.edu URL: https://github.com/apache/tika/pull/168#issuecomment-297619325 @chrismattmann I tried to edit the wiki page, but unfortunately, I can't edit it! The wiki page is shown as "Immutable page" and hence no edit link is available for my account. ---------------------------------------------------------------- This is an automated message from the Apache Git Service. To respond to the message, please log on GitHub and use the URL above to go to the specific comment. For queries about this service, please contact Infrastructure at: users@infra.apache.org
        Hide
        githubbot ASF GitHub Bot added a comment -

        smadha commented on issue #168: fix for TIKA-2322 contributed by msharan@usc.edu
        URL: https://github.com/apache/tika/pull/168#issuecomment-297629474

        Hey guys - Yeah I fixed OpenCV and docker. It's still failing with some other issue occurring after changes done for IncV4, I'll fix it and test. Thanks for your help on this

        ----------------------------------------------------------------
        This is an automated message from the Apache Git Service.
        To respond to the message, please log on GitHub and use the
        URL above to go to the specific comment.

        For queries about this service, please contact Infrastructure at:
        users@infra.apache.org

        Show
        githubbot ASF GitHub Bot added a comment - smadha commented on issue #168: fix for TIKA-2322 contributed by msharan@usc.edu URL: https://github.com/apache/tika/pull/168#issuecomment-297629474 Hey guys - Yeah I fixed OpenCV and docker. It's still failing with some other issue occurring after changes done for IncV4, I'll fix it and test. Thanks for your help on this ---------------------------------------------------------------- This is an automated message from the Apache Git Service. To respond to the message, please log on GitHub and use the URL above to go to the specific comment. For queries about this service, please contact Infrastructure at: users@infra.apache.org
        Hide
        githubbot ASF GitHub Bot added a comment -

        KranthiGV commented on issue #168: fix for TIKA-2322 contributed by msharan@usc.edu
        URL: https://github.com/apache/tika/pull/168#issuecomment-297656791

        @smadha
        I suggest you use continuumio/miniconda instead of Anaconda.
        Anaconda comes with a lot of packages that aren't useful for us.
        We need only their package manager.

        Replace `continuumio/anaconda` with `continuumio/miniconda` in the docker file.

        ----------------------------------------------------------------
        This is an automated message from the Apache Git Service.
        To respond to the message, please log on GitHub and use the
        URL above to go to the specific comment.

        For queries about this service, please contact Infrastructure at:
        users@infra.apache.org

        Show
        githubbot ASF GitHub Bot added a comment - KranthiGV commented on issue #168: fix for TIKA-2322 contributed by msharan@usc.edu URL: https://github.com/apache/tika/pull/168#issuecomment-297656791 @smadha I suggest you use continuumio/miniconda instead of Anaconda. Anaconda comes with a lot of packages that aren't useful for us. We need only their package manager. Replace `continuumio/anaconda` with `continuumio/miniconda` in the docker file. ---------------------------------------------------------------- This is an automated message from the Apache Git Service. To respond to the message, please log on GitHub and use the URL above to go to the specific comment. For queries about this service, please contact Infrastructure at: users@infra.apache.org
        Hide
        githubbot ASF GitHub Bot added a comment -

        KranthiGV commented on a change in pull request #168: fix for TIKA-2322 contributed by msharan@usc.edu
        URL: https://github.com/apache/tika/pull/168#discussion_r113648625

        ##########
        File path: tika-parsers/src/main/resources/org/apache/tika/parser/recognition/tf/InceptionVideoRestDockerfile
        ##########
        @@ -0,0 +1,58 @@
        +#
        +# Licensed to the Apache Software Foundation (ASF) under one or more
        +# contributor license agreements. See the NOTICE file distributed with
        +# this work for additional information regarding copyright ownership.
        +# The ASF licenses this file to You under the Apache License, Version 2.0
        +# (the "License"); you may not use this file except in compliance with
        +# the License. You may obtain a copy of the License at
        +#
        +# http://www.apache.org/licenses/LICENSE-2.0
        +#
        +# Unless required by applicable law or agreed to in writing, software
        +# distributed under the License is distributed on an "AS IS" BASIS,
        +# WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
        +# See the License for the specific language governing permissions and
        +# limitations under the License.
        +#
        +FROM ubuntu
        +
        +MAINTAINER Thamme Gowda <thammegowda@apache.org> and Kranthi Kiran GV <kkranthi@student.nitw.ac.in>
        +
        +# Install anaconda
        +FROM continuumio/anaconda
        +EXPOSE 5000

        Review comment:
        I wonder why port 5000 is exposed.

        ----------------------------------------------------------------
        This is an automated message from the Apache Git Service.
        To respond to the message, please log on GitHub and use the
        URL above to go to the specific comment.

        For queries about this service, please contact Infrastructure at:
        users@infra.apache.org

        Show
        githubbot ASF GitHub Bot added a comment - KranthiGV commented on a change in pull request #168: fix for TIKA-2322 contributed by msharan@usc.edu URL: https://github.com/apache/tika/pull/168#discussion_r113648625 ########## File path: tika-parsers/src/main/resources/org/apache/tika/parser/recognition/tf/InceptionVideoRestDockerfile ########## @@ -0,0 +1,58 @@ +# +# Licensed to the Apache Software Foundation (ASF) under one or more +# contributor license agreements. See the NOTICE file distributed with +# this work for additional information regarding copyright ownership. +# The ASF licenses this file to You under the Apache License, Version 2.0 +# (the "License"); you may not use this file except in compliance with +# the License. You may obtain a copy of the License at +# +# http://www.apache.org/licenses/LICENSE-2.0 +# +# Unless required by applicable law or agreed to in writing, software +# distributed under the License is distributed on an "AS IS" BASIS, +# WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied. +# See the License for the specific language governing permissions and +# limitations under the License. +# +FROM ubuntu + +MAINTAINER Thamme Gowda <thammegowda@apache.org> and Kranthi Kiran GV <kkranthi@student.nitw.ac.in> + +# Install anaconda +FROM continuumio/anaconda +EXPOSE 5000 Review comment: I wonder why port 5000 is exposed. ---------------------------------------------------------------- This is an automated message from the Apache Git Service. To respond to the message, please log on GitHub and use the URL above to go to the specific comment. For queries about this service, please contact Infrastructure at: users@infra.apache.org
        Hide
        githubbot ASF GitHub Bot added a comment -

        KranthiGV commented on a change in pull request #168: fix for TIKA-2322 contributed by msharan@usc.edu
        URL: https://github.com/apache/tika/pull/168#discussion_r113648562

        ##########
        File path: tika-parsers/src/main/resources/org/apache/tika/parser/recognition/tf/InceptionVideoRestDockerfile
        ##########
        @@ -0,0 +1,58 @@
        +#
        +# Licensed to the Apache Software Foundation (ASF) under one or more
        +# contributor license agreements. See the NOTICE file distributed with
        +# this work for additional information regarding copyright ownership.
        +# The ASF licenses this file to You under the Apache License, Version 2.0
        +# (the "License"); you may not use this file except in compliance with
        +# the License. You may obtain a copy of the License at
        +#
        +# http://www.apache.org/licenses/LICENSE-2.0
        +#
        +# Unless required by applicable law or agreed to in writing, software
        +# distributed under the License is distributed on an "AS IS" BASIS,
        +# WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
        +# See the License for the specific language governing permissions and
        +# limitations under the License.
        +#
        +FROM ubuntu
        +
        +MAINTAINER Thamme Gowda <thammegowda@apache.org> and Kranthi Kiran GV <kkranthi@student.nitw.ac.in>
        +
        +# Install anaconda
        +FROM continuumio/anaconda

        Review comment:
        @smadha
        I suggest you use continuumio/miniconda instead of Anaconda.
        Anaconda comes with a lot of packages that aren't useful for us.
        We need only their package manager.

        Replace `continuumio/anaconda` with `continuumio/miniconda` in the docker file.

        ----------------------------------------------------------------
        This is an automated message from the Apache Git Service.
        To respond to the message, please log on GitHub and use the
        URL above to go to the specific comment.

        For queries about this service, please contact Infrastructure at:
        users@infra.apache.org

        Show
        githubbot ASF GitHub Bot added a comment - KranthiGV commented on a change in pull request #168: fix for TIKA-2322 contributed by msharan@usc.edu URL: https://github.com/apache/tika/pull/168#discussion_r113648562 ########## File path: tika-parsers/src/main/resources/org/apache/tika/parser/recognition/tf/InceptionVideoRestDockerfile ########## @@ -0,0 +1,58 @@ +# +# Licensed to the Apache Software Foundation (ASF) under one or more +# contributor license agreements. See the NOTICE file distributed with +# this work for additional information regarding copyright ownership. +# The ASF licenses this file to You under the Apache License, Version 2.0 +# (the "License"); you may not use this file except in compliance with +# the License. You may obtain a copy of the License at +# +# http://www.apache.org/licenses/LICENSE-2.0 +# +# Unless required by applicable law or agreed to in writing, software +# distributed under the License is distributed on an "AS IS" BASIS, +# WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied. +# See the License for the specific language governing permissions and +# limitations under the License. +# +FROM ubuntu + +MAINTAINER Thamme Gowda <thammegowda@apache.org> and Kranthi Kiran GV <kkranthi@student.nitw.ac.in> + +# Install anaconda +FROM continuumio/anaconda Review comment: @smadha I suggest you use continuumio/miniconda instead of Anaconda. Anaconda comes with a lot of packages that aren't useful for us. We need only their package manager. Replace `continuumio/anaconda` with `continuumio/miniconda` in the docker file. ---------------------------------------------------------------- This is an automated message from the Apache Git Service. To respond to the message, please log on GitHub and use the URL above to go to the specific comment. For queries about this service, please contact Infrastructure at: users@infra.apache.org
        Hide
        githubbot ASF GitHub Bot added a comment -

        KranthiGV commented on issue #168: fix for TIKA-2322 contributed by msharan@usc.edu
        URL: https://github.com/apache/tika/pull/168#issuecomment-297661252

        @thammegowda

        > I tried to edit the wiki page, but unfortunately, I can't edit it!
        > The wiki page is shown as "Immutable page" and hence no edit link is available for my account.

        I am able to modify the wiki. I'll make the necessary changes:
        1) Document the tensorflow/models issues.
        2) Docker issues on mac.

        ----------------------------------------------------------------
        This is an automated message from the Apache Git Service.
        To respond to the message, please log on GitHub and use the
        URL above to go to the specific comment.

        For queries about this service, please contact Infrastructure at:
        users@infra.apache.org

        Show
        githubbot ASF GitHub Bot added a comment - KranthiGV commented on issue #168: fix for TIKA-2322 contributed by msharan@usc.edu URL: https://github.com/apache/tika/pull/168#issuecomment-297661252 @thammegowda > I tried to edit the wiki page, but unfortunately, I can't edit it! > The wiki page is shown as "Immutable page" and hence no edit link is available for my account. I am able to modify the wiki. I'll make the necessary changes: 1) Document the tensorflow/models issues. 2) Docker issues on mac. ---------------------------------------------------------------- This is an automated message from the Apache Git Service. To respond to the message, please log on GitHub and use the URL above to go to the specific comment. For queries about this service, please contact Infrastructure at: users@infra.apache.org
        Hide
        githubbot ASF GitHub Bot added a comment -

        KranthiGV commented on issue #168: fix for TIKA-2322 contributed by msharan@usc.edu
        URL: https://github.com/apache/tika/pull/168#issuecomment-297661252

        @thammegowda

        > I tried to edit the wiki page, but unfortunately, I can't edit it!
        > The wiki page is shown as "Immutable page" and hence no edit link is available for my account.

        I am able to modify the wiki. I'll make the necessary changes:
        1) Document the tensorflow/models issues.
        2) Docker issues on mac.

        ----------------------------------------------------------------
        This is an automated message from the Apache Git Service.
        To respond to the message, please log on GitHub and use the
        URL above to go to the specific comment.

        For queries about this service, please contact Infrastructure at:
        users@infra.apache.org

        Show
        githubbot ASF GitHub Bot added a comment - KranthiGV commented on issue #168: fix for TIKA-2322 contributed by msharan@usc.edu URL: https://github.com/apache/tika/pull/168#issuecomment-297661252 @thammegowda > I tried to edit the wiki page, but unfortunately, I can't edit it! > The wiki page is shown as "Immutable page" and hence no edit link is available for my account. I am able to modify the wiki. I'll make the necessary changes: 1) Document the tensorflow/models issues. 2) Docker issues on mac. ---------------------------------------------------------------- This is an automated message from the Apache Git Service. To respond to the message, please log on GitHub and use the URL above to go to the specific comment. For queries about this service, please contact Infrastructure at: users@infra.apache.org
        Hide
        tallison@mitre.org Tim Allison added a comment -

        Should I start the regression run for 1.15? Or is there more to do to Tika? Thank you!

        Show
        tallison@mitre.org Tim Allison added a comment - Should I start the regression run for 1.15? Or is there more to do to Tika? Thank you!
        Hide
        githubbot ASF GitHub Bot added a comment -

        chrismattmann commented on issue #168: fix for TIKA-2322 contributed by msharan@usc.edu
        URL: https://github.com/apache/tika/pull/168#issuecomment-297701510

        thanks @KranthiGV and @smadha - not ready to test yet, right?

        ----------------------------------------------------------------
        This is an automated message from the Apache Git Service.
        To respond to the message, please log on GitHub and use the
        URL above to go to the specific comment.

        For queries about this service, please contact Infrastructure at:
        users@infra.apache.org

        Show
        githubbot ASF GitHub Bot added a comment - chrismattmann commented on issue #168: fix for TIKA-2322 contributed by msharan@usc.edu URL: https://github.com/apache/tika/pull/168#issuecomment-297701510 thanks @KranthiGV and @smadha - not ready to test yet, right? ---------------------------------------------------------------- This is an automated message from the Apache Git Service. To respond to the message, please log on GitHub and use the URL above to go to the specific comment. For queries about this service, please contact Infrastructure at: users@infra.apache.org
        Hide
        githubbot ASF GitHub Bot added a comment -

        chrismattmann commented on a change in pull request #168: fix for TIKA-2322 contributed by msharan@usc.edu
        URL: https://github.com/apache/tika/pull/168#discussion_r113684569

        ##########
        File path: tika-parsers/src/main/resources/org/apache/tika/parser/recognition/tf/inceptionapi.py
        ##########
        @@ -362,6 +398,114 @@ def classify_image():
        res['classnames'] = classnames
        return Response(response=json.dumps(res), status=200, mimetype="application/json")

        +CENTER = "center"
        +INTERVAL = "interval"
        +FIXED = "fixed"
        +
        +ALLOWED_MODE = set([CENTER ,INTERVAL , FIXED])
        +
        +@app.route("/inception/v3/classify/video", methods=["GET", "POST"])

        Review comment:
        @smadha can you please update this to /v4 throughout this file?

        ----------------------------------------------------------------
        This is an automated message from the Apache Git Service.
        To respond to the message, please log on GitHub and use the
        URL above to go to the specific comment.

        For queries about this service, please contact Infrastructure at:
        users@infra.apache.org

        Show
        githubbot ASF GitHub Bot added a comment - chrismattmann commented on a change in pull request #168: fix for TIKA-2322 contributed by msharan@usc.edu URL: https://github.com/apache/tika/pull/168#discussion_r113684569 ########## File path: tika-parsers/src/main/resources/org/apache/tika/parser/recognition/tf/inceptionapi.py ########## @@ -362,6 +398,114 @@ def classify_image(): res ['classnames'] = classnames return Response(response=json.dumps(res), status=200, mimetype="application/json") +CENTER = "center" +INTERVAL = "interval" +FIXED = "fixed" + +ALLOWED_MODE = set( [CENTER ,INTERVAL , FIXED] ) + +@app.route("/inception/v3/classify/video", methods= ["GET", "POST"] ) Review comment: @smadha can you please update this to /v4 throughout this file? ---------------------------------------------------------------- This is an automated message from the Apache Git Service. To respond to the message, please log on GitHub and use the URL above to go to the specific comment. For queries about this service, please contact Infrastructure at: users@infra.apache.org
        Hide
        githubbot ASF GitHub Bot added a comment -

        smadha commented on a change in pull request #168: fix for TIKA-2322 contributed by msharan@usc.edu
        URL: https://github.com/apache/tika/pull/168#discussion_r113734528

        ##########
        File path: tika-parsers/src/main/resources/org/apache/tika/parser/recognition/tf/InceptionVideoRestDockerfile
        ##########
        @@ -0,0 +1,58 @@
        +#
        +# Licensed to the Apache Software Foundation (ASF) under one or more
        +# contributor license agreements. See the NOTICE file distributed with
        +# this work for additional information regarding copyright ownership.
        +# The ASF licenses this file to You under the Apache License, Version 2.0
        +# (the "License"); you may not use this file except in compliance with
        +# the License. You may obtain a copy of the License at
        +#
        +# http://www.apache.org/licenses/LICENSE-2.0
        +#
        +# Unless required by applicable law or agreed to in writing, software
        +# distributed under the License is distributed on an "AS IS" BASIS,
        +# WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
        +# See the License for the specific language governing permissions and
        +# limitations under the License.
        +#
        +FROM ubuntu
        +
        +MAINTAINER Thamme Gowda <thammegowda@apache.org> and Kranthi Kiran GV <kkranthi@student.nitw.ac.in>
        +
        +# Install anaconda
        +FROM continuumio/anaconda

        Review comment:
        Have you used miniconda to install OpenCV? I used it once earlier and I was unable to do so. I agree miniconda is more convenient if it's possible I will change it

        ----------------------------------------------------------------
        This is an automated message from the Apache Git Service.
        To respond to the message, please log on GitHub and use the
        URL above to go to the specific comment.

        For queries about this service, please contact Infrastructure at:
        users@infra.apache.org

        Show
        githubbot ASF GitHub Bot added a comment - smadha commented on a change in pull request #168: fix for TIKA-2322 contributed by msharan@usc.edu URL: https://github.com/apache/tika/pull/168#discussion_r113734528 ########## File path: tika-parsers/src/main/resources/org/apache/tika/parser/recognition/tf/InceptionVideoRestDockerfile ########## @@ -0,0 +1,58 @@ +# +# Licensed to the Apache Software Foundation (ASF) under one or more +# contributor license agreements. See the NOTICE file distributed with +# this work for additional information regarding copyright ownership. +# The ASF licenses this file to You under the Apache License, Version 2.0 +# (the "License"); you may not use this file except in compliance with +# the License. You may obtain a copy of the License at +# +# http://www.apache.org/licenses/LICENSE-2.0 +# +# Unless required by applicable law or agreed to in writing, software +# distributed under the License is distributed on an "AS IS" BASIS, +# WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied. +# See the License for the specific language governing permissions and +# limitations under the License. +# +FROM ubuntu + +MAINTAINER Thamme Gowda <thammegowda@apache.org> and Kranthi Kiran GV <kkranthi@student.nitw.ac.in> + +# Install anaconda +FROM continuumio/anaconda Review comment: Have you used miniconda to install OpenCV? I used it once earlier and I was unable to do so. I agree miniconda is more convenient if it's possible I will change it ---------------------------------------------------------------- This is an automated message from the Apache Git Service. To respond to the message, please log on GitHub and use the URL above to go to the specific comment. For queries about this service, please contact Infrastructure at: users@infra.apache.org
        Hide
        githubbot ASF GitHub Bot added a comment -

        thammegowda commented on a change in pull request #168: fix for TIKA-2322 contributed by msharan@usc.edu
        URL: https://github.com/apache/tika/pull/168#discussion_r113748430

        ##########
        File path: tika-parsers/src/main/resources/org/apache/tika/parser/recognition/tf/InceptionVideoRestDockerfile
        ##########
        @@ -0,0 +1,58 @@
        +#
        +# Licensed to the Apache Software Foundation (ASF) under one or more
        +# contributor license agreements. See the NOTICE file distributed with
        +# this work for additional information regarding copyright ownership.
        +# The ASF licenses this file to You under the Apache License, Version 2.0
        +# (the "License"); you may not use this file except in compliance with
        +# the License. You may obtain a copy of the License at
        +#
        +# http://www.apache.org/licenses/LICENSE-2.0
        +#
        +# Unless required by applicable law or agreed to in writing, software
        +# distributed under the License is distributed on an "AS IS" BASIS,
        +# WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
        +# See the License for the specific language governing permissions and
        +# limitations under the License.
        +#
        +FROM ubuntu
        +
        +MAINTAINER Thamme Gowda <thammegowda@apache.org> and Kranthi Kiran GV <kkranthi@student.nitw.ac.in>
        +
        +# Install anaconda
        +FROM continuumio/anaconda

        Review comment:
        @smadha What issues with simple `pip install` for that package?

        ----------------------------------------------------------------
        This is an automated message from the Apache Git Service.
        To respond to the message, please log on GitHub and use the
        URL above to go to the specific comment.

        For queries about this service, please contact Infrastructure at:
        users@infra.apache.org

        Show
        githubbot ASF GitHub Bot added a comment - thammegowda commented on a change in pull request #168: fix for TIKA-2322 contributed by msharan@usc.edu URL: https://github.com/apache/tika/pull/168#discussion_r113748430 ########## File path: tika-parsers/src/main/resources/org/apache/tika/parser/recognition/tf/InceptionVideoRestDockerfile ########## @@ -0,0 +1,58 @@ +# +# Licensed to the Apache Software Foundation (ASF) under one or more +# contributor license agreements. See the NOTICE file distributed with +# this work for additional information regarding copyright ownership. +# The ASF licenses this file to You under the Apache License, Version 2.0 +# (the "License"); you may not use this file except in compliance with +# the License. You may obtain a copy of the License at +# +# http://www.apache.org/licenses/LICENSE-2.0 +# +# Unless required by applicable law or agreed to in writing, software +# distributed under the License is distributed on an "AS IS" BASIS, +# WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied. +# See the License for the specific language governing permissions and +# limitations under the License. +# +FROM ubuntu + +MAINTAINER Thamme Gowda <thammegowda@apache.org> and Kranthi Kiran GV <kkranthi@student.nitw.ac.in> + +# Install anaconda +FROM continuumio/anaconda Review comment: @smadha What issues with simple `pip install` for that package? ---------------------------------------------------------------- This is an automated message from the Apache Git Service. To respond to the message, please log on GitHub and use the URL above to go to the specific comment. For queries about this service, please contact Infrastructure at: users@infra.apache.org
        Hide
        githubbot ASF GitHub Bot added a comment -

        smadha commented on a change in pull request #168: fix for TIKA-2322 contributed by msharan@usc.edu
        URL: https://github.com/apache/tika/pull/168#discussion_r113749219

        ##########
        File path: tika-parsers/src/main/resources/org/apache/tika/parser/recognition/tf/InceptionVideoRestDockerfile
        ##########
        @@ -0,0 +1,58 @@
        +#
        +# Licensed to the Apache Software Foundation (ASF) under one or more
        +# contributor license agreements. See the NOTICE file distributed with
        +# this work for additional information regarding copyright ownership.
        +# The ASF licenses this file to You under the Apache License, Version 2.0
        +# (the "License"); you may not use this file except in compliance with
        +# the License. You may obtain a copy of the License at
        +#
        +# http://www.apache.org/licenses/LICENSE-2.0
        +#
        +# Unless required by applicable law or agreed to in writing, software
        +# distributed under the License is distributed on an "AS IS" BASIS,
        +# WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
        +# See the License for the specific language governing permissions and
        +# limitations under the License.
        +#
        +FROM ubuntu
        +
        +MAINTAINER Thamme Gowda <thammegowda@apache.org> and Kranthi Kiran GV <kkranthi@student.nitw.ac.in>
        +
        +# Install anaconda
        +FROM continuumio/anaconda

        Review comment:
        OpenCV requires binaries too, pip install is too simple unfortunately

        read this for more info - http://stackoverflow.com/questions/36862589/install-opencv-in-a-docker-container

        ----------------------------------------------------------------
        This is an automated message from the Apache Git Service.
        To respond to the message, please log on GitHub and use the
        URL above to go to the specific comment.

        For queries about this service, please contact Infrastructure at:
        users@infra.apache.org

        Show
        githubbot ASF GitHub Bot added a comment - smadha commented on a change in pull request #168: fix for TIKA-2322 contributed by msharan@usc.edu URL: https://github.com/apache/tika/pull/168#discussion_r113749219 ########## File path: tika-parsers/src/main/resources/org/apache/tika/parser/recognition/tf/InceptionVideoRestDockerfile ########## @@ -0,0 +1,58 @@ +# +# Licensed to the Apache Software Foundation (ASF) under one or more +# contributor license agreements. See the NOTICE file distributed with +# this work for additional information regarding copyright ownership. +# The ASF licenses this file to You under the Apache License, Version 2.0 +# (the "License"); you may not use this file except in compliance with +# the License. You may obtain a copy of the License at +# +# http://www.apache.org/licenses/LICENSE-2.0 +# +# Unless required by applicable law or agreed to in writing, software +# distributed under the License is distributed on an "AS IS" BASIS, +# WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied. +# See the License for the specific language governing permissions and +# limitations under the License. +# +FROM ubuntu + +MAINTAINER Thamme Gowda <thammegowda@apache.org> and Kranthi Kiran GV <kkranthi@student.nitw.ac.in> + +# Install anaconda +FROM continuumio/anaconda Review comment: OpenCV requires binaries too, pip install is too simple unfortunately read this for more info - http://stackoverflow.com/questions/36862589/install-opencv-in-a-docker-container ---------------------------------------------------------------- This is an automated message from the Apache Git Service. To respond to the message, please log on GitHub and use the URL above to go to the specific comment. For queries about this service, please contact Infrastructure at: users@infra.apache.org
        Hide
        githubbot ASF GitHub Bot added a comment -

        thammegowda commented on a change in pull request #168: fix for TIKA-2322 contributed by msharan@usc.edu
        URL: https://github.com/apache/tika/pull/168#discussion_r113750059

        ##########
        File path: tika-parsers/src/main/resources/org/apache/tika/parser/recognition/tf/InceptionVideoRestDockerfile
        ##########
        @@ -0,0 +1,58 @@
        +#
        +# Licensed to the Apache Software Foundation (ASF) under one or more
        +# contributor license agreements. See the NOTICE file distributed with
        +# this work for additional information regarding copyright ownership.
        +# The ASF licenses this file to You under the Apache License, Version 2.0
        +# (the "License"); you may not use this file except in compliance with
        +# the License. You may obtain a copy of the License at
        +#
        +# http://www.apache.org/licenses/LICENSE-2.0
        +#
        +# Unless required by applicable law or agreed to in writing, software
        +# distributed under the License is distributed on an "AS IS" BASIS,
        +# WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
        +# See the License for the specific language governing permissions and
        +# limitations under the License.
        +#
        +FROM ubuntu
        +
        +MAINTAINER Thamme Gowda <thammegowda@apache.org> and Kranthi Kiran GV <kkranthi@student.nitw.ac.in>
        +
        +# Install anaconda
        +FROM continuumio/anaconda

        Review comment:
        makes sense! Thanks for the clarification.

        ----------------------------------------------------------------
        This is an automated message from the Apache Git Service.
        To respond to the message, please log on GitHub and use the
        URL above to go to the specific comment.

        For queries about this service, please contact Infrastructure at:
        users@infra.apache.org

        Show
        githubbot ASF GitHub Bot added a comment - thammegowda commented on a change in pull request #168: fix for TIKA-2322 contributed by msharan@usc.edu URL: https://github.com/apache/tika/pull/168#discussion_r113750059 ########## File path: tika-parsers/src/main/resources/org/apache/tika/parser/recognition/tf/InceptionVideoRestDockerfile ########## @@ -0,0 +1,58 @@ +# +# Licensed to the Apache Software Foundation (ASF) under one or more +# contributor license agreements. See the NOTICE file distributed with +# this work for additional information regarding copyright ownership. +# The ASF licenses this file to You under the Apache License, Version 2.0 +# (the "License"); you may not use this file except in compliance with +# the License. You may obtain a copy of the License at +# +# http://www.apache.org/licenses/LICENSE-2.0 +# +# Unless required by applicable law or agreed to in writing, software +# distributed under the License is distributed on an "AS IS" BASIS, +# WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied. +# See the License for the specific language governing permissions and +# limitations under the License. +# +FROM ubuntu + +MAINTAINER Thamme Gowda <thammegowda@apache.org> and Kranthi Kiran GV <kkranthi@student.nitw.ac.in> + +# Install anaconda +FROM continuumio/anaconda Review comment: makes sense! Thanks for the clarification. ---------------------------------------------------------------- This is an automated message from the Apache Git Service. To respond to the message, please log on GitHub and use the URL above to go to the specific comment. For queries about this service, please contact Infrastructure at: users@infra.apache.org
        Hide
        githubbot ASF GitHub Bot added a comment -

        chrismattmann commented on a change in pull request #168: fix for TIKA-2322 contributed by msharan@usc.edu
        URL: https://github.com/apache/tika/pull/168#discussion_r113751108

        ##########
        File path: tika-parsers/src/main/java/org/apache/tika/parser/recognition/tf/TensorflowRESTVideoRecogniser.java
        ##########
        @@ -0,0 +1,79 @@
        +/*
        + * Licensed to the Apache Software Foundation (ASF) under one or more
        + * contributor license agreements. See the NOTICE file distributed with
        + * this work for additional information regarding copyright ownership.
        + * The ASF licenses this file to You under the Apache License, Version 2.0
        + * (the "License"); you may not use this file except in compliance with
        + * the License. You may obtain a copy of the License at
        + *
        + * http://www.apache.org/licenses/LICENSE-2.0
        + *
        + * Unless required by applicable law or agreed to in writing, software
        + * distributed under the License is distributed on an "AS IS" BASIS,
        + * WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
        + * See the License for the specific language governing permissions and
        + * limitations under the License.
        + */
        +
        +package org.apache.tika.parser.recognition.tf;
        +
        +import java.io.IOException;
        +import java.io.InputStream;
        +import java.net.URI;
        +import java.util.Collections;
        +import java.util.Set;
        +
        +import javax.ws.rs.core.UriBuilder;
        +
        +import org.apache.tika.Tika;
        +import org.apache.tika.config.Field;
        +import org.apache.tika.config.TikaConfig;
        +import org.apache.tika.metadata.Metadata;
        +import org.apache.tika.mime.MediaType;
        +import org.apache.tika.mime.MimeType;
        +import org.apache.tika.mime.MimeTypeException;
        +import org.slf4j.Logger;
        +import org.slf4j.LoggerFactory;
        +
        +/**
        + * Tensor Flow video recogniser which has high performance.
        + * This implementation uses Tensorflow via REST API.
        + * <p>
        + * NOTE : //TODO: link to wiki page here
        + *
        + * @since Apache Tika 1.15
        + */
        +public class TensorflowRESTVideoRecogniser extends TensorflowRESTRecogniser{
        +
        + private static final Logger LOG = LoggerFactory.getLogger(TensorflowRESTRecogniser.class);
        +
        + private static final Set<MediaType> SUPPORTED_MIMES = Collections.singleton(MediaType.video("mp4"));;
        +
        + @Field
        + private URI apiUri = URI.create("http://localhost:8764/inception/v3/classify/video?topk=10");

        Review comment:
        shouldn't this read v4?

        ----------------------------------------------------------------
        This is an automated message from the Apache Git Service.
        To respond to the message, please log on GitHub and use the
        URL above to go to the specific comment.

        For queries about this service, please contact Infrastructure at:
        users@infra.apache.org

        Show
        githubbot ASF GitHub Bot added a comment - chrismattmann commented on a change in pull request #168: fix for TIKA-2322 contributed by msharan@usc.edu URL: https://github.com/apache/tika/pull/168#discussion_r113751108 ########## File path: tika-parsers/src/main/java/org/apache/tika/parser/recognition/tf/TensorflowRESTVideoRecogniser.java ########## @@ -0,0 +1,79 @@ +/* + * Licensed to the Apache Software Foundation (ASF) under one or more + * contributor license agreements. See the NOTICE file distributed with + * this work for additional information regarding copyright ownership. + * The ASF licenses this file to You under the Apache License, Version 2.0 + * (the "License"); you may not use this file except in compliance with + * the License. You may obtain a copy of the License at + * + * http://www.apache.org/licenses/LICENSE-2.0 + * + * Unless required by applicable law or agreed to in writing, software + * distributed under the License is distributed on an "AS IS" BASIS, + * WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied. + * See the License for the specific language governing permissions and + * limitations under the License. + */ + +package org.apache.tika.parser.recognition.tf; + +import java.io.IOException; +import java.io.InputStream; +import java.net.URI; +import java.util.Collections; +import java.util.Set; + +import javax.ws.rs.core.UriBuilder; + +import org.apache.tika.Tika; +import org.apache.tika.config.Field; +import org.apache.tika.config.TikaConfig; +import org.apache.tika.metadata.Metadata; +import org.apache.tika.mime.MediaType; +import org.apache.tika.mime.MimeType; +import org.apache.tika.mime.MimeTypeException; +import org.slf4j.Logger; +import org.slf4j.LoggerFactory; + +/** + * Tensor Flow video recogniser which has high performance. + * This implementation uses Tensorflow via REST API. + * <p> + * NOTE : //TODO: link to wiki page here + * + * @since Apache Tika 1.15 + */ +public class TensorflowRESTVideoRecogniser extends TensorflowRESTRecogniser{ + + private static final Logger LOG = LoggerFactory.getLogger(TensorflowRESTRecogniser.class); + + private static final Set<MediaType> SUPPORTED_MIMES = Collections.singleton(MediaType.video("mp4"));; + + @Field + private URI apiUri = URI.create("http://localhost:8764/inception/v3/classify/video?topk=10"); Review comment: shouldn't this read v4? ---------------------------------------------------------------- This is an automated message from the Apache Git Service. To respond to the message, please log on GitHub and use the URL above to go to the specific comment. For queries about this service, please contact Infrastructure at: users@infra.apache.org
        Hide
        githubbot ASF GitHub Bot added a comment -

        chrismattmann commented on a change in pull request #168: fix for TIKA-2322 contributed by msharan@usc.edu
        URL: https://github.com/apache/tika/pull/168#discussion_r113751276

        ##########
        File path: tika-parsers/src/main/resources/org/apache/tika/parser/recognition/tf/inceptionapi.py
        ##########
        @@ -310,6 +320,32 @@ def index():
        </td></tr>
        </table>
        </li>
        + <li> <code>/inception/v3/classify/video</code> - <br/>
        + <table>
        + <tr><th align="left"> Description </th><td> This is a classifier service that can classify videos</td></tr>
        + <tr><td></td> <td>Query Params : <br/>
        + <code>topk </code>: type = int : top classes to get; default : 10 <br/>
        + <code>human </code>: type = boolean : human readable class names; default : true <br/>
        + <code>mode </code>: options = <code>

        {"center", "interval", "fixed"}

        </code> : Modes of frame extraction; default : center <br/>
        +   <code>"center"</code> - Just one frame in center. <br/>
        +   <code>"interval"</code> - Extracts frames after fixed interval. <br/>
        +   <code>"fixed"</code> - Extract fixed number of frames.<br/>
        + <code>frame-interval </code>: type = int : Interval for frame extraction to be used with INTERVAL mode. If frame_interval=10 then every 10th frame will be extracted; default : 10 <br/>
        + <code>num-frame </code>: type = int : Number of frames to be extracted from video while using FIXED model. If num_frame=10 then 10 frames equally distant from each other will be extracted; default : 10 <br/>
        +
        + </td></tr>
        + <tr><th align="left"> How to supply Video Content </th></tr>
        + <tr><th align="left"> With HTTP GET : </th> <td>
        + Include a query parameter <code>url </code> which is path on file system <br/>
        + Example: <code> curl "localhost:8764/inception/v3/classify/video?url=filesystem/path/to/video"</code><br/>

        Review comment:
        v4

        ----------------------------------------------------------------
        This is an automated message from the Apache Git Service.
        To respond to the message, please log on GitHub and use the
        URL above to go to the specific comment.

        For queries about this service, please contact Infrastructure at:
        users@infra.apache.org

        Show
        githubbot ASF GitHub Bot added a comment - chrismattmann commented on a change in pull request #168: fix for TIKA-2322 contributed by msharan@usc.edu URL: https://github.com/apache/tika/pull/168#discussion_r113751276 ########## File path: tika-parsers/src/main/resources/org/apache/tika/parser/recognition/tf/inceptionapi.py ########## @@ -310,6 +320,32 @@ def index(): </td></tr> </table> </li> + <li> <code>/inception/v3/classify/video</code> - <br/> + <table> + <tr><th align="left"> Description </th><td> This is a classifier service that can classify videos</td></tr> + <tr><td></td> <td>Query Params : <br/> + <code>topk </code>: type = int : top classes to get; default : 10 <br/> + <code>human </code>: type = boolean : human readable class names; default : true <br/> + <code>mode </code>: options = <code> {"center", "interval", "fixed"} </code> : Modes of frame extraction; default : center <br/> +   <code>"center"</code> - Just one frame in center. <br/> +   <code>"interval"</code> - Extracts frames after fixed interval. <br/> +   <code>"fixed"</code> - Extract fixed number of frames.<br/> + <code>frame-interval </code>: type = int : Interval for frame extraction to be used with INTERVAL mode. If frame_interval=10 then every 10th frame will be extracted; default : 10 <br/> + <code>num-frame </code>: type = int : Number of frames to be extracted from video while using FIXED model. If num_frame=10 then 10 frames equally distant from each other will be extracted; default : 10 <br/> + + </td></tr> + <tr><th align="left"> How to supply Video Content </th></tr> + <tr><th align="left"> With HTTP GET : </th> <td> + Include a query parameter <code>url </code> which is path on file system <br/> + Example: <code> curl "localhost:8764/inception/v3/classify/video?url=filesystem/path/to/video"</code><br/> Review comment: v4 ---------------------------------------------------------------- This is an automated message from the Apache Git Service. To respond to the message, please log on GitHub and use the URL above to go to the specific comment. For queries about this service, please contact Infrastructure at: users@infra.apache.org
        Hide
        githubbot ASF GitHub Bot added a comment -

        chrismattmann commented on a change in pull request #168: fix for TIKA-2322 contributed by msharan@usc.edu
        URL: https://github.com/apache/tika/pull/168#discussion_r113751309

        ##########
        File path: tika-parsers/src/main/resources/org/apache/tika/parser/recognition/tf/inceptionapi.py
        ##########
        @@ -310,6 +320,32 @@ def index():
        </td></tr>
        </table>
        </li>
        + <li> <code>/inception/v3/classify/video</code> - <br/>

        Review comment:
        v4

        ----------------------------------------------------------------
        This is an automated message from the Apache Git Service.
        To respond to the message, please log on GitHub and use the
        URL above to go to the specific comment.

        For queries about this service, please contact Infrastructure at:
        users@infra.apache.org

        Show
        githubbot ASF GitHub Bot added a comment - chrismattmann commented on a change in pull request #168: fix for TIKA-2322 contributed by msharan@usc.edu URL: https://github.com/apache/tika/pull/168#discussion_r113751309 ########## File path: tika-parsers/src/main/resources/org/apache/tika/parser/recognition/tf/inceptionapi.py ########## @@ -310,6 +320,32 @@ def index(): </td></tr> </table> </li> + <li> <code>/inception/v3/classify/video</code> - <br/> Review comment: v4 ---------------------------------------------------------------- This is an automated message from the Apache Git Service. To respond to the message, please log on GitHub and use the URL above to go to the specific comment. For queries about this service, please contact Infrastructure at: users@infra.apache.org
        Hide
        githubbot ASF GitHub Bot added a comment -

        chrismattmann commented on a change in pull request #168: fix for TIKA-2322 contributed by msharan@usc.edu
        URL: https://github.com/apache/tika/pull/168#discussion_r113751331

        ##########
        File path: tika-parsers/src/main/resources/org/apache/tika/parser/recognition/tf/inceptionapi.py
        ##########
        @@ -310,6 +320,32 @@ def index():
        </td></tr>
        </table>
        </li>
        + <li> <code>/inception/v3/classify/video</code> - <br/>
        + <table>
        + <tr><th align="left"> Description </th><td> This is a classifier service that can classify videos</td></tr>
        + <tr><td></td> <td>Query Params : <br/>
        + <code>topk </code>: type = int : top classes to get; default : 10 <br/>
        + <code>human </code>: type = boolean : human readable class names; default : true <br/>
        + <code>mode </code>: options = <code>

        {"center", "interval", "fixed"}

        </code> : Modes of frame extraction; default : center <br/>
        +   <code>"center"</code> - Just one frame in center. <br/>
        +   <code>"interval"</code> - Extracts frames after fixed interval. <br/>
        +   <code>"fixed"</code> - Extract fixed number of frames.<br/>
        + <code>frame-interval </code>: type = int : Interval for frame extraction to be used with INTERVAL mode. If frame_interval=10 then every 10th frame will be extracted; default : 10 <br/>
        + <code>num-frame </code>: type = int : Number of frames to be extracted from video while using FIXED model. If num_frame=10 then 10 frames equally distant from each other will be extracted; default : 10 <br/>
        +
        + </td></tr>
        + <tr><th align="left"> How to supply Video Content </th></tr>
        + <tr><th align="left"> With HTTP GET : </th> <td>
        + Include a query parameter <code>url </code> which is path on file system <br/>
        + Example: <code> curl "localhost:8764/inception/v3/classify/video?url=filesystem/path/to/video"</code><br/>
        + </td></tr><br/>
        + <tr><th align="left"> With HTTP POST :</th><td>
        + POST video content as binary data in request body. If video can be decoded by OpenCV it should be fine. It's tested on mp4 and avi on mac <br/>
        + Include a query parameter <code>ext </code>this extension is needed to tell OpenCV which decoder to use, default is ".mp4" </br>
        + Example: <code> curl -X POST "localhost:8764/inception/v3/classify?topk=10&human=false" --data-binary @example.mp4 </code>

        Review comment:
        v4

        ----------------------------------------------------------------
        This is an automated message from the Apache Git Service.
        To respond to the message, please log on GitHub and use the
        URL above to go to the specific comment.

        For queries about this service, please contact Infrastructure at:
        users@infra.apache.org

        Show
        githubbot ASF GitHub Bot added a comment - chrismattmann commented on a change in pull request #168: fix for TIKA-2322 contributed by msharan@usc.edu URL: https://github.com/apache/tika/pull/168#discussion_r113751331 ########## File path: tika-parsers/src/main/resources/org/apache/tika/parser/recognition/tf/inceptionapi.py ########## @@ -310,6 +320,32 @@ def index(): </td></tr> </table> </li> + <li> <code>/inception/v3/classify/video</code> - <br/> + <table> + <tr><th align="left"> Description </th><td> This is a classifier service that can classify videos</td></tr> + <tr><td></td> <td>Query Params : <br/> + <code>topk </code>: type = int : top classes to get; default : 10 <br/> + <code>human </code>: type = boolean : human readable class names; default : true <br/> + <code>mode </code>: options = <code> {"center", "interval", "fixed"} </code> : Modes of frame extraction; default : center <br/> +   <code>"center"</code> - Just one frame in center. <br/> +   <code>"interval"</code> - Extracts frames after fixed interval. <br/> +   <code>"fixed"</code> - Extract fixed number of frames.<br/> + <code>frame-interval </code>: type = int : Interval for frame extraction to be used with INTERVAL mode. If frame_interval=10 then every 10th frame will be extracted; default : 10 <br/> + <code>num-frame </code>: type = int : Number of frames to be extracted from video while using FIXED model. If num_frame=10 then 10 frames equally distant from each other will be extracted; default : 10 <br/> + + </td></tr> + <tr><th align="left"> How to supply Video Content </th></tr> + <tr><th align="left"> With HTTP GET : </th> <td> + Include a query parameter <code>url </code> which is path on file system <br/> + Example: <code> curl "localhost:8764/inception/v3/classify/video?url=filesystem/path/to/video"</code><br/> + </td></tr><br/> + <tr><th align="left"> With HTTP POST :</th><td> + POST video content as binary data in request body. If video can be decoded by OpenCV it should be fine. It's tested on mp4 and avi on mac <br/> + Include a query parameter <code>ext </code>this extension is needed to tell OpenCV which decoder to use, default is ".mp4" </br> + Example: <code> curl -X POST "localhost:8764/inception/v3/classify?topk=10&human=false" --data-binary @example.mp4 </code> Review comment: v4 ---------------------------------------------------------------- This is an automated message from the Apache Git Service. To respond to the message, please log on GitHub and use the URL above to go to the specific comment. For queries about this service, please contact Infrastructure at: users@infra.apache.org
        Hide
        githubbot ASF GitHub Bot added a comment -

        ThejanW commented on a change in pull request #168: fix for TIKA-2322 contributed by msharan@usc.edu
        URL: https://github.com/apache/tika/pull/168#discussion_r113797759

        ##########
        File path: tika-parsers/src/main/resources/org/apache/tika/parser/recognition/tf/InceptionVideoRestDockerfile
        ##########
        @@ -0,0 +1,58 @@
        +#
        +# Licensed to the Apache Software Foundation (ASF) under one or more
        +# contributor license agreements. See the NOTICE file distributed with
        +# this work for additional information regarding copyright ownership.
        +# The ASF licenses this file to You under the Apache License, Version 2.0
        +# (the "License"); you may not use this file except in compliance with
        +# the License. You may obtain a copy of the License at
        +#
        +# http://www.apache.org/licenses/LICENSE-2.0
        +#
        +# Unless required by applicable law or agreed to in writing, software
        +# distributed under the License is distributed on an "AS IS" BASIS,
        +# WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
        +# See the License for the specific language governing permissions and
        +# limitations under the License.
        +#
        +FROM ubuntu
        +
        +MAINTAINER Thamme Gowda <thammegowda@apache.org> and Kranthi Kiran GV <kkranthi@student.nitw.ac.in>
        +
        +# Install anaconda
        +FROM continuumio/anaconda

        Review comment:
        Opencv can be installed with miniconda. There shouldn't be any issue. I've done that multiple times. So to be sure, I built your dockerfile, replacing anaconda with miniconda, I could successfully built it, then I sshed into the docker and checked whether Opencv is installed and it was.

        ----------------------------------------------------------------
        This is an automated message from the Apache Git Service.
        To respond to the message, please log on GitHub and use the
        URL above to go to the specific comment.

        For queries about this service, please contact Infrastructure at:
        users@infra.apache.org

        Show
        githubbot ASF GitHub Bot added a comment - ThejanW commented on a change in pull request #168: fix for TIKA-2322 contributed by msharan@usc.edu URL: https://github.com/apache/tika/pull/168#discussion_r113797759 ########## File path: tika-parsers/src/main/resources/org/apache/tika/parser/recognition/tf/InceptionVideoRestDockerfile ########## @@ -0,0 +1,58 @@ +# +# Licensed to the Apache Software Foundation (ASF) under one or more +# contributor license agreements. See the NOTICE file distributed with +# this work for additional information regarding copyright ownership. +# The ASF licenses this file to You under the Apache License, Version 2.0 +# (the "License"); you may not use this file except in compliance with +# the License. You may obtain a copy of the License at +# +# http://www.apache.org/licenses/LICENSE-2.0 +# +# Unless required by applicable law or agreed to in writing, software +# distributed under the License is distributed on an "AS IS" BASIS, +# WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied. +# See the License for the specific language governing permissions and +# limitations under the License. +# +FROM ubuntu + +MAINTAINER Thamme Gowda <thammegowda@apache.org> and Kranthi Kiran GV <kkranthi@student.nitw.ac.in> + +# Install anaconda +FROM continuumio/anaconda Review comment: Opencv can be installed with miniconda. There shouldn't be any issue. I've done that multiple times. So to be sure, I built your dockerfile, replacing anaconda with miniconda, I could successfully built it, then I sshed into the docker and checked whether Opencv is installed and it was. ---------------------------------------------------------------- This is an automated message from the Apache Git Service. To respond to the message, please log on GitHub and use the URL above to go to the specific comment. For queries about this service, please contact Infrastructure at: users@infra.apache.org
        Hide
        githubbot ASF GitHub Bot added a comment -

        ThejanW commented on a change in pull request #168: fix for TIKA-2322 contributed by msharan@usc.edu
        URL: https://github.com/apache/tika/pull/168#discussion_r113797759

        ##########
        File path: tika-parsers/src/main/resources/org/apache/tika/parser/recognition/tf/InceptionVideoRestDockerfile
        ##########
        @@ -0,0 +1,58 @@
        +#
        +# Licensed to the Apache Software Foundation (ASF) under one or more
        +# contributor license agreements. See the NOTICE file distributed with
        +# this work for additional information regarding copyright ownership.
        +# The ASF licenses this file to You under the Apache License, Version 2.0
        +# (the "License"); you may not use this file except in compliance with
        +# the License. You may obtain a copy of the License at
        +#
        +# http://www.apache.org/licenses/LICENSE-2.0
        +#
        +# Unless required by applicable law or agreed to in writing, software
        +# distributed under the License is distributed on an "AS IS" BASIS,
        +# WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
        +# See the License for the specific language governing permissions and
        +# limitations under the License.
        +#
        +FROM ubuntu
        +
        +MAINTAINER Thamme Gowda <thammegowda@apache.org> and Kranthi Kiran GV <kkranthi@student.nitw.ac.in>
        +
        +# Install anaconda
        +FROM continuumio/anaconda

        Review comment:
        Opencv can be installed with miniconda. There shouldn't be any issue. I've done that multiple times. So to be sure, I built your dockerfile, replacing anaconda with miniconda, I could successfully built it, then I sshed into the docker and checked whether Opencv is installed and it was.

        ----------------------------------------------------------------
        This is an automated message from the Apache Git Service.
        To respond to the message, please log on GitHub and use the
        URL above to go to the specific comment.

        For queries about this service, please contact Infrastructure at:
        users@infra.apache.org

        Show
        githubbot ASF GitHub Bot added a comment - ThejanW commented on a change in pull request #168: fix for TIKA-2322 contributed by msharan@usc.edu URL: https://github.com/apache/tika/pull/168#discussion_r113797759 ########## File path: tika-parsers/src/main/resources/org/apache/tika/parser/recognition/tf/InceptionVideoRestDockerfile ########## @@ -0,0 +1,58 @@ +# +# Licensed to the Apache Software Foundation (ASF) under one or more +# contributor license agreements. See the NOTICE file distributed with +# this work for additional information regarding copyright ownership. +# The ASF licenses this file to You under the Apache License, Version 2.0 +# (the "License"); you may not use this file except in compliance with +# the License. You may obtain a copy of the License at +# +# http://www.apache.org/licenses/LICENSE-2.0 +# +# Unless required by applicable law or agreed to in writing, software +# distributed under the License is distributed on an "AS IS" BASIS, +# WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied. +# See the License for the specific language governing permissions and +# limitations under the License. +# +FROM ubuntu + +MAINTAINER Thamme Gowda <thammegowda@apache.org> and Kranthi Kiran GV <kkranthi@student.nitw.ac.in> + +# Install anaconda +FROM continuumio/anaconda Review comment: Opencv can be installed with miniconda. There shouldn't be any issue. I've done that multiple times. So to be sure, I built your dockerfile, replacing anaconda with miniconda, I could successfully built it, then I sshed into the docker and checked whether Opencv is installed and it was. ---------------------------------------------------------------- This is an automated message from the Apache Git Service. To respond to the message, please log on GitHub and use the URL above to go to the specific comment. For queries about this service, please contact Infrastructure at: users@infra.apache.org
        Hide
        githubbot ASF GitHub Bot added a comment -

        ThejanW commented on a change in pull request #168: fix for TIKA-2322 contributed by msharan@usc.edu
        URL: https://github.com/apache/tika/pull/168#discussion_r113797759

        ##########
        File path: tika-parsers/src/main/resources/org/apache/tika/parser/recognition/tf/InceptionVideoRestDockerfile
        ##########
        @@ -0,0 +1,58 @@
        +#
        +# Licensed to the Apache Software Foundation (ASF) under one or more
        +# contributor license agreements. See the NOTICE file distributed with
        +# this work for additional information regarding copyright ownership.
        +# The ASF licenses this file to You under the Apache License, Version 2.0
        +# (the "License"); you may not use this file except in compliance with
        +# the License. You may obtain a copy of the License at
        +#
        +# http://www.apache.org/licenses/LICENSE-2.0
        +#
        +# Unless required by applicable law or agreed to in writing, software
        +# distributed under the License is distributed on an "AS IS" BASIS,
        +# WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
        +# See the License for the specific language governing permissions and
        +# limitations under the License.
        +#
        +FROM ubuntu
        +
        +MAINTAINER Thamme Gowda <thammegowda@apache.org> and Kranthi Kiran GV <kkranthi@student.nitw.ac.in>
        +
        +# Install anaconda
        +FROM continuumio/anaconda

        Review comment:
        Opencv can be installed with miniconda. There shouldn't be any issue. I've done that multiple times. So to be sure, I built your dockerfile, replacing anaconda with miniconda, I could successfully build it, then I sshed into the docker and checked whether Opencv is installed and it was.

        ----------------------------------------------------------------
        This is an automated message from the Apache Git Service.
        To respond to the message, please log on GitHub and use the
        URL above to go to the specific comment.

        For queries about this service, please contact Infrastructure at:
        users@infra.apache.org

        Show
        githubbot ASF GitHub Bot added a comment - ThejanW commented on a change in pull request #168: fix for TIKA-2322 contributed by msharan@usc.edu URL: https://github.com/apache/tika/pull/168#discussion_r113797759 ########## File path: tika-parsers/src/main/resources/org/apache/tika/parser/recognition/tf/InceptionVideoRestDockerfile ########## @@ -0,0 +1,58 @@ +# +# Licensed to the Apache Software Foundation (ASF) under one or more +# contributor license agreements. See the NOTICE file distributed with +# this work for additional information regarding copyright ownership. +# The ASF licenses this file to You under the Apache License, Version 2.0 +# (the "License"); you may not use this file except in compliance with +# the License. You may obtain a copy of the License at +# +# http://www.apache.org/licenses/LICENSE-2.0 +# +# Unless required by applicable law or agreed to in writing, software +# distributed under the License is distributed on an "AS IS" BASIS, +# WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied. +# See the License for the specific language governing permissions and +# limitations under the License. +# +FROM ubuntu + +MAINTAINER Thamme Gowda <thammegowda@apache.org> and Kranthi Kiran GV <kkranthi@student.nitw.ac.in> + +# Install anaconda +FROM continuumio/anaconda Review comment: Opencv can be installed with miniconda. There shouldn't be any issue. I've done that multiple times. So to be sure, I built your dockerfile, replacing anaconda with miniconda, I could successfully build it, then I sshed into the docker and checked whether Opencv is installed and it was. ---------------------------------------------------------------- This is an automated message from the Apache Git Service. To respond to the message, please log on GitHub and use the URL above to go to the specific comment. For queries about this service, please contact Infrastructure at: users@infra.apache.org
        Hide
        githubbot ASF GitHub Bot added a comment -

        smadha commented on a change in pull request #168: fix for TIKA-2322 contributed by msharan@usc.edu
        URL: https://github.com/apache/tika/pull/168#discussion_r113802156

        ##########
        File path: tika-parsers/src/main/resources/org/apache/tika/parser/recognition/tf/InceptionVideoRestDockerfile
        ##########
        @@ -0,0 +1,58 @@
        +#
        +# Licensed to the Apache Software Foundation (ASF) under one or more
        +# contributor license agreements. See the NOTICE file distributed with
        +# this work for additional information regarding copyright ownership.
        +# The ASF licenses this file to You under the Apache License, Version 2.0
        +# (the "License"); you may not use this file except in compliance with
        +# the License. You may obtain a copy of the License at
        +#
        +# http://www.apache.org/licenses/LICENSE-2.0
        +#
        +# Unless required by applicable law or agreed to in writing, software
        +# distributed under the License is distributed on an "AS IS" BASIS,
        +# WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
        +# See the License for the specific language governing permissions and
        +# limitations under the License.
        +#
        +FROM ubuntu
        +
        +MAINTAINER Thamme Gowda <thammegowda@apache.org> and Kranthi Kiran GV <kkranthi@student.nitw.ac.in>
        +
        +# Install anaconda
        +FROM continuumio/anaconda

        Review comment:
        Thanks for trying that, we'll change it to miniconda but as of now I am trying to install it with ffmpeg for video support. If you want to help can you try installing openCV with video support and try inception video api if that works?

        ----------------------------------------------------------------
        This is an automated message from the Apache Git Service.
        To respond to the message, please log on GitHub and use the
        URL above to go to the specific comment.

        For queries about this service, please contact Infrastructure at:
        users@infra.apache.org

        Show
        githubbot ASF GitHub Bot added a comment - smadha commented on a change in pull request #168: fix for TIKA-2322 contributed by msharan@usc.edu URL: https://github.com/apache/tika/pull/168#discussion_r113802156 ########## File path: tika-parsers/src/main/resources/org/apache/tika/parser/recognition/tf/InceptionVideoRestDockerfile ########## @@ -0,0 +1,58 @@ +# +# Licensed to the Apache Software Foundation (ASF) under one or more +# contributor license agreements. See the NOTICE file distributed with +# this work for additional information regarding copyright ownership. +# The ASF licenses this file to You under the Apache License, Version 2.0 +# (the "License"); you may not use this file except in compliance with +# the License. You may obtain a copy of the License at +# +# http://www.apache.org/licenses/LICENSE-2.0 +# +# Unless required by applicable law or agreed to in writing, software +# distributed under the License is distributed on an "AS IS" BASIS, +# WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied. +# See the License for the specific language governing permissions and +# limitations under the License. +# +FROM ubuntu + +MAINTAINER Thamme Gowda <thammegowda@apache.org> and Kranthi Kiran GV <kkranthi@student.nitw.ac.in> + +# Install anaconda +FROM continuumio/anaconda Review comment: Thanks for trying that, we'll change it to miniconda but as of now I am trying to install it with ffmpeg for video support. If you want to help can you try installing openCV with video support and try inception video api if that works? ---------------------------------------------------------------- This is an automated message from the Apache Git Service. To respond to the message, please log on GitHub and use the URL above to go to the specific comment. For queries about this service, please contact Infrastructure at: users@infra.apache.org
        Hide
        githubbot ASF GitHub Bot added a comment -

        smadha commented on a change in pull request #168: fix for TIKA-2322 contributed by msharan@usc.edu
        URL: https://github.com/apache/tika/pull/168#discussion_r113802445

        ##########
        File path: tika-parsers/src/main/resources/org/apache/tika/parser/recognition/tf/InceptionVideoRestDockerfile
        ##########
        @@ -0,0 +1,58 @@
        +#
        +# Licensed to the Apache Software Foundation (ASF) under one or more
        +# contributor license agreements. See the NOTICE file distributed with
        +# this work for additional information regarding copyright ownership.
        +# The ASF licenses this file to You under the Apache License, Version 2.0
        +# (the "License"); you may not use this file except in compliance with
        +# the License. You may obtain a copy of the License at
        +#
        +# http://www.apache.org/licenses/LICENSE-2.0
        +#
        +# Unless required by applicable law or agreed to in writing, software
        +# distributed under the License is distributed on an "AS IS" BASIS,
        +# WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
        +# See the License for the specific language governing permissions and
        +# limitations under the License.
        +#
        +FROM ubuntu
        +
        +MAINTAINER Thamme Gowda <thammegowda@apache.org> and Kranthi Kiran GV <kkranthi@student.nitw.ac.in>
        +
        +# Install anaconda
        +FROM continuumio/anaconda
        +EXPOSE 5000

        Review comment:
        This can be removed. Got it along with some example from stackoverflow

        ----------------------------------------------------------------
        This is an automated message from the Apache Git Service.
        To respond to the message, please log on GitHub and use the
        URL above to go to the specific comment.

        For queries about this service, please contact Infrastructure at:
        users@infra.apache.org

        Show
        githubbot ASF GitHub Bot added a comment - smadha commented on a change in pull request #168: fix for TIKA-2322 contributed by msharan@usc.edu URL: https://github.com/apache/tika/pull/168#discussion_r113802445 ########## File path: tika-parsers/src/main/resources/org/apache/tika/parser/recognition/tf/InceptionVideoRestDockerfile ########## @@ -0,0 +1,58 @@ +# +# Licensed to the Apache Software Foundation (ASF) under one or more +# contributor license agreements. See the NOTICE file distributed with +# this work for additional information regarding copyright ownership. +# The ASF licenses this file to You under the Apache License, Version 2.0 +# (the "License"); you may not use this file except in compliance with +# the License. You may obtain a copy of the License at +# +# http://www.apache.org/licenses/LICENSE-2.0 +# +# Unless required by applicable law or agreed to in writing, software +# distributed under the License is distributed on an "AS IS" BASIS, +# WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied. +# See the License for the specific language governing permissions and +# limitations under the License. +# +FROM ubuntu + +MAINTAINER Thamme Gowda <thammegowda@apache.org> and Kranthi Kiran GV <kkranthi@student.nitw.ac.in> + +# Install anaconda +FROM continuumio/anaconda +EXPOSE 5000 Review comment: This can be removed. Got it along with some example from stackoverflow ---------------------------------------------------------------- This is an automated message from the Apache Git Service. To respond to the message, please log on GitHub and use the URL above to go to the specific comment. For queries about this service, please contact Infrastructure at: users@infra.apache.org
        Hide
        githubbot ASF GitHub Bot added a comment -

        smadha commented on a change in pull request #168: fix for TIKA-2322 contributed by msharan@usc.edu
        URL: https://github.com/apache/tika/pull/168#discussion_r113802156

        ##########
        File path: tika-parsers/src/main/resources/org/apache/tika/parser/recognition/tf/InceptionVideoRestDockerfile
        ##########
        @@ -0,0 +1,58 @@
        +#
        +# Licensed to the Apache Software Foundation (ASF) under one or more
        +# contributor license agreements. See the NOTICE file distributed with
        +# this work for additional information regarding copyright ownership.
        +# The ASF licenses this file to You under the Apache License, Version 2.0
        +# (the "License"); you may not use this file except in compliance with
        +# the License. You may obtain a copy of the License at
        +#
        +# http://www.apache.org/licenses/LICENSE-2.0
        +#
        +# Unless required by applicable law or agreed to in writing, software
        +# distributed under the License is distributed on an "AS IS" BASIS,
        +# WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
        +# See the License for the specific language governing permissions and
        +# limitations under the License.
        +#
        +FROM ubuntu
        +
        +MAINTAINER Thamme Gowda <thammegowda@apache.org> and Kranthi Kiran GV <kkranthi@student.nitw.ac.in>
        +
        +# Install anaconda
        +FROM continuumio/anaconda

        Review comment:
        Thanks for trying that, we'll change it to miniconda but as of now I am trying to install it with ffmpeg for video support. If you want to help can you try installing openCV with video support and try inception video api if that works?

        ----------------------------------------------------------------
        This is an automated message from the Apache Git Service.
        To respond to the message, please log on GitHub and use the
        URL above to go to the specific comment.

        For queries about this service, please contact Infrastructure at:
        users@infra.apache.org

        Show
        githubbot ASF GitHub Bot added a comment - smadha commented on a change in pull request #168: fix for TIKA-2322 contributed by msharan@usc.edu URL: https://github.com/apache/tika/pull/168#discussion_r113802156 ########## File path: tika-parsers/src/main/resources/org/apache/tika/parser/recognition/tf/InceptionVideoRestDockerfile ########## @@ -0,0 +1,58 @@ +# +# Licensed to the Apache Software Foundation (ASF) under one or more +# contributor license agreements. See the NOTICE file distributed with +# this work for additional information regarding copyright ownership. +# The ASF licenses this file to You under the Apache License, Version 2.0 +# (the "License"); you may not use this file except in compliance with +# the License. You may obtain a copy of the License at +# +# http://www.apache.org/licenses/LICENSE-2.0 +# +# Unless required by applicable law or agreed to in writing, software +# distributed under the License is distributed on an "AS IS" BASIS, +# WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied. +# See the License for the specific language governing permissions and +# limitations under the License. +# +FROM ubuntu + +MAINTAINER Thamme Gowda <thammegowda@apache.org> and Kranthi Kiran GV <kkranthi@student.nitw.ac.in> + +# Install anaconda +FROM continuumio/anaconda Review comment: Thanks for trying that, we'll change it to miniconda but as of now I am trying to install it with ffmpeg for video support. If you want to help can you try installing openCV with video support and try inception video api if that works? ---------------------------------------------------------------- This is an automated message from the Apache Git Service. To respond to the message, please log on GitHub and use the URL above to go to the specific comment. For queries about this service, please contact Infrastructure at: users@infra.apache.org
        Hide
        githubbot ASF GitHub Bot added a comment -

        ThejanW commented on a change in pull request #168: fix for TIKA-2322 contributed by msharan@usc.edu
        URL: https://github.com/apache/tika/pull/168#discussion_r113807453

        ##########
        File path: tika-parsers/src/main/resources/org/apache/tika/parser/recognition/tf/InceptionVideoRestDockerfile
        ##########
        @@ -0,0 +1,58 @@
        +#
        +# Licensed to the Apache Software Foundation (ASF) under one or more
        +# contributor license agreements. See the NOTICE file distributed with
        +# this work for additional information regarding copyright ownership.
        +# The ASF licenses this file to You under the Apache License, Version 2.0
        +# (the "License"); you may not use this file except in compliance with
        +# the License. You may obtain a copy of the License at
        +#
        +# http://www.apache.org/licenses/LICENSE-2.0
        +#
        +# Unless required by applicable law or agreed to in writing, software
        +# distributed under the License is distributed on an "AS IS" BASIS,
        +# WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
        +# See the License for the specific language governing permissions and
        +# limitations under the License.
        +#
        +FROM ubuntu
        +
        +MAINTAINER Thamme Gowda <thammegowda@apache.org> and Kranthi Kiran GV <kkranthi@student.nitw.ac.in>
        +
        +# Install anaconda
        +FROM continuumio/anaconda

        Review comment:
        Yeah, I am happy to help :100: . I'm having some issues with trying out the API, it's giving me a 500 internal server error. Adding video support is great. But can we use ffmpeg? It's coming with a LGPL license, see https://www.ffmpeg.org/legal.html

        ----------------------------------------------------------------
        This is an automated message from the Apache Git Service.
        To respond to the message, please log on GitHub and use the
        URL above to go to the specific comment.

        For queries about this service, please contact Infrastructure at:
        users@infra.apache.org

        Show
        githubbot ASF GitHub Bot added a comment - ThejanW commented on a change in pull request #168: fix for TIKA-2322 contributed by msharan@usc.edu URL: https://github.com/apache/tika/pull/168#discussion_r113807453 ########## File path: tika-parsers/src/main/resources/org/apache/tika/parser/recognition/tf/InceptionVideoRestDockerfile ########## @@ -0,0 +1,58 @@ +# +# Licensed to the Apache Software Foundation (ASF) under one or more +# contributor license agreements. See the NOTICE file distributed with +# this work for additional information regarding copyright ownership. +# The ASF licenses this file to You under the Apache License, Version 2.0 +# (the "License"); you may not use this file except in compliance with +# the License. You may obtain a copy of the License at +# +# http://www.apache.org/licenses/LICENSE-2.0 +# +# Unless required by applicable law or agreed to in writing, software +# distributed under the License is distributed on an "AS IS" BASIS, +# WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied. +# See the License for the specific language governing permissions and +# limitations under the License. +# +FROM ubuntu + +MAINTAINER Thamme Gowda <thammegowda@apache.org> and Kranthi Kiran GV <kkranthi@student.nitw.ac.in> + +# Install anaconda +FROM continuumio/anaconda Review comment: Yeah, I am happy to help :100: . I'm having some issues with trying out the API, it's giving me a 500 internal server error. Adding video support is great. But can we use ffmpeg? It's coming with a LGPL license, see https://www.ffmpeg.org/legal.html ---------------------------------------------------------------- This is an automated message from the Apache Git Service. To respond to the message, please log on GitHub and use the URL above to go to the specific comment. For queries about this service, please contact Infrastructure at: users@infra.apache.org
        Hide
        githubbot ASF GitHub Bot added a comment -

        chrismattmann commented on issue #168: fix for TIKA-2322 contributed by msharan@usc.edu
        URL: https://github.com/apache/tika/pull/168#issuecomment-297904818

        thanks @smadha I am testing your latest commits to see if OpenCV + FFMPEG work for me in docker. Others, please also test.

        ----------------------------------------------------------------
        This is an automated message from the Apache Git Service.
        To respond to the message, please log on GitHub and use the
        URL above to go to the specific comment.

        For queries about this service, please contact Infrastructure at:
        users@infra.apache.org

        Show
        githubbot ASF GitHub Bot added a comment - chrismattmann commented on issue #168: fix for TIKA-2322 contributed by msharan@usc.edu URL: https://github.com/apache/tika/pull/168#issuecomment-297904818 thanks @smadha I am testing your latest commits to see if OpenCV + FFMPEG work for me in docker. Others, please also test. ---------------------------------------------------------------- This is an automated message from the Apache Git Service. To respond to the message, please log on GitHub and use the URL above to go to the specific comment. For queries about this service, please contact Infrastructure at: users@infra.apache.org
        Hide
        githubbot ASF GitHub Bot added a comment -

        chrismattmann commented on issue #168: fix for TIKA-2322 contributed by msharan@usc.edu
        URL: https://github.com/apache/tika/pull/168#issuecomment-297907725

        OK I was able to build your latest Docker @smadha from https://github.com/apache/tika/pull/168/commits/434736be63373e8caa85fd8c9bd117e6edbec555 https://github.com/apache/tika/pull/168/commits/58a116c2123d9c01ba054969121244364059c0d2 and and found the following:

        == Running the Tika App Client Command
        ```
        LMC-053601:smadha-tika mattmann$ java -jar tika-app/target/tika-app-1.15-SNAPSHOT.jar --config=tika-parsers/src/test/resources/org/apache/tika/parser/recognition/tika-config-tflow-video-rest.xml ./tika-parsers/src/test/resources/test-documents/testVideoMp4.mp4
        WARN JBIG2ImageReader not loaded. jbig2 files will be ignored
        INFO Available = true, API Status = HTTP/1.0 200 OK
        INFO minConfidence = 0.015, topN=4
        INFO Recogniser = org.apache.tika.parser.recognition.tf.TensorflowRESTVideoRecogniser
        INFO Recogniser Available = true
        WARN Status = HTTP/1.0 500 INTERNAL SERVER ERROR
        WARN Response = <!DOCTYPE HTML PUBLIC "-//W3C//DTD HTML 3.2 Final//EN">
        <title>500 Internal Server Error</title>
        <h1>Internal Server Error</h1>
        <p>The server encountered an internal error and was unable to complete your request. Either the server is overloaded or there is an error in the application.</p>

        WARN NO objects
        LMC-053601:smadha-tika mattmann$
        ```

        == Results (from Tensorflow Video Docker Server)
        ```

        • Running on http://0.0.0.0:8764/ (Press CTRL+C to quit)
          10.0.2.2 - - [28/Apr/2017 05:01:26] "GET /inception/v4/ping HTTP/1.1" 200 -
          [2017-04-28 05:01:26,287] ERROR in app: Exception on /inception/v4/classify/video [POST]
          Traceback (most recent call last):
          File "/opt/conda/lib/python2.7/site-packages/flask/app.py", line 1982, in wsgi_app
          response = self.full_dispatch_request()
          File "/opt/conda/lib/python2.7/site-packages/flask/app.py", line 1614, in full_dispatch_request
          rv = self.handle_user_exception(e)
          File "/opt/conda/lib/python2.7/site-packages/flask/app.py", line 1517, in handle_user_exception
          reraise(exc_type, exc_value, tb)
          File "/opt/conda/lib/python2.7/site-packages/flask/app.py", line 1612, in full_dispatch_request
          rv = self.dispatch_request()
          File "/opt/conda/lib/python2.7/site-packages/flask/app.py", line 1598, in dispatch_request
          return self.view_functions[rule.endpoint](**req.view_args)
          File "/usr/bin/inceptionapi.py", line 489, in classify_video
          classids, classnames, confidence = zip(*classes)
          ValueError: need more than 0 values to unpack
          10.0.2.2 - - [28/Apr/2017 05:01:26] "POST /inception/v4/classify/video?mode=fixed&ext=.mp4 HTTP/1.1" 500
          ```

        ----------------------------------------------------------------
        This is an automated message from the Apache Git Service.
        To respond to the message, please log on GitHub and use the
        URL above to go to the specific comment.

        For queries about this service, please contact Infrastructure at:
        users@infra.apache.org

        Show
        githubbot ASF GitHub Bot added a comment - chrismattmann commented on issue #168: fix for TIKA-2322 contributed by msharan@usc.edu URL: https://github.com/apache/tika/pull/168#issuecomment-297907725 OK I was able to build your latest Docker @smadha from https://github.com/apache/tika/pull/168/commits/434736be63373e8caa85fd8c9bd117e6edbec555 https://github.com/apache/tika/pull/168/commits/58a116c2123d9c01ba054969121244364059c0d2 and and found the following: == Running the Tika App Client Command ``` LMC-053601:smadha-tika mattmann$ java -jar tika-app/target/tika-app-1.15-SNAPSHOT.jar --config=tika-parsers/src/test/resources/org/apache/tika/parser/recognition/tika-config-tflow-video-rest.xml ./tika-parsers/src/test/resources/test-documents/testVideoMp4.mp4 WARN JBIG2ImageReader not loaded. jbig2 files will be ignored INFO Available = true, API Status = HTTP/1.0 200 OK INFO minConfidence = 0.015, topN=4 INFO Recogniser = org.apache.tika.parser.recognition.tf.TensorflowRESTVideoRecogniser INFO Recogniser Available = true WARN Status = HTTP/1.0 500 INTERNAL SERVER ERROR WARN Response = <!DOCTYPE HTML PUBLIC "-//W3C//DTD HTML 3.2 Final//EN"> <title>500 Internal Server Error</title> <h1>Internal Server Error</h1> <p>The server encountered an internal error and was unable to complete your request. Either the server is overloaded or there is an error in the application.</p> WARN NO objects LMC-053601:smadha-tika mattmann$ ``` == Results (from Tensorflow Video Docker Server) ``` Running on http://0.0.0.0:8764/ (Press CTRL+C to quit) 10.0.2.2 - - [28/Apr/2017 05:01:26] "GET /inception/v4/ping HTTP/1.1" 200 - [2017-04-28 05:01:26,287] ERROR in app: Exception on /inception/v4/classify/video [POST] Traceback (most recent call last): File "/opt/conda/lib/python2.7/site-packages/flask/app.py", line 1982, in wsgi_app response = self.full_dispatch_request() File "/opt/conda/lib/python2.7/site-packages/flask/app.py", line 1614, in full_dispatch_request rv = self.handle_user_exception(e) File "/opt/conda/lib/python2.7/site-packages/flask/app.py", line 1517, in handle_user_exception reraise(exc_type, exc_value, tb) File "/opt/conda/lib/python2.7/site-packages/flask/app.py", line 1612, in full_dispatch_request rv = self.dispatch_request() File "/opt/conda/lib/python2.7/site-packages/flask/app.py", line 1598, in dispatch_request return self.view_functions [rule.endpoint] (**req.view_args) File "/usr/bin/inceptionapi.py", line 489, in classify_video classids, classnames, confidence = zip(*classes) ValueError: need more than 0 values to unpack 10.0.2.2 - - [28/Apr/2017 05:01:26] "POST /inception/v4/classify/video?mode=fixed&ext=.mp4 HTTP/1.1" 500 ``` ---------------------------------------------------------------- This is an automated message from the Apache Git Service. To respond to the message, please log on GitHub and use the URL above to go to the specific comment. For queries about this service, please contact Infrastructure at: users@infra.apache.org
        Hide
        githubbot ASF GitHub Bot added a comment -

        chrismattmann commented on issue #168: fix for TIKA-2322 contributed by msharan@usc.edu
        URL: https://github.com/apache/tika/pull/168#issuecomment-297907725

        OK I was able to build your latest Docker @smadha from https://github.com/apache/tika/pull/168/commits/434736be63373e8caa85fd8c9bd117e6edbec555 https://github.com/apache/tika/pull/168/commits/58a116c2123d9c01ba054969121244364059c0d2 and and found the following:

        == Running the Tika App Client Command
        ```
        LMC-053601:smadha-tika mattmann$ java -jar tika-app/target/tika-app-1.15-SNAPSHOT.jar --config=tika-parsers/src/test/resources/org/apache/tika/parser/recognition/tika-config-tflow-video-rest.xml ./tika-parsers/src/test/resources/test-documents/testVideoMp4.mp4
        WARN JBIG2ImageReader not loaded. jbig2 files will be ignored
        INFO Available = true, API Status = HTTP/1.0 200 OK
        INFO minConfidence = 0.015, topN=4
        INFO Recogniser = org.apache.tika.parser.recognition.tf.TensorflowRESTVideoRecogniser
        INFO Recogniser Available = true
        WARN Status = HTTP/1.0 500 INTERNAL SERVER ERROR
        WARN Response = <!DOCTYPE HTML PUBLIC "-//W3C//DTD HTML 3.2 Final//EN">
        <title>500 Internal Server Error</title>
        <h1>Internal Server Error</h1>
        <p>The server encountered an internal error and was unable to complete your request. Either the server is overloaded or there is an error in the application.</p>

        WARN NO objects
        LMC-053601:smadha-tika mattmann$
        ```

        == Results (from Tensorflow Video Docker Server)
        ```

        • Running on http://0.0.0.0:8764/ (Press CTRL+C to quit)
          10.0.2.2 - - [28/Apr/2017 05:01:26] "GET /inception/v4/ping HTTP/1.1" 200 -
          [2017-04-28 05:01:26,287] ERROR in app: Exception on /inception/v4/classify/video [POST]
          Traceback (most recent call last):
          File "/opt/conda/lib/python2.7/site-packages/flask/app.py", line 1982, in wsgi_app
          response = self.full_dispatch_request()
          File "/opt/conda/lib/python2.7/site-packages/flask/app.py", line 1614, in full_dispatch_request
          rv = self.handle_user_exception(e)
          File "/opt/conda/lib/python2.7/site-packages/flask/app.py", line 1517, in handle_user_exception
          reraise(exc_type, exc_value, tb)
          File "/opt/conda/lib/python2.7/site-packages/flask/app.py", line 1612, in full_dispatch_request
          rv = self.dispatch_request()
          File "/opt/conda/lib/python2.7/site-packages/flask/app.py", line 1598, in dispatch_request
          return self.view_functions[rule.endpoint](**req.view_args)
          File "/usr/bin/inceptionapi.py", line 489, in classify_video
          classids, classnames, confidence = zip(*classes)
          ValueError: need more than 0 values to unpack
          10.0.2.2 - - [28/Apr/2017 05:01:26] "POST /inception/v4/classify/video?mode=fixed&ext=.mp4 HTTP/1.1" 500
          ```

        ----------------------------------------------------------------
        This is an automated message from the Apache Git Service.
        To respond to the message, please log on GitHub and use the
        URL above to go to the specific comment.

        For queries about this service, please contact Infrastructure at:
        users@infra.apache.org

        Show
        githubbot ASF GitHub Bot added a comment - chrismattmann commented on issue #168: fix for TIKA-2322 contributed by msharan@usc.edu URL: https://github.com/apache/tika/pull/168#issuecomment-297907725 OK I was able to build your latest Docker @smadha from https://github.com/apache/tika/pull/168/commits/434736be63373e8caa85fd8c9bd117e6edbec555 https://github.com/apache/tika/pull/168/commits/58a116c2123d9c01ba054969121244364059c0d2 and and found the following: == Running the Tika App Client Command ``` LMC-053601:smadha-tika mattmann$ java -jar tika-app/target/tika-app-1.15-SNAPSHOT.jar --config=tika-parsers/src/test/resources/org/apache/tika/parser/recognition/tika-config-tflow-video-rest.xml ./tika-parsers/src/test/resources/test-documents/testVideoMp4.mp4 WARN JBIG2ImageReader not loaded. jbig2 files will be ignored INFO Available = true, API Status = HTTP/1.0 200 OK INFO minConfidence = 0.015, topN=4 INFO Recogniser = org.apache.tika.parser.recognition.tf.TensorflowRESTVideoRecogniser INFO Recogniser Available = true WARN Status = HTTP/1.0 500 INTERNAL SERVER ERROR WARN Response = <!DOCTYPE HTML PUBLIC "-//W3C//DTD HTML 3.2 Final//EN"> <title>500 Internal Server Error</title> <h1>Internal Server Error</h1> <p>The server encountered an internal error and was unable to complete your request. Either the server is overloaded or there is an error in the application.</p> WARN NO objects LMC-053601:smadha-tika mattmann$ ``` == Results (from Tensorflow Video Docker Server) ``` Running on http://0.0.0.0:8764/ (Press CTRL+C to quit) 10.0.2.2 - - [28/Apr/2017 05:01:26] "GET /inception/v4/ping HTTP/1.1" 200 - [2017-04-28 05:01:26,287] ERROR in app: Exception on /inception/v4/classify/video [POST] Traceback (most recent call last): File "/opt/conda/lib/python2.7/site-packages/flask/app.py", line 1982, in wsgi_app response = self.full_dispatch_request() File "/opt/conda/lib/python2.7/site-packages/flask/app.py", line 1614, in full_dispatch_request rv = self.handle_user_exception(e) File "/opt/conda/lib/python2.7/site-packages/flask/app.py", line 1517, in handle_user_exception reraise(exc_type, exc_value, tb) File "/opt/conda/lib/python2.7/site-packages/flask/app.py", line 1612, in full_dispatch_request rv = self.dispatch_request() File "/opt/conda/lib/python2.7/site-packages/flask/app.py", line 1598, in dispatch_request return self.view_functions [rule.endpoint] (**req.view_args) File "/usr/bin/inceptionapi.py", line 489, in classify_video classids, classnames, confidence = zip(*classes) ValueError: need more than 0 values to unpack 10.0.2.2 - - [28/Apr/2017 05:01:26] "POST /inception/v4/classify/video?mode=fixed&ext=.mp4 HTTP/1.1" 500 ``` ---------------------------------------------------------------- This is an automated message from the Apache Git Service. To respond to the message, please log on GitHub and use the URL above to go to the specific comment. For queries about this service, please contact Infrastructure at: users@infra.apache.org
        Hide
        githubbot ASF GitHub Bot added a comment -

        smadha commented on issue #168: fix for TIKA-2322 contributed by msharan@usc.edu
        URL: https://github.com/apache/tika/pull/168#issuecomment-297907929

        Yeah, OpenCV still can't read videos

        ----------------------------------------------------------------
        This is an automated message from the Apache Git Service.
        To respond to the message, please log on GitHub and use the
        URL above to go to the specific comment.

        For queries about this service, please contact Infrastructure at:
        users@infra.apache.org

        Show
        githubbot ASF GitHub Bot added a comment - smadha commented on issue #168: fix for TIKA-2322 contributed by msharan@usc.edu URL: https://github.com/apache/tika/pull/168#issuecomment-297907929 Yeah, OpenCV still can't read videos ---------------------------------------------------------------- This is an automated message from the Apache Git Service. To respond to the message, please log on GitHub and use the URL above to go to the specific comment. For queries about this service, please contact Infrastructure at: users@infra.apache.org
        Hide
        githubbot ASF GitHub Bot added a comment -

        chrismattmann commented on issue #168: fix for TIKA-2322 contributed by msharan@usc.edu
        URL: https://github.com/apache/tika/pull/168#issuecomment-297907994

        looks like you are pretty close @smadha just looks like some error in line 489 of classify_video. Looking at that line here: https://github.com/apache/tika/pull/168/files#diff-bec45a808394395f223b4aa2468e9cddR489 it seems to not have any extracted classes.

        ----------------------------------------------------------------
        This is an automated message from the Apache Git Service.
        To respond to the message, please log on GitHub and use the
        URL above to go to the specific comment.

        For queries about this service, please contact Infrastructure at:
        users@infra.apache.org

        Show
        githubbot ASF GitHub Bot added a comment - chrismattmann commented on issue #168: fix for TIKA-2322 contributed by msharan@usc.edu URL: https://github.com/apache/tika/pull/168#issuecomment-297907994 looks like you are pretty close @smadha just looks like some error in line 489 of classify_video. Looking at that line here: https://github.com/apache/tika/pull/168/files#diff-bec45a808394395f223b4aa2468e9cddR489 it seems to not have any extracted classes. ---------------------------------------------------------------- This is an automated message from the Apache Git Service. To respond to the message, please log on GitHub and use the URL above to go to the specific comment. For queries about this service, please contact Infrastructure at: users@infra.apache.org
        Hide
        githubbot ASF GitHub Bot added a comment -

        chrismattmann commented on issue #168: fix for TIKA-2322 contributed by msharan@usc.edu
        URL: https://github.com/apache/tika/pull/168#issuecomment-297910260

        talked to @smadha on GChat. Basically the issue is even in the current Docker file with Conda/OpenCV, etc., we can't get it to recognize the video file (FFMPEG + OpenCV, not working). I SSH'ed onto the docker machine:

        ```
        root@2156ca034e87:/# curl -LO https://github.com/smadha/tika/blob/TIKA-2322/tika-parsers/src/test/resources/test-documents/testVideoMp4.mp4?raw=true
        % Total % Received % Xferd Average Speed Time Time Time Current
        Dload Upload Total Spent Left Speed
        100 174 0 174 0 0 459 0 -::- -::- -::- 519
        100 185 100 185 0 0 320 0 -::- -::- -::- 320
        100 1028k 100 1028k 0 0 523k 0 0:00:01 0:00:01 -::- 888k
        root@2156ca034e87:/# ls
        bin c15fada28113eca32dc98d6e3bec4755d0d5b4c2.zip etc lib media models-c15fada28113eca32dc98d6e3bec4755d0d5b4c2 proc run srv testVideoMp4.mp4?raw=true usr
        boot dev home lib64 mnt opt root sbin sys tmp var
        root@2156ca034e87:/# mv testVideoMp4.mp4?raw\=true testVideoMp4.mp4
        root@2156ca034e87:/# ls
        bin c15fada28113eca32dc98d6e3bec4755d0d5b4c2.zip etc lib media models-c15fada28113eca32dc98d6e3bec4755d0d5b4c2 proc run srv testVideoMp4.mp4 usr
        boot dev home lib64 mnt opt root sbin sys tmp var
        root@2156ca034e87:/# python
        Python 2.7.13 |Continuum Analytics, Inc.| (default, Dec 20 2016, 23:09:15)
        [GCC 4.4.7 20120313 (Red Hat 4.4.7-1)] on linux2
        Type "help", "copyright", "credits" or "license" for more information.
        Anaconda is brought to you by Continuum Analytics.
        Please check out: http://continuum.io/thanks and https://anaconda.org
        >>> import cv2
        >>> cap = cv2.VideoCapture('testVideoMp4.mp4')
        >>> cap.isOpened()
        False
        >>>
        root@2156ca034e87:/# ls
        bin c15fada28113eca32dc98d6e3bec4755d0d5b4c2.zip etc lib media models-c15fada28113eca32dc98d6e3bec4755d0d5b4c2 proc run srv testVideoMp4.mp4 usr
        boot dev home lib64 mnt opt root sbin sys tmp var
        root@2156ca034e87:/# exit
        exit
        LMC-053601:tf mattmann$
        ```
        So, @smadha is going to try to build OpenCV with Python and FFMPEG support from source, and then include the instructions for that in the Docker. Any help from others is appreciated.

        ----------------------------------------------------------------
        This is an automated message from the Apache Git Service.
        To respond to the message, please log on GitHub and use the
        URL above to go to the specific comment.

        For queries about this service, please contact Infrastructure at:
        users@infra.apache.org

        Show
        githubbot ASF GitHub Bot added a comment - chrismattmann commented on issue #168: fix for TIKA-2322 contributed by msharan@usc.edu URL: https://github.com/apache/tika/pull/168#issuecomment-297910260 talked to @smadha on GChat. Basically the issue is even in the current Docker file with Conda/OpenCV, etc., we can't get it to recognize the video file (FFMPEG + OpenCV, not working). I SSH'ed onto the docker machine: ``` root@2156ca034e87:/# curl -LO https://github.com/smadha/tika/blob/TIKA-2322/tika-parsers/src/test/resources/test-documents/testVideoMp4.mp4?raw=true % Total % Received % Xferd Average Speed Time Time Time Current Dload Upload Total Spent Left Speed 100 174 0 174 0 0 459 0 - : : - - : : - - : : - 519 100 185 100 185 0 0 320 0 - : : - - : : - - : : - 320 100 1028k 100 1028k 0 0 523k 0 0:00:01 0:00:01 - : : - 888k root@2156ca034e87:/# ls bin c15fada28113eca32dc98d6e3bec4755d0d5b4c2.zip etc lib media models-c15fada28113eca32dc98d6e3bec4755d0d5b4c2 proc run srv testVideoMp4.mp4?raw=true usr boot dev home lib64 mnt opt root sbin sys tmp var root@2156ca034e87:/# mv testVideoMp4.mp4?raw\=true testVideoMp4.mp4 root@2156ca034e87:/# ls bin c15fada28113eca32dc98d6e3bec4755d0d5b4c2.zip etc lib media models-c15fada28113eca32dc98d6e3bec4755d0d5b4c2 proc run srv testVideoMp4.mp4 usr boot dev home lib64 mnt opt root sbin sys tmp var root@2156ca034e87:/# python Python 2.7.13 |Continuum Analytics, Inc.| (default, Dec 20 2016, 23:09:15) [GCC 4.4.7 20120313 (Red Hat 4.4.7-1)] on linux2 Type "help", "copyright", "credits" or "license" for more information. Anaconda is brought to you by Continuum Analytics. Please check out: http://continuum.io/thanks and https://anaconda.org >>> import cv2 >>> cap = cv2.VideoCapture('testVideoMp4.mp4') >>> cap.isOpened() False >>> root@2156ca034e87:/# ls bin c15fada28113eca32dc98d6e3bec4755d0d5b4c2.zip etc lib media models-c15fada28113eca32dc98d6e3bec4755d0d5b4c2 proc run srv testVideoMp4.mp4 usr boot dev home lib64 mnt opt root sbin sys tmp var root@2156ca034e87:/# exit exit LMC-053601:tf mattmann$ ``` So, @smadha is going to try to build OpenCV with Python and FFMPEG support from source, and then include the instructions for that in the Docker. Any help from others is appreciated. ---------------------------------------------------------------- This is an automated message from the Apache Git Service. To respond to the message, please log on GitHub and use the URL above to go to the specific comment. For queries about this service, please contact Infrastructure at: users@infra.apache.org
        Hide
        githubbot ASF GitHub Bot added a comment -

        chrismattmann commented on issue #168: fix for TIKA-2322 contributed by msharan@usc.edu
        URL: https://github.com/apache/tika/pull/168#issuecomment-297910531

        note: check out: http://www.pyimagesearch.com/2015/06/22/install-opencv-3-0-and-python-2-7-on-ubuntu/

        ----------------------------------------------------------------
        This is an automated message from the Apache Git Service.
        To respond to the message, please log on GitHub and use the
        URL above to go to the specific comment.

        For queries about this service, please contact Infrastructure at:
        users@infra.apache.org

        Show
        githubbot ASF GitHub Bot added a comment - chrismattmann commented on issue #168: fix for TIKA-2322 contributed by msharan@usc.edu URL: https://github.com/apache/tika/pull/168#issuecomment-297910531 note: check out: http://www.pyimagesearch.com/2015/06/22/install-opencv-3-0-and-python-2-7-on-ubuntu/ ---------------------------------------------------------------- This is an automated message from the Apache Git Service. To respond to the message, please log on GitHub and use the URL above to go to the specific comment. For queries about this service, please contact Infrastructure at: users@infra.apache.org
        Hide
        githubbot ASF GitHub Bot added a comment -

        smadha commented on issue #168: fix for TIKA-2322 contributed by msharan@usc.edu
        URL: https://github.com/apache/tika/pull/168#issuecomment-297948356

        Guys I tried more stuff but OpenCV is still failing to read videos inside docker. It seems as ffmpeg is not installed as per OpenCV requirements.

        I am attaching logs that I get on building OpenCV while I am sshed inside docker. [ssh_docker_opencv_build.txt](https://github.com/apache/tika/files/963888/ssh_docker_opencv_build.txt)

        After my changes I can successfully see ffmpeg/avformat library being found in logs `Looking for libavformat/avformat.h - found` but still it does not builds with ffmpeg
        ```
        – Looking for linux/videodev.h
        – Looking for linux/videodev.h - not found
        – Looking for linux/videodev2.h
        – Looking for linux/videodev2.h - found
        – Looking for sys/videoio.h
        – Looking for sys/videoio.h - not found
        – Looking for libavformat/avformat.h
        – Looking for libavformat/avformat.h - found
        – Looking for ffmpeg/avformat.h
        – Looking for ffmpeg/avformat.h - found
        ```
        Below lines in log should have a YES against FFMPEG. I think only thing left to try is to build ffmmpeg from source.

        ```
        – Video I/O:
        – DC1394 1.x: NO
        – DC1394 2.x: NO
        – FFMPEG: NO
        – codec: YES (ver )
        – format: YES (ver )
        – util: YES (ver )
        – swscale: NO
        – resample: NO
        – gentoo-style: YES
        – GStreamer: NO
        – OpenNI: NO
        – OpenNI PrimeSensor Modules: NO
        – OpenNI2: NO
        – PvAPI: NO
        – GigEVisionSDK: NO
        – UniCap: NO
        – UniCap ucil: NO
        ```

        Once you have installed OpenCV you can test it by -
        Shell -
        ```shell
        curl -o testVideoMp4.mp4 "https://github.com/smadha/tika/blob/TIKA-2322/tika-parsers/src/test/resources/test-documents/testVideoMp4.mp4?raw=true"
        ```
        python -
        ```python
        import cv2
        cap = cv2.VideoCapture('testVideoMp4.mp4')
        print cap.isOpened()
        ```

        I am thinking it should be either a small change in my approach or we need to replace https://github.com/smadha/tika/blob/TIKA-2322/tika-parsers/src/main/resources/org/apache/tika/parser/recognition/tf/InceptionVideoRestDockerfile#L26-L32 with ffmpeg build from source.

        ----------------------------------------------------------------
        This is an automated message from the Apache Git Service.
        To respond to the message, please log on GitHub and use the
        URL above to go to the specific comment.

        For queries about this service, please contact Infrastructure at:
        users@infra.apache.org

        Show
        githubbot ASF GitHub Bot added a comment - smadha commented on issue #168: fix for TIKA-2322 contributed by msharan@usc.edu URL: https://github.com/apache/tika/pull/168#issuecomment-297948356 Guys I tried more stuff but OpenCV is still failing to read videos inside docker. It seems as ffmpeg is not installed as per OpenCV requirements. I am attaching logs that I get on building OpenCV while I am sshed inside docker. [ssh_docker_opencv_build.txt] ( https://github.com/apache/tika/files/963888/ssh_docker_opencv_build.txt ) After my changes I can successfully see ffmpeg/avformat library being found in logs `Looking for libavformat/avformat.h - found` but still it does not builds with ffmpeg ``` – Looking for linux/videodev.h – Looking for linux/videodev.h - not found – Looking for linux/videodev2.h – Looking for linux/videodev2.h - found – Looking for sys/videoio.h – Looking for sys/videoio.h - not found – Looking for libavformat/avformat.h – Looking for libavformat/avformat.h - found – Looking for ffmpeg/avformat.h – Looking for ffmpeg/avformat.h - found ``` Below lines in log should have a YES against FFMPEG. I think only thing left to try is to build ffmmpeg from source. ``` – Video I/O: – DC1394 1.x: NO – DC1394 2.x: NO – FFMPEG: NO – codec: YES (ver ) – format: YES (ver ) – util: YES (ver ) – swscale: NO – resample: NO – gentoo-style: YES – GStreamer: NO – OpenNI: NO – OpenNI PrimeSensor Modules: NO – OpenNI2: NO – PvAPI: NO – GigEVisionSDK: NO – UniCap: NO – UniCap ucil: NO ``` Once you have installed OpenCV you can test it by - Shell - ```shell curl -o testVideoMp4.mp4 "https://github.com/smadha/tika/blob/TIKA-2322/tika-parsers/src/test/resources/test-documents/testVideoMp4.mp4?raw=true" ``` python - ```python import cv2 cap = cv2.VideoCapture('testVideoMp4.mp4') print cap.isOpened() ``` I am thinking it should be either a small change in my approach or we need to replace https://github.com/smadha/tika/blob/TIKA-2322/tika-parsers/src/main/resources/org/apache/tika/parser/recognition/tf/InceptionVideoRestDockerfile#L26-L32 with ffmpeg build from source. ---------------------------------------------------------------- This is an automated message from the Apache Git Service. To respond to the message, please log on GitHub and use the URL above to go to the specific comment. For queries about this service, please contact Infrastructure at: users@infra.apache.org
        Hide
        githubbot ASF GitHub Bot added a comment -

        thammegowda commented on issue #168: fix for TIKA-2322 contributed by msharan@usc.edu
        URL: https://github.com/apache/tika/pull/168#issuecomment-298059923

        @smadha may be those video libraries require GUI / X-window server? Docker image is ubuntu server without any X11 stuff. I am saying may be

        ----------------------------------------------------------------
        This is an automated message from the Apache Git Service.
        To respond to the message, please log on GitHub and use the
        URL above to go to the specific comment.

        For queries about this service, please contact Infrastructure at:
        users@infra.apache.org

        Show
        githubbot ASF GitHub Bot added a comment - thammegowda commented on issue #168: fix for TIKA-2322 contributed by msharan@usc.edu URL: https://github.com/apache/tika/pull/168#issuecomment-298059923 @smadha may be those video libraries require GUI / X-window server? Docker image is ubuntu server without any X11 stuff. I am saying may be ---------------------------------------------------------------- This is an automated message from the Apache Git Service. To respond to the message, please log on GitHub and use the URL above to go to the specific comment. For queries about this service, please contact Infrastructure at: users@infra.apache.org
        Hide
        githubbot ASF GitHub Bot added a comment -

        ThejanW commented on issue #168: fix for TIKA-2322 contributed by msharan@usc.edu
        URL: https://github.com/apache/tika/pull/168#issuecomment-298082341

        perhaps building OpenCV from sources inside the docker might solve the problem, but building OpenCV takes a lot of time.

        ----------------------------------------------------------------
        This is an automated message from the Apache Git Service.
        To respond to the message, please log on GitHub and use the
        URL above to go to the specific comment.

        For queries about this service, please contact Infrastructure at:
        users@infra.apache.org

        Show
        githubbot ASF GitHub Bot added a comment - ThejanW commented on issue #168: fix for TIKA-2322 contributed by msharan@usc.edu URL: https://github.com/apache/tika/pull/168#issuecomment-298082341 perhaps building OpenCV from sources inside the docker might solve the problem, but building OpenCV takes a lot of time. ---------------------------------------------------------------- This is an automated message from the Apache Git Service. To respond to the message, please log on GitHub and use the URL above to go to the specific comment. For queries about this service, please contact Infrastructure at: users@infra.apache.org
        Hide
        githubbot ASF GitHub Bot added a comment -

        ThejanW commented on issue #168: fix for TIKA-2322 contributed by msharan@usc.edu
        URL: https://github.com/apache/tika/pull/168#issuecomment-298083264

        @smadha did you try building ffmpeg from sources?

        ----------------------------------------------------------------
        This is an automated message from the Apache Git Service.
        To respond to the message, please log on GitHub and use the
        URL above to go to the specific comment.

        For queries about this service, please contact Infrastructure at:
        users@infra.apache.org

        Show
        githubbot ASF GitHub Bot added a comment - ThejanW commented on issue #168: fix for TIKA-2322 contributed by msharan@usc.edu URL: https://github.com/apache/tika/pull/168#issuecomment-298083264 @smadha did you try building ffmpeg from sources? ---------------------------------------------------------------- This is an automated message from the Apache Git Service. To respond to the message, please log on GitHub and use the URL above to go to the specific comment. For queries about this service, please contact Infrastructure at: users@infra.apache.org
        Hide
        githubbot ASF GitHub Bot added a comment -

        ThejanW commented on issue #168: fix for TIKA-2322 contributed by msharan@usc.edu
        URL: https://github.com/apache/tika/pull/168#issuecomment-298082341

        perhaps building OpenCV from sources inside the docker might solve the problem, but building OpenCV takes a lot of time.

        ----------------------------------------------------------------
        This is an automated message from the Apache Git Service.
        To respond to the message, please log on GitHub and use the
        URL above to go to the specific comment.

        For queries about this service, please contact Infrastructure at:
        users@infra.apache.org

        Show
        githubbot ASF GitHub Bot added a comment - ThejanW commented on issue #168: fix for TIKA-2322 contributed by msharan@usc.edu URL: https://github.com/apache/tika/pull/168#issuecomment-298082341 perhaps building OpenCV from sources inside the docker might solve the problem, but building OpenCV takes a lot of time. ---------------------------------------------------------------- This is an automated message from the Apache Git Service. To respond to the message, please log on GitHub and use the URL above to go to the specific comment. For queries about this service, please contact Infrastructure at: users@infra.apache.org
        Hide
        githubbot ASF GitHub Bot added a comment -

        chrismattmann commented on issue #168: fix for TIKA-2322 contributed by msharan@usc.edu
        URL: https://github.com/apache/tika/pull/168#issuecomment-298133404

        updates/help wanted. if folks could try to update the docker for FFMPEG/OpenCV (even from source, thanks)

        ----------------------------------------------------------------
        This is an automated message from the Apache Git Service.
        To respond to the message, please log on GitHub and use the
        URL above to go to the specific comment.

        For queries about this service, please contact Infrastructure at:
        users@infra.apache.org

        Show
        githubbot ASF GitHub Bot added a comment - chrismattmann commented on issue #168: fix for TIKA-2322 contributed by msharan@usc.edu URL: https://github.com/apache/tika/pull/168#issuecomment-298133404 updates/help wanted. if folks could try to update the docker for FFMPEG/OpenCV (even from source, thanks) ---------------------------------------------------------------- This is an automated message from the Apache Git Service. To respond to the message, please log on GitHub and use the URL above to go to the specific comment. For queries about this service, please contact Infrastructure at: users@infra.apache.org
        Hide
        githubbot ASF GitHub Bot added a comment -

        ThejanW commented on issue #168: fix for TIKA-2322 contributed by msharan@usc.edu
        URL: https://github.com/apache/tika/pull/168#issuecomment-298181098

        @chrismattmann I'd give it a try.

        ----------------------------------------------------------------
        This is an automated message from the Apache Git Service.
        To respond to the message, please log on GitHub and use the
        URL above to go to the specific comment.

        For queries about this service, please contact Infrastructure at:
        users@infra.apache.org

        Show
        githubbot ASF GitHub Bot added a comment - ThejanW commented on issue #168: fix for TIKA-2322 contributed by msharan@usc.edu URL: https://github.com/apache/tika/pull/168#issuecomment-298181098 @chrismattmann I'd give it a try. ---------------------------------------------------------------- This is an automated message from the Apache Git Service. To respond to the message, please log on GitHub and use the URL above to go to the specific comment. For queries about this service, please contact Infrastructure at: users@infra.apache.org
        Hide
        githubbot ASF GitHub Bot added a comment -

        ThejanW commented on issue #168: fix for TIKA-2322 contributed by msharan@usc.edu
        URL: https://github.com/apache/tika/pull/168#issuecomment-298195464

        Well, I built the image after replacing https://github.com/smadha/tika/blob/TIKA-2322/tika-parsers/src/main/resources/org/apache/tika/parser/recognition/tf/InceptionVideoRestDockerfile#L26-L32 with ffmpeg build from source. unfortunately, no luck, still it gives me false for "cap.isOpened()". The problem is OpenCV has to find FFMPEG on the given system at *compile time* and generate links. Currently , Conda's pre-built OpenCV package isn't compiled with ffmpeg support. They say ffmpeg is difficult to support across platforms. So you are only left with a single solution, you have to build OpenCV within docker.

        ----------------------------------------------------------------
        This is an automated message from the Apache Git Service.
        To respond to the message, please log on GitHub and use the
        URL above to go to the specific comment.

        For queries about this service, please contact Infrastructure at:
        users@infra.apache.org

        Show
        githubbot ASF GitHub Bot added a comment - ThejanW commented on issue #168: fix for TIKA-2322 contributed by msharan@usc.edu URL: https://github.com/apache/tika/pull/168#issuecomment-298195464 Well, I built the image after replacing https://github.com/smadha/tika/blob/TIKA-2322/tika-parsers/src/main/resources/org/apache/tika/parser/recognition/tf/InceptionVideoRestDockerfile#L26-L32 with ffmpeg build from source. unfortunately, no luck, still it gives me false for "cap.isOpened()". The problem is OpenCV has to find FFMPEG on the given system at * compile time * and generate links. Currently , Conda's pre-built OpenCV package isn't compiled with ffmpeg support. They say ffmpeg is difficult to support across platforms. So you are only left with a single solution, you have to build OpenCV within docker. ---------------------------------------------------------------- This is an automated message from the Apache Git Service. To respond to the message, please log on GitHub and use the URL above to go to the specific comment. For queries about this service, please contact Infrastructure at: users@infra.apache.org
        Hide
        githubbot ASF GitHub Bot added a comment -

        chrismattmann commented on issue #168: fix for TIKA-2322 contributed by msharan@usc.edu
        URL: https://github.com/apache/tika/pull/168#issuecomment-298196167

        Thanks @ThejanW agreed, we need to build OpenCV within Docker, and before that build FFMPEG from source too within Docker, and then just link them. Can you provide a update or gist showing what you did to successfully build FFMPEG within Docker?

        ----------------------------------------------------------------
        This is an automated message from the Apache Git Service.
        To respond to the message, please log on GitHub and use the
        URL above to go to the specific comment.

        For queries about this service, please contact Infrastructure at:
        users@infra.apache.org

        Show
        githubbot ASF GitHub Bot added a comment - chrismattmann commented on issue #168: fix for TIKA-2322 contributed by msharan@usc.edu URL: https://github.com/apache/tika/pull/168#issuecomment-298196167 Thanks @ThejanW agreed, we need to build OpenCV within Docker, and before that build FFMPEG from source too within Docker, and then just link them. Can you provide a update or gist showing what you did to successfully build FFMPEG within Docker? ---------------------------------------------------------------- This is an automated message from the Apache Git Service. To respond to the message, please log on GitHub and use the URL above to go to the specific comment. For queries about this service, please contact Infrastructure at: users@infra.apache.org
        Hide
        githubbot ASF GitHub Bot added a comment -

        ThejanW commented on issue #168: fix for TIKA-2322 contributed by msharan@usc.edu
        URL: https://github.com/apache/tika/pull/168#issuecomment-298196751

        1. Take base as Ubuntu: 16.04, then you can simply apt-get install ffmpeg
          FROM ubuntu:16.04
          RUN apt-get update && \
          apt-get install ffmpeg
        1. Dependencies
          RUN apt-get install build-essential cmake pkg-config \
          libjpeg8-dev libtiff5-dev libjasper-dev libpng12-dev \
          libavcodec-dev libavformat-dev libswscale-dev libv4l-dev \
          libxvidcore-dev libx264-dev \
          libgtk2.0-dev \
          libatlas-base-dev gfortran \
          python2.7-dev python3-dev \
          python-pip
          RUN \
          pip install --upgrade pip && \
          pip install numpy

        WORKDIR /usr/local/src

        1. Download OpenCV 3 + Contrib
          RUN \
          wget -O opencv.zip https://github.com/Itseez/opencv/archive/3.2.0.zip && \
          unzip opencv.zip && \
          wget -O opencv_contrib.zip https://github.com/Itseez/opencv_contrib/archive/3.2.0.zip && \
          unzip opencv_contrib.zip
        1. Build + Install OpenCV
          WORKDIR /usr/local/src/opencv-3.2.0/
          RUN mkdir build
          WORKDIR /usr/local/src/opencv-3.2.0/build
          RUN cmake -D CMAKE_BUILD_TYPE=RELEASE \
          -D CMAKE_INSTALL_PREFIX=/usr/local \
          -D INSTALL_PYTHON_EXAMPLES=ON \
          -D INSTALL_C_EXAMPLES=OFF \
          -D OPENCV_EXTRA_MODULES_PATH=~/opencv_contrib-3.2.0/modules \
          -D BUILD_EXAMPLES=ON ..
          RUN make -j8
          RUN make install

        ----------------------------------------------------------------
        This is an automated message from the Apache Git Service.
        To respond to the message, please log on GitHub and use the
        URL above to go to the specific comment.

        For queries about this service, please contact Infrastructure at:
        users@infra.apache.org

        Show
        githubbot ASF GitHub Bot added a comment - ThejanW commented on issue #168: fix for TIKA-2322 contributed by msharan@usc.edu URL: https://github.com/apache/tika/pull/168#issuecomment-298196751 Take base as Ubuntu: 16.04, then you can simply apt-get install ffmpeg FROM ubuntu:16.04 RUN apt-get update && \ apt-get install ffmpeg Dependencies RUN apt-get install build-essential cmake pkg-config \ libjpeg8-dev libtiff5-dev libjasper-dev libpng12-dev \ libavcodec-dev libavformat-dev libswscale-dev libv4l-dev \ libxvidcore-dev libx264-dev \ libgtk2.0-dev \ libatlas-base-dev gfortran \ python2.7-dev python3-dev \ python-pip RUN \ pip install --upgrade pip && \ pip install numpy WORKDIR /usr/local/src Download OpenCV 3 + Contrib RUN \ wget -O opencv.zip https://github.com/Itseez/opencv/archive/3.2.0.zip && \ unzip opencv.zip && \ wget -O opencv_contrib.zip https://github.com/Itseez/opencv_contrib/archive/3.2.0.zip && \ unzip opencv_contrib.zip Build + Install OpenCV WORKDIR /usr/local/src/opencv-3.2.0/ RUN mkdir build WORKDIR /usr/local/src/opencv-3.2.0/build RUN cmake -D CMAKE_BUILD_TYPE=RELEASE \ -D CMAKE_INSTALL_PREFIX=/usr/local \ -D INSTALL_PYTHON_EXAMPLES=ON \ -D INSTALL_C_EXAMPLES=OFF \ -D OPENCV_EXTRA_MODULES_PATH=~/opencv_contrib-3.2.0/modules \ -D BUILD_EXAMPLES=ON .. RUN make -j8 RUN make install ---------------------------------------------------------------- This is an automated message from the Apache Git Service. To respond to the message, please log on GitHub and use the URL above to go to the specific comment. For queries about this service, please contact Infrastructure at: users@infra.apache.org
        Hide
        githubbot ASF GitHub Bot added a comment -

        ThejanW commented on issue #168: fix for TIKA-2322 contributed by msharan@usc.edu
        URL: https://github.com/apache/tika/pull/168#issuecomment-298196751

        1. Take base as Ubuntu: 16.04, then you can simply apt-get install ffmpeg
          FROM ubuntu:16.04
          RUN apt-get update && \
          apt-get install ffmpeg
        1. Dependencies
          RUN apt-get install build-essential cmake pkg-config \
          libjpeg8-dev libtiff5-dev libjasper-dev libpng12-dev \
          libavcodec-dev libavformat-dev libswscale-dev libv4l-dev \
          libxvidcore-dev libx264-dev \
          libgtk2.0-dev \
          libatlas-base-dev gfortran \
          python2.7-dev python3-dev \
          python-pip
          RUN \
          pip install --upgrade pip && \
          pip install numpy

        WORKDIR /usr/local/src

        1. Download OpenCV 3 + Contrib
          RUN \
          wget -O opencv.zip https://github.com/Itseez/opencv/archive/3.2.0.zip && \
          unzip opencv.zip && \
          wget -O opencv_contrib.zip https://github.com/Itseez/opencv_contrib/archive/3.2.0.zip && \
          unzip opencv_contrib.zip
        1. Build + Install OpenCV
          WORKDIR /usr/local/src/opencv-3.2.0/
          RUN mkdir build
          WORKDIR /usr/local/src/opencv-3.2.0/build
          RUN cmake -D CMAKE_BUILD_TYPE=RELEASE \
          -D CMAKE_INSTALL_PREFIX=/usr/local \
          -D INSTALL_PYTHON_EXAMPLES=ON \
          -D INSTALL_C_EXAMPLES=OFF \
          -D OPENCV_EXTRA_MODULES_PATH=~/opencv_contrib-3.2.0/modules \
          -D BUILD_EXAMPLES=ON ..
          RUN make -j8
          RUN make install

        ----------------------------------------------------------------
        This is an automated message from the Apache Git Service.
        To respond to the message, please log on GitHub and use the
        URL above to go to the specific comment.

        For queries about this service, please contact Infrastructure at:
        users@infra.apache.org

        Show
        githubbot ASF GitHub Bot added a comment - ThejanW commented on issue #168: fix for TIKA-2322 contributed by msharan@usc.edu URL: https://github.com/apache/tika/pull/168#issuecomment-298196751 Take base as Ubuntu: 16.04, then you can simply apt-get install ffmpeg FROM ubuntu:16.04 RUN apt-get update && \ apt-get install ffmpeg Dependencies RUN apt-get install build-essential cmake pkg-config \ libjpeg8-dev libtiff5-dev libjasper-dev libpng12-dev \ libavcodec-dev libavformat-dev libswscale-dev libv4l-dev \ libxvidcore-dev libx264-dev \ libgtk2.0-dev \ libatlas-base-dev gfortran \ python2.7-dev python3-dev \ python-pip RUN \ pip install --upgrade pip && \ pip install numpy WORKDIR /usr/local/src Download OpenCV 3 + Contrib RUN \ wget -O opencv.zip https://github.com/Itseez/opencv/archive/3.2.0.zip && \ unzip opencv.zip && \ wget -O opencv_contrib.zip https://github.com/Itseez/opencv_contrib/archive/3.2.0.zip && \ unzip opencv_contrib.zip Build + Install OpenCV WORKDIR /usr/local/src/opencv-3.2.0/ RUN mkdir build WORKDIR /usr/local/src/opencv-3.2.0/build RUN cmake -D CMAKE_BUILD_TYPE=RELEASE \ -D CMAKE_INSTALL_PREFIX=/usr/local \ -D INSTALL_PYTHON_EXAMPLES=ON \ -D INSTALL_C_EXAMPLES=OFF \ -D OPENCV_EXTRA_MODULES_PATH=~/opencv_contrib-3.2.0/modules \ -D BUILD_EXAMPLES=ON .. RUN make -j8 RUN make install ---------------------------------------------------------------- This is an automated message from the Apache Git Service. To respond to the message, please log on GitHub and use the URL above to go to the specific comment. For queries about this service, please contact Infrastructure at: users@infra.apache.org
        Hide
        githubbot ASF GitHub Bot added a comment -

        ThejanW commented on issue #168: fix for TIKA-2322 contributed by msharan@usc.edu
        URL: https://github.com/apache/tika/pull/168#issuecomment-298197362

        I created this file just now[1]. I think this should do what we need to do. Couldn't test this file, so there could be errors. OpenCV takes time to build. Could someone try this out, or tomorrow, I'll build this.
        [1] https://gist.github.com/ThejanW/bffe660f5a7718b8672ee49f855c6a6f

        ----------------------------------------------------------------
        This is an automated message from the Apache Git Service.
        To respond to the message, please log on GitHub and use the
        URL above to go to the specific comment.

        For queries about this service, please contact Infrastructure at:
        users@infra.apache.org

        Show
        githubbot ASF GitHub Bot added a comment - ThejanW commented on issue #168: fix for TIKA-2322 contributed by msharan@usc.edu URL: https://github.com/apache/tika/pull/168#issuecomment-298197362 I created this file just now [1] . I think this should do what we need to do. Couldn't test this file, so there could be errors. OpenCV takes time to build. Could someone try this out, or tomorrow, I'll build this. [1] https://gist.github.com/ThejanW/bffe660f5a7718b8672ee49f855c6a6f ---------------------------------------------------------------- This is an automated message from the Apache Git Service. To respond to the message, please log on GitHub and use the URL above to go to the specific comment. For queries about this service, please contact Infrastructure at: users@infra.apache.org
        Hide
        githubbot ASF GitHub Bot added a comment -

        ThejanW commented on issue #168: fix for TIKA-2322 contributed by msharan@usc.edu
        URL: https://github.com/apache/tika/pull/168#issuecomment-298197510

        @chrismattmann I presume, we don't need to build FFMPEG. Just having FFMPEG installed in the system suffices. In the gist I shared, I am installing FFMPEG with a simple apt-get. But we need to build OpenCV.

        ----------------------------------------------------------------
        This is an automated message from the Apache Git Service.
        To respond to the message, please log on GitHub and use the
        URL above to go to the specific comment.

        For queries about this service, please contact Infrastructure at:
        users@infra.apache.org

        Show
        githubbot ASF GitHub Bot added a comment - ThejanW commented on issue #168: fix for TIKA-2322 contributed by msharan@usc.edu URL: https://github.com/apache/tika/pull/168#issuecomment-298197510 @chrismattmann I presume, we don't need to build FFMPEG. Just having FFMPEG installed in the system suffices. In the gist I shared, I am installing FFMPEG with a simple apt-get. But we need to build OpenCV. ---------------------------------------------------------------- This is an automated message from the Apache Git Service. To respond to the message, please log on GitHub and use the URL above to go to the specific comment. For queries about this service, please contact Infrastructure at: users@infra.apache.org
        Hide
        githubbot ASF GitHub Bot added a comment -

        ThejanW commented on issue #168: fix for TIKA-2322 contributed by msharan@usc.edu
        URL: https://github.com/apache/tika/pull/168#issuecomment-298227933

        Well, building OpenCV with my docker file worked.

        root@65a7649c4342:~# python
        Python 2.7.12 (default, Nov 19 2016, 06:48:10)
        [GCC 5.4.0 20160609] on linux2
        Type "help", "copyright", "credits" or "license" for more information.
        >>> import cv2
        >>> cap = cv2.VideoCapture('testVideoMp4.mp4')
        >>> cap.isOpened()
        True
        >>>

        ----------------------------------------------------------------
        This is an automated message from the Apache Git Service.
        To respond to the message, please log on GitHub and use the
        URL above to go to the specific comment.

        For queries about this service, please contact Infrastructure at:
        users@infra.apache.org

        Show
        githubbot ASF GitHub Bot added a comment - ThejanW commented on issue #168: fix for TIKA-2322 contributed by msharan@usc.edu URL: https://github.com/apache/tika/pull/168#issuecomment-298227933 Well, building OpenCV with my docker file worked. root@65a7649c4342:~# python Python 2.7.12 (default, Nov 19 2016, 06:48:10) [GCC 5.4.0 20160609] on linux2 Type "help", "copyright", "credits" or "license" for more information. >>> import cv2 >>> cap = cv2.VideoCapture('testVideoMp4.mp4') >>> cap.isOpened() True >>> ---------------------------------------------------------------- This is an automated message from the Apache Git Service. To respond to the message, please log on GitHub and use the URL above to go to the specific comment. For queries about this service, please contact Infrastructure at: users@infra.apache.org
        Hide
        githubbot ASF GitHub Bot added a comment -

        ThejanW commented on issue #168: fix for TIKA-2322 contributed by msharan@usc.edu
        URL: https://github.com/apache/tika/pull/168#issuecomment-298228190

        https://gist.github.com/ThejanW/bffe660f5a7718b8672ee49f855c6a6f

        ----------------------------------------------------------------
        This is an automated message from the Apache Git Service.
        To respond to the message, please log on GitHub and use the
        URL above to go to the specific comment.

        For queries about this service, please contact Infrastructure at:
        users@infra.apache.org

        Show
        githubbot ASF GitHub Bot added a comment - ThejanW commented on issue #168: fix for TIKA-2322 contributed by msharan@usc.edu URL: https://github.com/apache/tika/pull/168#issuecomment-298228190 https://gist.github.com/ThejanW/bffe660f5a7718b8672ee49f855c6a6f ---------------------------------------------------------------- This is an automated message from the Apache Git Service. To respond to the message, please log on GitHub and use the URL above to go to the specific comment. For queries about this service, please contact Infrastructure at: users@infra.apache.org
        Hide
        githubbot ASF GitHub Bot added a comment -

        ThejanW commented on issue #168: fix for TIKA-2322 contributed by msharan@usc.edu
        URL: https://github.com/apache/tika/pull/168#issuecomment-298248138

        @thammegowda thank you for running the tests.

        ----------------------------------------------------------------
        This is an automated message from the Apache Git Service.
        To respond to the message, please log on GitHub and use the
        URL above to go to the specific comment.

        For queries about this service, please contact Infrastructure at:
        users@infra.apache.org

        Show
        githubbot ASF GitHub Bot added a comment - ThejanW commented on issue #168: fix for TIKA-2322 contributed by msharan@usc.edu URL: https://github.com/apache/tika/pull/168#issuecomment-298248138 @thammegowda thank you for running the tests. ---------------------------------------------------------------- This is an automated message from the Apache Git Service. To respond to the message, please log on GitHub and use the URL above to go to the specific comment. For queries about this service, please contact Infrastructure at: users@infra.apache.org
        Hide
        githubbot ASF GitHub Bot added a comment -

        chrismattmann commented on issue #168: fix for TIKA-2322 contributed by msharan@usc.edu
        URL: https://github.com/apache/tika/pull/168#issuecomment-298248251

        Guys I tried to install and it runs out of space in cmake any ideas?

        ----------------------------------------------------------------
        This is an automated message from the Apache Git Service.
        To respond to the message, please log on GitHub and use the
        URL above to go to the specific comment.

        For queries about this service, please contact Infrastructure at:
        users@infra.apache.org

        Show
        githubbot ASF GitHub Bot added a comment - chrismattmann commented on issue #168: fix for TIKA-2322 contributed by msharan@usc.edu URL: https://github.com/apache/tika/pull/168#issuecomment-298248251 Guys I tried to install and it runs out of space in cmake any ideas? ---------------------------------------------------------------- This is an automated message from the Apache Git Service. To respond to the message, please log on GitHub and use the URL above to go to the specific comment. For queries about this service, please contact Infrastructure at: users@infra.apache.org
        Hide
        githubbot ASF GitHub Bot added a comment -

        ThejanW commented on issue #168: fix for TIKA-2322 contributed by msharan@usc.edu
        URL: https://github.com/apache/tika/pull/168#issuecomment-298248593

        hmmm...I think its because of *RUN make -j8* line in docker file.
        Could you edit that line? replace *-j8* with *-j4* or something and try to rebuild the docker file. It's the number of parellel threads.

        ----------------------------------------------------------------
        This is an automated message from the Apache Git Service.
        To respond to the message, please log on GitHub and use the
        URL above to go to the specific comment.

        For queries about this service, please contact Infrastructure at:
        users@infra.apache.org

        Show
        githubbot ASF GitHub Bot added a comment - ThejanW commented on issue #168: fix for TIKA-2322 contributed by msharan@usc.edu URL: https://github.com/apache/tika/pull/168#issuecomment-298248593 hmmm...I think its because of * RUN make -j8 * line in docker file. Could you edit that line? replace * -j8 * with * -j4 * or something and try to rebuild the docker file. It's the number of parellel threads. ---------------------------------------------------------------- This is an automated message from the Apache Git Service. To respond to the message, please log on GitHub and use the URL above to go to the specific comment. For queries about this service, please contact Infrastructure at: users@infra.apache.org
        Hide
        githubbot ASF GitHub Bot added a comment -

        thammegowda commented on issue #168: fix for TIKA-2322 contributed by msharan@usc.edu
        URL: https://github.com/apache/tika/pull/168#issuecomment-298249438

        > Guys I tried to install and it runs out of space in cmake any ideas?

        @chrismattmann Is it running with virtual box? if so, try increasing resources to VM

        ----------------------------------------------------------------
        This is an automated message from the Apache Git Service.
        To respond to the message, please log on GitHub and use the
        URL above to go to the specific comment.

        For queries about this service, please contact Infrastructure at:
        users@infra.apache.org

        Show
        githubbot ASF GitHub Bot added a comment - thammegowda commented on issue #168: fix for TIKA-2322 contributed by msharan@usc.edu URL: https://github.com/apache/tika/pull/168#issuecomment-298249438 > Guys I tried to install and it runs out of space in cmake any ideas? @chrismattmann Is it running with virtual box? if so, try increasing resources to VM ---------------------------------------------------------------- This is an automated message from the Apache Git Service. To respond to the message, please log on GitHub and use the URL above to go to the specific comment. For queries about this service, please contact Infrastructure at: users@infra.apache.org
        Hide
        githubbot ASF GitHub Bot added a comment -

        chrismattmann commented on issue #168: fix for TIKA-2322 contributed by msharan@usc.edu
        URL: https://github.com/apache/tika/pull/168#issuecomment-298253974

        Good news! I updated my docker-machine, and built a new vbox VM using Docker Quickstart terminal, used @ThejanW's updated docker file, rebuilt it, and ran the tests, and:

        ```
        LMC-053601:smadha-tika mattmann$ java -jar tika-app/target/tika-app-1.15-SNAPSHOT.jar --config=tika-parsers/src/test/resources/org/apache/tika/parser/recognition/tika-config-tflow-video-rest.xml ./tika-parsers/src/test/resources/test-documents/testVideoMp4.mp4
        WARN JBIG2ImageReader not loaded. jbig2 files will be ignored
        INFO Available = true, API Status = HTTP/1.0 200 OK
        INFO minConfidence = 0.015, topN=4
        INFO Recogniser = org.apache.tika.parser.recognition.tf.TensorflowRESTVideoRecogniser
        INFO Recogniser Available = true
        <?xml version="1.0" encoding="UTF-8"?><html xmlns="http://www.w3.org/1999/xhtml">
        <head>
        <meta name="Content-Length" content="1053651"/>
        <meta name="OBJECT" content="king penguin, Aptenodytes patagonica (0.19076)"/>
        <meta name="OBJECT" content="hare (0.13538)"/>
        <meta name="OBJECT" content="wallaby, brush kangaroo (0.09441)"/>
        <meta name="OBJECT" content="ice bear, polar bear, Ursus Maritimus, Thalarctos maritimus (0.09350)"/>
        <meta name="X-Parsed-By" content="org.apache.tika.parser.CompositeParser"/>
        <meta name="X-Parsed-By" content="org.apache.tika.parser.recognition.ObjectRecognitionParser"/>
        <meta name="Content-Type" content="video/mp4"/>
        <meta name="resourceName" content="testVideoMp4.mp4"/>
        <meta name="org.apache.tika.parser.recognition.object.rec.impl" content="org.apache.tika.parser.recognition.tf.TensorflowRESTVideoRecogniser"/>
        <title/>
        </head>
        <body><ol id="objects"> <li id="king penguin, Aptenodytes patagonica"> king penguin, Aptenodytes patagonica [en](confidence = 0.190764 )</li>
        <li id="hare"> hare [en](confidence = 0.135382 )</li>
        <li id="wallaby, brush kangaroo"> wallaby, brush kangaroo [en](confidence = 0.094413 )</li>
        <li id="ice bear, polar bear, Ursus Maritimus, Thalarctos maritimus"> ice bear, polar bear, Ursus Maritimus, Thalarctos maritimus [en](confidence = 0.093503 )</li>
        </ol>
        </body></html>LMC-053601:smadha-tika mattmann$
        ```

        Yay! I'm going to push the updated docker file now, and prepare to commit this. I will also create a wiki page before doing so. Give me 20 mins.

        ----------------------------------------------------------------
        This is an automated message from the Apache Git Service.
        To respond to the message, please log on GitHub and use the
        URL above to go to the specific comment.

        For queries about this service, please contact Infrastructure at:
        users@infra.apache.org

        Show
        githubbot ASF GitHub Bot added a comment - chrismattmann commented on issue #168: fix for TIKA-2322 contributed by msharan@usc.edu URL: https://github.com/apache/tika/pull/168#issuecomment-298253974 Good news! I updated my docker-machine, and built a new vbox VM using Docker Quickstart terminal, used @ThejanW's updated docker file, rebuilt it, and ran the tests, and: ``` LMC-053601:smadha-tika mattmann$ java -jar tika-app/target/tika-app-1.15-SNAPSHOT.jar --config=tika-parsers/src/test/resources/org/apache/tika/parser/recognition/tika-config-tflow-video-rest.xml ./tika-parsers/src/test/resources/test-documents/testVideoMp4.mp4 WARN JBIG2ImageReader not loaded. jbig2 files will be ignored INFO Available = true, API Status = HTTP/1.0 200 OK INFO minConfidence = 0.015, topN=4 INFO Recogniser = org.apache.tika.parser.recognition.tf.TensorflowRESTVideoRecogniser INFO Recogniser Available = true <?xml version="1.0" encoding="UTF-8"?><html xmlns="http://www.w3.org/1999/xhtml"> <head> <meta name="Content-Length" content="1053651"/> <meta name="OBJECT" content="king penguin, Aptenodytes patagonica (0.19076)"/> <meta name="OBJECT" content="hare (0.13538)"/> <meta name="OBJECT" content="wallaby, brush kangaroo (0.09441)"/> <meta name="OBJECT" content="ice bear, polar bear, Ursus Maritimus, Thalarctos maritimus (0.09350)"/> <meta name="X-Parsed-By" content="org.apache.tika.parser.CompositeParser"/> <meta name="X-Parsed-By" content="org.apache.tika.parser.recognition.ObjectRecognitionParser"/> <meta name="Content-Type" content="video/mp4"/> <meta name="resourceName" content="testVideoMp4.mp4"/> <meta name="org.apache.tika.parser.recognition.object.rec.impl" content="org.apache.tika.parser.recognition.tf.TensorflowRESTVideoRecogniser"/> <title/> </head> <body><ol id="objects"> <li id="king penguin, Aptenodytes patagonica"> king penguin, Aptenodytes patagonica [en] (confidence = 0.190764 )</li> <li id="hare"> hare [en] (confidence = 0.135382 )</li> <li id="wallaby, brush kangaroo"> wallaby, brush kangaroo [en] (confidence = 0.094413 )</li> <li id="ice bear, polar bear, Ursus Maritimus, Thalarctos maritimus"> ice bear, polar bear, Ursus Maritimus, Thalarctos maritimus [en] (confidence = 0.093503 )</li> </ol> </body></html>LMC-053601:smadha-tika mattmann$ ``` Yay! I'm going to push the updated docker file now, and prepare to commit this. I will also create a wiki page before doing so. Give me 20 mins. ---------------------------------------------------------------- This is an automated message from the Apache Git Service. To respond to the message, please log on GitHub and use the URL above to go to the specific comment. For queries about this service, please contact Infrastructure at: users@infra.apache.org
        Hide
        githubbot ASF GitHub Bot added a comment -

        chrismattmann closed pull request #168: fix for TIKA-2322 contributed by msharan@usc.edu
        URL: https://github.com/apache/tika/pull/168

        ----------------------------------------------------------------
        This is an automated message from the Apache Git Service.
        To respond to the message, please log on GitHub and use the
        URL above to go to the specific comment.

        For queries about this service, please contact Infrastructure at:
        users@infra.apache.org

        Show
        githubbot ASF GitHub Bot added a comment - chrismattmann closed pull request #168: fix for TIKA-2322 contributed by msharan@usc.edu URL: https://github.com/apache/tika/pull/168 ---------------------------------------------------------------- This is an automated message from the Apache Git Service. To respond to the message, please log on GitHub and use the URL above to go to the specific comment. For queries about this service, please contact Infrastructure at: users@infra.apache.org
        Hide
        githubbot ASF GitHub Bot added a comment -

        chrismattmann commented on issue #168: fix for TIKA-2322 contributed by msharan@usc.edu
        URL: https://github.com/apache/tika/pull/168#issuecomment-298260810

        Documentation added in: https://wiki.apache.org/tika/TikaAndVisionVideo#preview. Thank you to @smadha, @ThejanW and @thammegowda Tika can now automatically detect objects in video!

        ----------------------------------------------------------------
        This is an automated message from the Apache Git Service.
        To respond to the message, please log on GitHub and use the
        URL above to go to the specific comment.

        For queries about this service, please contact Infrastructure at:
        users@infra.apache.org

        Show
        githubbot ASF GitHub Bot added a comment - chrismattmann commented on issue #168: fix for TIKA-2322 contributed by msharan@usc.edu URL: https://github.com/apache/tika/pull/168#issuecomment-298260810 Documentation added in: https://wiki.apache.org/tika/TikaAndVisionVideo#preview . Thank you to @smadha, @ThejanW and @thammegowda Tika can now automatically detect objects in video! ---------------------------------------------------------------- This is an automated message from the Apache Git Service. To respond to the message, please log on GitHub and use the URL above to go to the specific comment. For queries about this service, please contact Infrastructure at: users@infra.apache.org
        Hide
        githubbot ASF GitHub Bot added a comment -

        chrismattmann commented on issue #168: fix for TIKA-2322 contributed by msharan@usc.edu
        URL: https://github.com/apache/tika/pull/168#issuecomment-298260817

        BTW, documentation needs improving, please improve guys!

        ----------------------------------------------------------------
        This is an automated message from the Apache Git Service.
        To respond to the message, please log on GitHub and use the
        URL above to go to the specific comment.

        For queries about this service, please contact Infrastructure at:
        users@infra.apache.org

        Show
        githubbot ASF GitHub Bot added a comment - chrismattmann commented on issue #168: fix for TIKA-2322 contributed by msharan@usc.edu URL: https://github.com/apache/tika/pull/168#issuecomment-298260817 BTW, documentation needs improving, please improve guys! ---------------------------------------------------------------- This is an automated message from the Apache Git Service. To respond to the message, please log on GitHub and use the URL above to go to the specific comment. For queries about this service, please contact Infrastructure at: users@infra.apache.org
        Hide
        hudson Hudson added a comment -

        SUCCESS: Integrated in Jenkins build Tika-trunk #1250 (See https://builds.apache.org/job/Tika-trunk/1250/)
        fix for TIKA-2322 contributed by msharan@usc.edu (mattmann: https://github.com/apache/tika/commit/6c903f25aa6508d06fc6f0044ac996ff9e9d9bf4)

        • (edit) tika-parsers/src/main/resources/org/apache/tika/parser/recognition/tf/inceptionapi.py
        • (add) tika-parsers/src/main/resources/org/apache/tika/parser/recognition/tf/video_util.py
        • (edit) tika-parsers/src/main/java/org/apache/tika/parser/recognition/tf/TensorflowRESTRecogniser.java
        • (add) tika-parsers/src/main/java/org/apache/tika/parser/recognition/tf/TensorflowRESTVideoRecogniser.java
        • (add) tika-parsers/src/test/resources/org/apache/tika/parser/recognition/tika-config-tflow-video-rest.xml
        • (add) tika-parsers/src/test/resources/test-documents/testVideoMp4.mp4
        • (add) tika-parsers/src/test/java/org/apache/tika/parser/recognition/tf/TensorflowVideoRecParserTest.java
          Record change for TIKA-2322. (mattmann: https://github.com/apache/tika/commit/b19b9c36986707db8755a62ef77923af01549411)
        • (edit) CHANGES.txt
        Show
        hudson Hudson added a comment - SUCCESS: Integrated in Jenkins build Tika-trunk #1250 (See https://builds.apache.org/job/Tika-trunk/1250/ ) fix for TIKA-2322 contributed by msharan@usc.edu (mattmann: https://github.com/apache/tika/commit/6c903f25aa6508d06fc6f0044ac996ff9e9d9bf4 ) (edit) tika-parsers/src/main/resources/org/apache/tika/parser/recognition/tf/inceptionapi.py (add) tika-parsers/src/main/resources/org/apache/tika/parser/recognition/tf/video_util.py (edit) tika-parsers/src/main/java/org/apache/tika/parser/recognition/tf/TensorflowRESTRecogniser.java (add) tika-parsers/src/main/java/org/apache/tika/parser/recognition/tf/TensorflowRESTVideoRecogniser.java (add) tika-parsers/src/test/resources/org/apache/tika/parser/recognition/tika-config-tflow-video-rest.xml (add) tika-parsers/src/test/resources/test-documents/testVideoMp4.mp4 (add) tika-parsers/src/test/java/org/apache/tika/parser/recognition/tf/TensorflowVideoRecParserTest.java Record change for TIKA-2322 . (mattmann: https://github.com/apache/tika/commit/b19b9c36986707db8755a62ef77923af01549411 ) (edit) CHANGES.txt
        Hide
        githubbot ASF GitHub Bot added a comment -

        smadha commented on issue #168: fix for TIKA-2322 contributed by msharan@usc.edu
        URL: https://github.com/apache/tika/pull/168#issuecomment-298279635

        Thanks a lot guys. Good start of week. I'll take care of documentation first thing on Monday.
        @thejanw - thanks for your help buddy.

        ----------------------------------------------------------------
        This is an automated message from the Apache Git Service.
        To respond to the message, please log on GitHub and use the
        URL above to go to the specific comment.

        For queries about this service, please contact Infrastructure at:
        users@infra.apache.org

        Show
        githubbot ASF GitHub Bot added a comment - smadha commented on issue #168: fix for TIKA-2322 contributed by msharan@usc.edu URL: https://github.com/apache/tika/pull/168#issuecomment-298279635 Thanks a lot guys. Good start of week. I'll take care of documentation first thing on Monday. @thejanw - thanks for your help buddy. ---------------------------------------------------------------- This is an automated message from the Apache Git Service. To respond to the message, please log on GitHub and use the URL above to go to the specific comment. For queries about this service, please contact Infrastructure at: users@infra.apache.org
        Hide
        githubbot ASF GitHub Bot added a comment -

        smadha commented on issue #168: fix for TIKA-2322 contributed by msharan@usc.edu
        URL: https://github.com/apache/tika/pull/168#issuecomment-298279635

        Thanks a lot guys. Good start of week. I'll take care of documentation first thing on Monday.
        @thejanw - thanks for your help buddy.

        ----------------------------------------------------------------
        This is an automated message from the Apache Git Service.
        To respond to the message, please log on GitHub and use the
        URL above to go to the specific comment.

        For queries about this service, please contact Infrastructure at:
        users@infra.apache.org

        Show
        githubbot ASF GitHub Bot added a comment - smadha commented on issue #168: fix for TIKA-2322 contributed by msharan@usc.edu URL: https://github.com/apache/tika/pull/168#issuecomment-298279635 Thanks a lot guys. Good start of week. I'll take care of documentation first thing on Monday. @thejanw - thanks for your help buddy. ---------------------------------------------------------------- This is an automated message from the Apache Git Service. To respond to the message, please log on GitHub and use the URL above to go to the specific comment. For queries about this service, please contact Infrastructure at: users@infra.apache.org
        Hide
        githubbot ASF GitHub Bot added a comment -

        ThejanW commented on issue #168: fix for TIKA-2322 contributed by msharan@usc.edu
        URL: https://github.com/apache/tika/pull/168#issuecomment-298286570

        @chrismattmann @thammegowda the merged dockefile needs some updating. It won't build otherwise. I think you've missed the line of installing libtcmalloc-minimal4. So this breaks at "ENV LD_PRELOAD="/usr/lib/libtcmalloc_minimal.so.4". and also I have added a line to delete the source files of OpenCV after building. It's also not here

        Could you please revisit my gist https://gist.github.com/ThejanW/bffe660f5a7718b8672ee49f855c6a6f
        and make those changes in the merged file. just copy and paste the entire thing. Thanks.

        ----------------------------------------------------------------
        This is an automated message from the Apache Git Service.
        To respond to the message, please log on GitHub and use the
        URL above to go to the specific comment.

        For queries about this service, please contact Infrastructure at:
        users@infra.apache.org

        Show
        githubbot ASF GitHub Bot added a comment - ThejanW commented on issue #168: fix for TIKA-2322 contributed by msharan@usc.edu URL: https://github.com/apache/tika/pull/168#issuecomment-298286570 @chrismattmann @thammegowda the merged dockefile needs some updating. It won't build otherwise. I think you've missed the line of installing libtcmalloc-minimal4. So this breaks at "ENV LD_PRELOAD="/usr/lib/libtcmalloc_minimal.so.4". and also I have added a line to delete the source files of OpenCV after building. It's also not here Could you please revisit my gist https://gist.github.com/ThejanW/bffe660f5a7718b8672ee49f855c6a6f and make those changes in the merged file. just copy and paste the entire thing. Thanks. ---------------------------------------------------------------- This is an automated message from the Apache Git Service. To respond to the message, please log on GitHub and use the URL above to go to the specific comment. For queries about this service, please contact Infrastructure at: users@infra.apache.org
        Hide
        githubbot ASF GitHub Bot added a comment -

        chrismattmann commented on issue #168: fix for TIKA-2322 contributed by msharan@usc.edu
        URL: https://github.com/apache/tika/pull/168#issuecomment-298287276

        It built fine for me? Can you open a new PR with changes and I will look?

        ----------------------------------------------------------------
        This is an automated message from the Apache Git Service.
        To respond to the message, please log on GitHub and use the
        URL above to go to the specific comment.

        For queries about this service, please contact Infrastructure at:
        users@infra.apache.org

        Show
        githubbot ASF GitHub Bot added a comment - chrismattmann commented on issue #168: fix for TIKA-2322 contributed by msharan@usc.edu URL: https://github.com/apache/tika/pull/168#issuecomment-298287276 It built fine for me? Can you open a new PR with changes and I will look? ---------------------------------------------------------------- This is an automated message from the Apache Git Service. To respond to the message, please log on GitHub and use the URL above to go to the specific comment. For queries about this service, please contact Infrastructure at: users@infra.apache.org
        Hide
        githubbot ASF GitHub Bot added a comment -

        ThejanW opened a new pull request #175: TIKA-2322: update dockerfile
        URL: https://github.com/apache/tika/pull/175

        #install missing dependencies
        #delete source files of OpenCV after building

        ----------------------------------------------------------------
        This is an automated message from the Apache Git Service.
        To respond to the message, please log on GitHub and use the
        URL above to go to the specific comment.

        For queries about this service, please contact Infrastructure at:
        users@infra.apache.org

        Show
        githubbot ASF GitHub Bot added a comment - ThejanW opened a new pull request #175: TIKA-2322 : update dockerfile URL: https://github.com/apache/tika/pull/175 #install missing dependencies #delete source files of OpenCV after building ---------------------------------------------------------------- This is an automated message from the Apache Git Service. To respond to the message, please log on GitHub and use the URL above to go to the specific comment. For queries about this service, please contact Infrastructure at: users@infra.apache.org
        Hide
        githubbot ASF GitHub Bot added a comment -

        ThejanW commented on issue #175: TIKA-2322: update dockerfile
        URL: https://github.com/apache/tika/pull/175#issuecomment-298291625

        @chrismattmann ping!

        ----------------------------------------------------------------
        This is an automated message from the Apache Git Service.
        To respond to the message, please log on GitHub and use the
        URL above to go to the specific comment.

        For queries about this service, please contact Infrastructure at:
        users@infra.apache.org

        Show
        githubbot ASF GitHub Bot added a comment - ThejanW commented on issue #175: TIKA-2322 : update dockerfile URL: https://github.com/apache/tika/pull/175#issuecomment-298291625 @chrismattmann ping! ---------------------------------------------------------------- This is an automated message from the Apache Git Service. To respond to the message, please log on GitHub and use the URL above to go to the specific comment. For queries about this service, please contact Infrastructure at: users@infra.apache.org
        Hide
        githubbot ASF GitHub Bot added a comment -

        ThejanW commented on issue #168: fix for TIKA-2322 contributed by msharan@usc.edu
        URL: https://github.com/apache/tika/pull/168#issuecomment-298298975

        #175

        ----------------------------------------------------------------
        This is an automated message from the Apache Git Service.
        To respond to the message, please log on GitHub and use the
        URL above to go to the specific comment.

        For queries about this service, please contact Infrastructure at:
        users@infra.apache.org

        Show
        githubbot ASF GitHub Bot added a comment - ThejanW commented on issue #168: fix for TIKA-2322 contributed by msharan@usc.edu URL: https://github.com/apache/tika/pull/168#issuecomment-298298975 #175 ---------------------------------------------------------------- This is an automated message from the Apache Git Service. To respond to the message, please log on GitHub and use the URL above to go to the specific comment. For queries about this service, please contact Infrastructure at: users@infra.apache.org
        Hide
        githubbot ASF GitHub Bot added a comment -

        ThejanW commented on issue #168: fix for TIKA-2322 contributed by msharan@usc.edu
        URL: https://github.com/apache/tika/pull/168#issuecomment-298298975

        #175

        ----------------------------------------------------------------
        This is an automated message from the Apache Git Service.
        To respond to the message, please log on GitHub and use the
        URL above to go to the specific comment.

        For queries about this service, please contact Infrastructure at:
        users@infra.apache.org

        Show
        githubbot ASF GitHub Bot added a comment - ThejanW commented on issue #168: fix for TIKA-2322 contributed by msharan@usc.edu URL: https://github.com/apache/tika/pull/168#issuecomment-298298975 #175 ---------------------------------------------------------------- This is an automated message from the Apache Git Service. To respond to the message, please log on GitHub and use the URL above to go to the specific comment. For queries about this service, please contact Infrastructure at: users@infra.apache.org
        Hide
        githubbot ASF GitHub Bot added a comment -

        chrismattmann commented on a change in pull request #175: TIKA-2322: update dockerfile
        URL: https://github.com/apache/tika/pull/175#discussion_r114130580

        ##########
        File path: tika-parsers/src/main/resources/org/apache/tika/parser/recognition/tf/InceptionVideoRestDockerfile
        ##########
        @@ -60,7 +57,7 @@ RUN cmake -D CMAKE_BUILD_TYPE=RELEASE \
        -D INSTALL_C_EXAMPLES=OFF \
        -D OPENCV_EXTRA_MODULES_PATH=../../opencv_contrib-3.2.0/modules \
        -D BUILD_EXAMPLES=ON ..
        -RUN make -j4
        +RUN make -j8

        Review comment:
        no, this is what caused it to use too much disk/memory, so not going to change this to 8.

        ----------------------------------------------------------------
        This is an automated message from the Apache Git Service.
        To respond to the message, please log on GitHub and use the
        URL above to go to the specific comment.

        For queries about this service, please contact Infrastructure at:
        users@infra.apache.org

        Show
        githubbot ASF GitHub Bot added a comment - chrismattmann commented on a change in pull request #175: TIKA-2322 : update dockerfile URL: https://github.com/apache/tika/pull/175#discussion_r114130580 ########## File path: tika-parsers/src/main/resources/org/apache/tika/parser/recognition/tf/InceptionVideoRestDockerfile ########## @@ -60,7 +57,7 @@ RUN cmake -D CMAKE_BUILD_TYPE=RELEASE \ -D INSTALL_C_EXAMPLES=OFF \ -D OPENCV_EXTRA_MODULES_PATH=../../opencv_contrib-3.2.0/modules \ -D BUILD_EXAMPLES=ON .. -RUN make -j4 +RUN make -j8 Review comment: no, this is what caused it to use too much disk/memory, so not going to change this to 8. ---------------------------------------------------------------- This is an automated message from the Apache Git Service. To respond to the message, please log on GitHub and use the URL above to go to the specific comment. For queries about this service, please contact Infrastructure at: users@infra.apache.org
        Hide
        githubbot ASF GitHub Bot added a comment -

        chrismattmann commented on a change in pull request #175: TIKA-2322: update dockerfile
        URL: https://github.com/apache/tika/pull/175#discussion_r114130603

        ##########
        File path: tika-parsers/src/main/resources/org/apache/tika/parser/recognition/tf/InceptionVideoRestDockerfile
        ##########
        @@ -15,37 +15,34 @@

        1. limitations under the License.
          #
          1. Take base as Ubuntu: 16.04, then you can simply apt-get install ffmpeg
            FROM ubuntu:16.04
            MAINTAINER Thamme Gowda <thammegowda@apache.org> and Kranthi Kiran GV <kkranthi@student.nitw.ac.in> and \
            Madhav Sharan <msharan@usc.edu> and Chris Mattmann <mattmann@apache.org>

        +# Install missing part of ubuntu core + python + building dependencies
        RUN apt-get update && \

        • apt-get install -y wget && \
        • apt-get install -y unzip && \
        • apt-get install -y curl
          -
          -
          1. Install ffmpeg
            -RUN apt-get install -y ffmpeg
            -
          2. Dependencies
            -RUN apt-get install -y build-essential cmake pkg-config \
            + apt-get install -y wget git unzip curl \
            + libtcmalloc-minimal4 software-properties-common apt-utils \
            + build-essential cmake pkg-config \
            libjpeg8-dev libtiff5-dev libjasper-dev libpng12-dev \
            libavcodec-dev libavformat-dev libswscale-dev libv4l-dev \
            libxvidcore-dev libx264-dev \
            libgtk2.0-dev \
            libatlas-base-dev gfortran \
            python2.7-dev \
            python-pip
            +
            +# Install ffmpeg
            +RUN apt-get install -y ffmpeg
            +
            RUN \
            pip install --upgrade pip && \
            pip install numpy

        +# Download OpenCV + OpenCV_Contrib
        WORKDIR /usr/local/src

          1. Download OpenCV 3 + Contrib
            -RUN \

        Review comment:
        orthogonal line change?

        ----------------------------------------------------------------
        This is an automated message from the Apache Git Service.
        To respond to the message, please log on GitHub and use the
        URL above to go to the specific comment.

        For queries about this service, please contact Infrastructure at:
        users@infra.apache.org

        Show
        githubbot ASF GitHub Bot added a comment - chrismattmann commented on a change in pull request #175: TIKA-2322 : update dockerfile URL: https://github.com/apache/tika/pull/175#discussion_r114130603 ########## File path: tika-parsers/src/main/resources/org/apache/tika/parser/recognition/tf/InceptionVideoRestDockerfile ########## @@ -15,37 +15,34 @@ limitations under the License. # Take base as Ubuntu: 16.04, then you can simply apt-get install ffmpeg FROM ubuntu:16.04 MAINTAINER Thamme Gowda <thammegowda@apache.org> and Kranthi Kiran GV <kkranthi@student.nitw.ac.in> and \ Madhav Sharan <msharan@usc.edu> and Chris Mattmann <mattmann@apache.org> +# Install missing part of ubuntu core + python + building dependencies RUN apt-get update && \ apt-get install -y wget && \ apt-get install -y unzip && \ apt-get install -y curl - - Install ffmpeg -RUN apt-get install -y ffmpeg - Dependencies -RUN apt-get install -y build-essential cmake pkg-config \ + apt-get install -y wget git unzip curl \ + libtcmalloc-minimal4 software-properties-common apt-utils \ + build-essential cmake pkg-config \ libjpeg8-dev libtiff5-dev libjasper-dev libpng12-dev \ libavcodec-dev libavformat-dev libswscale-dev libv4l-dev \ libxvidcore-dev libx264-dev \ libgtk2.0-dev \ libatlas-base-dev gfortran \ python2.7-dev \ python-pip + +# Install ffmpeg +RUN apt-get install -y ffmpeg + RUN \ pip install --upgrade pip && \ pip install numpy +# Download OpenCV + OpenCV_Contrib WORKDIR /usr/local/src Download OpenCV 3 + Contrib -RUN \ Review comment: orthogonal line change? ---------------------------------------------------------------- This is an automated message from the Apache Git Service. To respond to the message, please log on GitHub and use the URL above to go to the specific comment. For queries about this service, please contact Infrastructure at: users@infra.apache.org
        Hide
        githubbot ASF GitHub Bot added a comment -

        chrismattmann commented on a change in pull request #175: TIKA-2322: update dockerfile
        URL: https://github.com/apache/tika/pull/175#discussion_r114130680

        ##########
        File path: tika-parsers/src/main/resources/org/apache/tika/parser/recognition/tf/InceptionVideoRestDockerfile
        ##########
        @@ -15,37 +15,34 @@

        1. limitations under the License.
          #
          1. Take base as Ubuntu: 16.04, then you can simply apt-get install ffmpeg
            FROM ubuntu:16.04
            MAINTAINER Thamme Gowda <thammegowda@apache.org> and Kranthi Kiran GV <kkranthi@student.nitw.ac.in> and \
            Madhav Sharan <msharan@usc.edu> and Chris Mattmann <mattmann@apache.org>

        +# Install missing part of ubuntu core + python + building dependencies
        RUN apt-get update && \

        • apt-get install -y wget && \
        • apt-get install -y unzip && \
        • apt-get install -y curl
          -
          -
          1. Install ffmpeg
            -RUN apt-get install -y ffmpeg

        Review comment:
        maybe running apt-get install here forces the install of the dependencies you explicitly cite below (which is why it works still?)

        ----------------------------------------------------------------
        This is an automated message from the Apache Git Service.
        To respond to the message, please log on GitHub and use the
        URL above to go to the specific comment.

        For queries about this service, please contact Infrastructure at:
        users@infra.apache.org

        Show
        githubbot ASF GitHub Bot added a comment - chrismattmann commented on a change in pull request #175: TIKA-2322 : update dockerfile URL: https://github.com/apache/tika/pull/175#discussion_r114130680 ########## File path: tika-parsers/src/main/resources/org/apache/tika/parser/recognition/tf/InceptionVideoRestDockerfile ########## @@ -15,37 +15,34 @@ limitations under the License. # Take base as Ubuntu: 16.04, then you can simply apt-get install ffmpeg FROM ubuntu:16.04 MAINTAINER Thamme Gowda <thammegowda@apache.org> and Kranthi Kiran GV <kkranthi@student.nitw.ac.in> and \ Madhav Sharan <msharan@usc.edu> and Chris Mattmann <mattmann@apache.org> +# Install missing part of ubuntu core + python + building dependencies RUN apt-get update && \ apt-get install -y wget && \ apt-get install -y unzip && \ apt-get install -y curl - - Install ffmpeg -RUN apt-get install -y ffmpeg Review comment: maybe running apt-get install here forces the install of the dependencies you explicitly cite below (which is why it works still?) ---------------------------------------------------------------- This is an automated message from the Apache Git Service. To respond to the message, please log on GitHub and use the URL above to go to the specific comment. For queries about this service, please contact Infrastructure at: users@infra.apache.org
        Hide
        githubbot ASF GitHub Bot added a comment -

        chrismattmann commented on a change in pull request #175: TIKA-2322: update dockerfile
        URL: https://github.com/apache/tika/pull/175#discussion_r114130721

        ##########
        File path: tika-parsers/src/main/resources/org/apache/tika/parser/recognition/tf/InceptionVideoRestDockerfile
        ##########
        @@ -70,24 +67,24 @@ RUN \
        pip install --upgrade https://storage.googleapis.com/tensorflow/linux/cpu/tensorflow-1.0.1-cp27-none-linux_x86_64.whl --ignore-installed && \
        pip install flask requests

          1. Get the TF-slim dependencies
            +# Get the TF-slim dependencies

        Review comment:
        orthogonal and unneeded line change.

        ----------------------------------------------------------------
        This is an automated message from the Apache Git Service.
        To respond to the message, please log on GitHub and use the
        URL above to go to the specific comment.

        For queries about this service, please contact Infrastructure at:
        users@infra.apache.org

        Show
        githubbot ASF GitHub Bot added a comment - chrismattmann commented on a change in pull request #175: TIKA-2322 : update dockerfile URL: https://github.com/apache/tika/pull/175#discussion_r114130721 ########## File path: tika-parsers/src/main/resources/org/apache/tika/parser/recognition/tf/InceptionVideoRestDockerfile ########## @@ -70,24 +67,24 @@ RUN \ pip install --upgrade https://storage.googleapis.com/tensorflow/linux/cpu/tensorflow-1.0.1-cp27-none-linux_x86_64.whl --ignore-installed && \ pip install flask requests Get the TF-slim dependencies +# Get the TF-slim dependencies Review comment: orthogonal and unneeded line change. ---------------------------------------------------------------- This is an automated message from the Apache Git Service. To respond to the message, please log on GitHub and use the URL above to go to the specific comment. For queries about this service, please contact Infrastructure at: users@infra.apache.org
        Hide
        githubbot ASF GitHub Bot added a comment -

        chrismattmann commented on a change in pull request #175: TIKA-2322: update dockerfile
        URL: https://github.com/apache/tika/pull/175#discussion_r114130763

        ##########
        File path: tika-parsers/src/main/resources/org/apache/tika/parser/recognition/tf/InceptionVideoRestDockerfile
        ##########
        @@ -70,24 +67,24 @@ RUN \
        pip install --upgrade https://storage.googleapis.com/tensorflow/linux/cpu/tensorflow-1.0.1-cp27-none-linux_x86_64.whl --ignore-installed && \
        pip install flask requests

          1. Get the TF-slim dependencies
            +# Get the TF-slim dependencies
        1. Downloading from a specific commit for future compatibility
          -RUN wget https://github.com/tensorflow/models/archive/c15fada28113eca32dc98d6e3bec4755d0d5b4c2.zip
          -

        Review comment:
        unneeded line change, please remove.

        ----------------------------------------------------------------
        This is an automated message from the Apache Git Service.
        To respond to the message, please log on GitHub and use the
        URL above to go to the specific comment.

        For queries about this service, please contact Infrastructure at:
        users@infra.apache.org

        Show
        githubbot ASF GitHub Bot added a comment - chrismattmann commented on a change in pull request #175: TIKA-2322 : update dockerfile URL: https://github.com/apache/tika/pull/175#discussion_r114130763 ########## File path: tika-parsers/src/main/resources/org/apache/tika/parser/recognition/tf/InceptionVideoRestDockerfile ########## @@ -70,24 +67,24 @@ RUN \ pip install --upgrade https://storage.googleapis.com/tensorflow/linux/cpu/tensorflow-1.0.1-cp27-none-linux_x86_64.whl --ignore-installed && \ pip install flask requests Get the TF-slim dependencies +# Get the TF-slim dependencies Downloading from a specific commit for future compatibility -RUN wget https://github.com/tensorflow/models/archive/c15fada28113eca32dc98d6e3bec4755d0d5b4c2.zip - Review comment: unneeded line change, please remove. ---------------------------------------------------------------- This is an automated message from the Apache Git Service. To respond to the message, please log on GitHub and use the URL above to go to the specific comment. For queries about this service, please contact Infrastructure at: users@infra.apache.org
        Hide
        githubbot ASF GitHub Bot added a comment -

        chrismattmann commented on a change in pull request #175: TIKA-2322: update dockerfile
        URL: https://github.com/apache/tika/pull/175#discussion_r114130786

        ##########
        File path: tika-parsers/src/main/resources/org/apache/tika/parser/recognition/tf/InceptionVideoRestDockerfile
        ##########
        @@ -70,24 +67,24 @@ RUN \
        pip install --upgrade https://storage.googleapis.com/tensorflow/linux/cpu/tensorflow-1.0.1-cp27-none-linux_x86_64.whl --ignore-installed && \
        pip install flask requests

          1. Get the TF-slim dependencies
            +# Get the TF-slim dependencies
        1. Downloading from a specific commit for future compatibility
          -RUN wget https://github.com/tensorflow/models/archive/c15fada28113eca32dc98d6e3bec4755d0d5b4c2.zip
          -
          -RUN unzip c15fada28113eca32dc98d6e3bec4755d0d5b4c2.zip
          +RUN wget https://github.com/tensorflow/models/archive/c15fada28113eca32dc98d6e3bec4755d0d5b4c2.zip
          +RUN unzip c15fada28113eca32dc98d6e3bec4755d0d5b4c2.zip

        RUN \
        wget https://raw.githubusercontent.com/smadha/tika/TIKA-2322/tika-parsers/src/main/resources/org/apache/tika/parser/recognition/tf/inceptionapi.py -O /usr/bin/inceptionapi.py && \
        wget https://raw.githubusercontent.com/smadha/tika/TIKA-2322/tika-parsers/src/main/resources/org/apache/tika/parser/recognition/tf/video_util.py -O /usr/bin/video_util.py && \
        chmod +x /usr/bin/inceptionapi.py && \
        chmod +x /usr/bin/video_util.py

          1. clean up cache, so we can publish smaller image to hub
            +
            +# clean up cache + delete src files, so we can publish smaller image to hub
            RUN apt-get clean
            +RUN rm -rf /usr/local/src

        Review comment:
        makes sense

        ----------------------------------------------------------------
        This is an automated message from the Apache Git Service.
        To respond to the message, please log on GitHub and use the
        URL above to go to the specific comment.

        For queries about this service, please contact Infrastructure at:
        users@infra.apache.org

        Show
        githubbot ASF GitHub Bot added a comment - chrismattmann commented on a change in pull request #175: TIKA-2322 : update dockerfile URL: https://github.com/apache/tika/pull/175#discussion_r114130786 ########## File path: tika-parsers/src/main/resources/org/apache/tika/parser/recognition/tf/InceptionVideoRestDockerfile ########## @@ -70,24 +67,24 @@ RUN \ pip install --upgrade https://storage.googleapis.com/tensorflow/linux/cpu/tensorflow-1.0.1-cp27-none-linux_x86_64.whl --ignore-installed && \ pip install flask requests Get the TF-slim dependencies +# Get the TF-slim dependencies Downloading from a specific commit for future compatibility -RUN wget https://github.com/tensorflow/models/archive/c15fada28113eca32dc98d6e3bec4755d0d5b4c2.zip - -RUN unzip c15fada28113eca32dc98d6e3bec4755d0d5b4c2.zip +RUN wget https://github.com/tensorflow/models/archive/c15fada28113eca32dc98d6e3bec4755d0d5b4c2.zip +RUN unzip c15fada28113eca32dc98d6e3bec4755d0d5b4c2.zip RUN \ wget https://raw.githubusercontent.com/smadha/tika/TIKA-2322/tika-parsers/src/main/resources/org/apache/tika/parser/recognition/tf/inceptionapi.py -O /usr/bin/inceptionapi.py && \ wget https://raw.githubusercontent.com/smadha/tika/TIKA-2322/tika-parsers/src/main/resources/org/apache/tika/parser/recognition/tf/video_util.py -O /usr/bin/video_util.py && \ chmod +x /usr/bin/inceptionapi.py && \ chmod +x /usr/bin/video_util.py clean up cache, so we can publish smaller image to hub + +# clean up cache + delete src files, so we can publish smaller image to hub RUN apt-get clean +RUN rm -rf /usr/local/src Review comment: makes sense ---------------------------------------------------------------- This is an automated message from the Apache Git Service. To respond to the message, please log on GitHub and use the URL above to go to the specific comment. For queries about this service, please contact Infrastructure at: users@infra.apache.org
        Hide
        githubbot ASF GitHub Bot added a comment -

        chrismattmann commented on issue #175: TIKA-2322: update dockerfile
        URL: https://github.com/apache/tika/pull/175#issuecomment-298342195

        thanks @ThejanW see my comments

        ----------------------------------------------------------------
        This is an automated message from the Apache Git Service.
        To respond to the message, please log on GitHub and use the
        URL above to go to the specific comment.

        For queries about this service, please contact Infrastructure at:
        users@infra.apache.org

        Show
        githubbot ASF GitHub Bot added a comment - chrismattmann commented on issue #175: TIKA-2322 : update dockerfile URL: https://github.com/apache/tika/pull/175#issuecomment-298342195 thanks @ThejanW see my comments ---------------------------------------------------------------- This is an automated message from the Apache Git Service. To respond to the message, please log on GitHub and use the URL above to go to the specific comment. For queries about this service, please contact Infrastructure at: users@infra.apache.org
        Hide
        githubbot ASF GitHub Bot added a comment -

        ThejanW commented on a change in pull request #175: TIKA-2322: update dockerfile
        URL: https://github.com/apache/tika/pull/175#discussion_r114136424

        ##########
        File path: tika-parsers/src/main/resources/org/apache/tika/parser/recognition/tf/InceptionVideoRestDockerfile
        ##########
        @@ -60,7 +57,7 @@ RUN cmake -D CMAKE_BUILD_TYPE=RELEASE \
        -D INSTALL_C_EXAMPLES=OFF \
        -D OPENCV_EXTRA_MODULES_PATH=../../opencv_contrib-3.2.0/modules \
        -D BUILD_EXAMPLES=ON ..
        -RUN make -j4
        +RUN make -j8

        Review comment:
        Hmm...then my guess was right. then keep it as j4

        ----------------------------------------------------------------
        This is an automated message from the Apache Git Service.
        To respond to the message, please log on GitHub and use the
        URL above to go to the specific comment.

        For queries about this service, please contact Infrastructure at:
        users@infra.apache.org

        Show
        githubbot ASF GitHub Bot added a comment - ThejanW commented on a change in pull request #175: TIKA-2322 : update dockerfile URL: https://github.com/apache/tika/pull/175#discussion_r114136424 ########## File path: tika-parsers/src/main/resources/org/apache/tika/parser/recognition/tf/InceptionVideoRestDockerfile ########## @@ -60,7 +57,7 @@ RUN cmake -D CMAKE_BUILD_TYPE=RELEASE \ -D INSTALL_C_EXAMPLES=OFF \ -D OPENCV_EXTRA_MODULES_PATH=../../opencv_contrib-3.2.0/modules \ -D BUILD_EXAMPLES=ON .. -RUN make -j4 +RUN make -j8 Review comment: Hmm...then my guess was right. then keep it as j4 ---------------------------------------------------------------- This is an automated message from the Apache Git Service. To respond to the message, please log on GitHub and use the URL above to go to the specific comment. For queries about this service, please contact Infrastructure at: users@infra.apache.org
        Hide
        githubbot ASF GitHub Bot added a comment -

        ThejanW commented on a change in pull request #175: TIKA-2322: update dockerfile
        URL: https://github.com/apache/tika/pull/175#discussion_r114140572

        ##########
        File path: tika-parsers/src/main/resources/org/apache/tika/parser/recognition/tf/InceptionVideoRestDockerfile
        ##########
        @@ -15,37 +15,34 @@

        1. limitations under the License.
          #
          1. Take base as Ubuntu: 16.04, then you can simply apt-get install ffmpeg
            FROM ubuntu:16.04
            MAINTAINER Thamme Gowda <thammegowda@apache.org> and Kranthi Kiran GV <kkranthi@student.nitw.ac.in> and \
            Madhav Sharan <msharan@usc.edu> and Chris Mattmann <mattmann@apache.org>

        +# Install missing part of ubuntu core + python + building dependencies
        RUN apt-get update && \

        • apt-get install -y wget && \
        • apt-get install -y unzip && \
        • apt-get install -y curl
          -
          -
          1. Install ffmpeg
            -RUN apt-get install -y ffmpeg

        Review comment:
        hmm...That could happen.

        ----------------------------------------------------------------
        This is an automated message from the Apache Git Service.
        To respond to the message, please log on GitHub and use the
        URL above to go to the specific comment.

        For queries about this service, please contact Infrastructure at:
        users@infra.apache.org

        Show
        githubbot ASF GitHub Bot added a comment - ThejanW commented on a change in pull request #175: TIKA-2322 : update dockerfile URL: https://github.com/apache/tika/pull/175#discussion_r114140572 ########## File path: tika-parsers/src/main/resources/org/apache/tika/parser/recognition/tf/InceptionVideoRestDockerfile ########## @@ -15,37 +15,34 @@ limitations under the License. # Take base as Ubuntu: 16.04, then you can simply apt-get install ffmpeg FROM ubuntu:16.04 MAINTAINER Thamme Gowda <thammegowda@apache.org> and Kranthi Kiran GV <kkranthi@student.nitw.ac.in> and \ Madhav Sharan <msharan@usc.edu> and Chris Mattmann <mattmann@apache.org> +# Install missing part of ubuntu core + python + building dependencies RUN apt-get update && \ apt-get install -y wget && \ apt-get install -y unzip && \ apt-get install -y curl - - Install ffmpeg -RUN apt-get install -y ffmpeg Review comment: hmm...That could happen. ---------------------------------------------------------------- This is an automated message from the Apache Git Service. To respond to the message, please log on GitHub and use the URL above to go to the specific comment. For queries about this service, please contact Infrastructure at: users@infra.apache.org
        Hide
        githubbot ASF GitHub Bot added a comment -

        ThejanW commented on a change in pull request #175: TIKA-2322: update dockerfile
        URL: https://github.com/apache/tika/pull/175#discussion_r114136424

        ##########
        File path: tika-parsers/src/main/resources/org/apache/tika/parser/recognition/tf/InceptionVideoRestDockerfile
        ##########
        @@ -60,7 +57,7 @@ RUN cmake -D CMAKE_BUILD_TYPE=RELEASE \
        -D INSTALL_C_EXAMPLES=OFF \
        -D OPENCV_EXTRA_MODULES_PATH=../../opencv_contrib-3.2.0/modules \
        -D BUILD_EXAMPLES=ON ..
        -RUN make -j4
        +RUN make -j8

        Review comment:
        Hmm...then my guess was right. then keep it as j4

        ----------------------------------------------------------------
        This is an automated message from the Apache Git Service.
        To respond to the message, please log on GitHub and use the
        URL above to go to the specific comment.

        For queries about this service, please contact Infrastructure at:
        users@infra.apache.org

        Show
        githubbot ASF GitHub Bot added a comment - ThejanW commented on a change in pull request #175: TIKA-2322 : update dockerfile URL: https://github.com/apache/tika/pull/175#discussion_r114136424 ########## File path: tika-parsers/src/main/resources/org/apache/tika/parser/recognition/tf/InceptionVideoRestDockerfile ########## @@ -60,7 +57,7 @@ RUN cmake -D CMAKE_BUILD_TYPE=RELEASE \ -D INSTALL_C_EXAMPLES=OFF \ -D OPENCV_EXTRA_MODULES_PATH=../../opencv_contrib-3.2.0/modules \ -D BUILD_EXAMPLES=ON .. -RUN make -j4 +RUN make -j8 Review comment: Hmm...then my guess was right. then keep it as j4 ---------------------------------------------------------------- This is an automated message from the Apache Git Service. To respond to the message, please log on GitHub and use the URL above to go to the specific comment. For queries about this service, please contact Infrastructure at: users@infra.apache.org
        Hide
        githubbot ASF GitHub Bot added a comment -

        ThejanW commented on a change in pull request #175: TIKA-2322: update dockerfile
        URL: https://github.com/apache/tika/pull/175#discussion_r114144077

        ##########
        File path: tika-parsers/src/main/resources/org/apache/tika/parser/recognition/tf/InceptionVideoRestDockerfile
        ##########
        @@ -15,37 +15,34 @@

        1. limitations under the License.
          #
          1. Take base as Ubuntu: 16.04, then you can simply apt-get install ffmpeg
            FROM ubuntu:16.04
            MAINTAINER Thamme Gowda <thammegowda@apache.org> and Kranthi Kiran GV <kkranthi@student.nitw.ac.in> and \
            Madhav Sharan <msharan@usc.edu> and Chris Mattmann <mattmann@apache.org>

        +# Install missing part of ubuntu core + python + building dependencies
        RUN apt-get update && \

        • apt-get install -y wget && \
        • apt-get install -y unzip && \
        • apt-get install -y curl
          -
          -
          1. Install ffmpeg
            -RUN apt-get install -y ffmpeg
            -
          2. Dependencies
            -RUN apt-get install -y build-essential cmake pkg-config \
            + apt-get install -y wget git unzip curl \
            + libtcmalloc-minimal4 software-properties-common apt-utils \
            + build-essential cmake pkg-config \
            libjpeg8-dev libtiff5-dev libjasper-dev libpng12-dev \
            libavcodec-dev libavformat-dev libswscale-dev libv4l-dev \
            libxvidcore-dev libx264-dev \
            libgtk2.0-dev \
            libatlas-base-dev gfortran \
            python2.7-dev \
            python-pip
            +
            +# Install ffmpeg
            +RUN apt-get install -y ffmpeg
            +
            RUN \
            pip install --upgrade pip && \
            pip install numpy

        +# Download OpenCV + OpenCV_Contrib
        WORKDIR /usr/local/src

          1. Download OpenCV 3 + Contrib
            -RUN \

        Review comment:
        Thought this would make much sense if it's called OpenCV_Contrib rather than Contrib.

        ----------------------------------------------------------------
        This is an automated message from the Apache Git Service.
        To respond to the message, please log on GitHub and use the
        URL above to go to the specific comment.

        For queries about this service, please contact Infrastructure at:
        users@infra.apache.org

        Show
        githubbot ASF GitHub Bot added a comment - ThejanW commented on a change in pull request #175: TIKA-2322 : update dockerfile URL: https://github.com/apache/tika/pull/175#discussion_r114144077 ########## File path: tika-parsers/src/main/resources/org/apache/tika/parser/recognition/tf/InceptionVideoRestDockerfile ########## @@ -15,37 +15,34 @@ limitations under the License. # Take base as Ubuntu: 16.04, then you can simply apt-get install ffmpeg FROM ubuntu:16.04 MAINTAINER Thamme Gowda <thammegowda@apache.org> and Kranthi Kiran GV <kkranthi@student.nitw.ac.in> and \ Madhav Sharan <msharan@usc.edu> and Chris Mattmann <mattmann@apache.org> +# Install missing part of ubuntu core + python + building dependencies RUN apt-get update && \ apt-get install -y wget && \ apt-get install -y unzip && \ apt-get install -y curl - - Install ffmpeg -RUN apt-get install -y ffmpeg - Dependencies -RUN apt-get install -y build-essential cmake pkg-config \ + apt-get install -y wget git unzip curl \ + libtcmalloc-minimal4 software-properties-common apt-utils \ + build-essential cmake pkg-config \ libjpeg8-dev libtiff5-dev libjasper-dev libpng12-dev \ libavcodec-dev libavformat-dev libswscale-dev libv4l-dev \ libxvidcore-dev libx264-dev \ libgtk2.0-dev \ libatlas-base-dev gfortran \ python2.7-dev \ python-pip + +# Install ffmpeg +RUN apt-get install -y ffmpeg + RUN \ pip install --upgrade pip && \ pip install numpy +# Download OpenCV + OpenCV_Contrib WORKDIR /usr/local/src Download OpenCV 3 + Contrib -RUN \ Review comment: Thought this would make much sense if it's called OpenCV_Contrib rather than Contrib. ---------------------------------------------------------------- This is an automated message from the Apache Git Service. To respond to the message, please log on GitHub and use the URL above to go to the specific comment. For queries about this service, please contact Infrastructure at: users@infra.apache.org
        Hide
        githubbot ASF GitHub Bot added a comment -

        ThejanW commented on a change in pull request #175: TIKA-2322: update dockerfile
        URL: https://github.com/apache/tika/pull/175#discussion_r114149180

        ##########
        File path: tika-parsers/src/main/resources/org/apache/tika/parser/recognition/tf/InceptionVideoRestDockerfile
        ##########
        @@ -70,24 +67,24 @@ RUN \
        pip install --upgrade https://storage.googleapis.com/tensorflow/linux/cpu/tensorflow-1.0.1-cp27-none-linux_x86_64.whl --ignore-installed && \
        pip install flask requests

          1. Get the TF-slim dependencies
            +# Get the TF-slim dependencies

        Review comment:
        fixed

        ----------------------------------------------------------------
        This is an automated message from the Apache Git Service.
        To respond to the message, please log on GitHub and use the
        URL above to go to the specific comment.

        For queries about this service, please contact Infrastructure at:
        users@infra.apache.org

        Show
        githubbot ASF GitHub Bot added a comment - ThejanW commented on a change in pull request #175: TIKA-2322 : update dockerfile URL: https://github.com/apache/tika/pull/175#discussion_r114149180 ########## File path: tika-parsers/src/main/resources/org/apache/tika/parser/recognition/tf/InceptionVideoRestDockerfile ########## @@ -70,24 +67,24 @@ RUN \ pip install --upgrade https://storage.googleapis.com/tensorflow/linux/cpu/tensorflow-1.0.1-cp27-none-linux_x86_64.whl --ignore-installed && \ pip install flask requests Get the TF-slim dependencies +# Get the TF-slim dependencies Review comment: fixed ---------------------------------------------------------------- This is an automated message from the Apache Git Service. To respond to the message, please log on GitHub and use the URL above to go to the specific comment. For queries about this service, please contact Infrastructure at: users@infra.apache.org
        Hide
        githubbot ASF GitHub Bot added a comment -

        ThejanW commented on issue #175: TIKA-2322: update dockerfile
        URL: https://github.com/apache/tika/pull/175#issuecomment-298452808

        thank you for your reviews @chrismattmann , there are new commits, review them too. Is there anything else I need to do?

        ----------------------------------------------------------------
        This is an automated message from the Apache Git Service.
        To respond to the message, please log on GitHub and use the
        URL above to go to the specific comment.

        For queries about this service, please contact Infrastructure at:
        users@infra.apache.org

        Show
        githubbot ASF GitHub Bot added a comment - ThejanW commented on issue #175: TIKA-2322 : update dockerfile URL: https://github.com/apache/tika/pull/175#issuecomment-298452808 thank you for your reviews @chrismattmann , there are new commits, review them too. Is there anything else I need to do? ---------------------------------------------------------------- This is an automated message from the Apache Git Service. To respond to the message, please log on GitHub and use the URL above to go to the specific comment. For queries about this service, please contact Infrastructure at: users@infra.apache.org
        Hide
        githubbot ASF GitHub Bot added a comment -

        chrismattmann commented on issue #175: TIKA-2322: update dockerfile
        URL: https://github.com/apache/tika/pull/175#issuecomment-298458899

        Hi @ThejanW thanks I will go ahead and test tonight and if looks good will commit! Thanks!

        ----------------------------------------------------------------
        This is an automated message from the Apache Git Service.
        To respond to the message, please log on GitHub and use the
        URL above to go to the specific comment.

        For queries about this service, please contact Infrastructure at:
        users@infra.apache.org

        Show
        githubbot ASF GitHub Bot added a comment - chrismattmann commented on issue #175: TIKA-2322 : update dockerfile URL: https://github.com/apache/tika/pull/175#issuecomment-298458899 Hi @ThejanW thanks I will go ahead and test tonight and if looks good will commit! Thanks! ---------------------------------------------------------------- This is an automated message from the Apache Git Service. To respond to the message, please log on GitHub and use the URL above to go to the specific comment. For queries about this service, please contact Infrastructure at: users@infra.apache.org
        Hide
        githubbot ASF GitHub Bot added a comment -

        smadha commented on issue #168: fix for TIKA-2322 contributed by msharan@usc.edu
        URL: https://github.com/apache/tika/pull/168#issuecomment-298463754

        @chrismattmann - I added more details in wiki. Manual installation instructions and REST API parameter description etc.

        ----------------------------------------------------------------
        This is an automated message from the Apache Git Service.
        To respond to the message, please log on GitHub and use the
        URL above to go to the specific comment.

        For queries about this service, please contact Infrastructure at:
        users@infra.apache.org

        Show
        githubbot ASF GitHub Bot added a comment - smadha commented on issue #168: fix for TIKA-2322 contributed by msharan@usc.edu URL: https://github.com/apache/tika/pull/168#issuecomment-298463754 @chrismattmann - I added more details in wiki. Manual installation instructions and REST API parameter description etc. ---------------------------------------------------------------- This is an automated message from the Apache Git Service. To respond to the message, please log on GitHub and use the URL above to go to the specific comment. For queries about this service, please contact Infrastructure at: users@infra.apache.org
        Hide
        githubbot ASF GitHub Bot added a comment -

        smadha commented on issue #168: fix for TIKA-2322 contributed by msharan@usc.edu
        URL: https://github.com/apache/tika/pull/168#issuecomment-298463754

        @chrismattmann - I added more details in wiki. Manual installation instructions and REST API parameter description etc.

        ----------------------------------------------------------------
        This is an automated message from the Apache Git Service.
        To respond to the message, please log on GitHub and use the
        URL above to go to the specific comment.

        For queries about this service, please contact Infrastructure at:
        users@infra.apache.org

        Show
        githubbot ASF GitHub Bot added a comment - smadha commented on issue #168: fix for TIKA-2322 contributed by msharan@usc.edu URL: https://github.com/apache/tika/pull/168#issuecomment-298463754 @chrismattmann - I added more details in wiki. Manual installation instructions and REST API parameter description etc. ---------------------------------------------------------------- This is an automated message from the Apache Git Service. To respond to the message, please log on GitHub and use the URL above to go to the specific comment. For queries about this service, please contact Infrastructure at: users@infra.apache.org
        Hide
        githubbot ASF GitHub Bot added a comment -

        ThejanW commented on issue #168: fix for TIKA-2322 contributed by msharan@usc.edu
        URL: https://github.com/apache/tika/pull/168#issuecomment-298511180

        @smadha Could you also add this fact to wiki, when running the container, it always downloads model files. If a user needs to keep those models, without downloading them again and again, the could commit those changes to the container. Once committed those changes, running the container again won't download those models. This is how to do it,

        1. First run the container by this command,
        *docker run -p 8764:8764 -it inception-video-rest-tika*
        2. Open another terminal, without closing the terminal which is server is running, then type this command,
        *docker ps -l*
        3. User will see the immediate container of inception-video-rest-tika and it's container id, make note of the container id.
        4. Then by this command, changes will be committed to inception-video-rest-tika container,
        *docker commit inception-video-rest-tika container_id*
        5. To check if it's worked, stop and start the inception-video-rest-tika container again, it won't download models.

        ----------------------------------------------------------------
        This is an automated message from the Apache Git Service.
        To respond to the message, please log on GitHub and use the
        URL above to go to the specific comment.

        For queries about this service, please contact Infrastructure at:
        users@infra.apache.org

        Show
        githubbot ASF GitHub Bot added a comment - ThejanW commented on issue #168: fix for TIKA-2322 contributed by msharan@usc.edu URL: https://github.com/apache/tika/pull/168#issuecomment-298511180 @smadha Could you also add this fact to wiki, when running the container, it always downloads model files. If a user needs to keep those models, without downloading them again and again, the could commit those changes to the container. Once committed those changes, running the container again won't download those models. This is how to do it, 1. First run the container by this command, * docker run -p 8764:8764 -it inception-video-rest-tika * 2. Open another terminal, without closing the terminal which is server is running, then type this command, * docker ps -l * 3. User will see the immediate container of inception-video-rest-tika and it's container id, make note of the container id. 4. Then by this command, changes will be committed to inception-video-rest-tika container, * docker commit inception-video-rest-tika container_id * 5. To check if it's worked, stop and start the inception-video-rest-tika container again, it won't download models. ---------------------------------------------------------------- This is an automated message from the Apache Git Service. To respond to the message, please log on GitHub and use the URL above to go to the specific comment. For queries about this service, please contact Infrastructure at: users@infra.apache.org
        Hide
        githubbot ASF GitHub Bot added a comment -

        ThejanW commented on issue #168: fix for TIKA-2322 contributed by msharan@usc.edu
        URL: https://github.com/apache/tika/pull/168#issuecomment-298511180

        @smadha Could you also add this fact to wiki, when running the container, it always downloads model files. If a user needs to keep those models, without downloading them again and again, the could commit those changes to the container. Once committed those changes, running the container again won't download those models. This is how to do it,

        1. First run the container by this command,
        *docker run -p 8764:8764 -it inception-video-rest-tika*
        2. Open another terminal, without closing the terminal which is server is running, then type this command,
        *docker ps -l*
        3. User will see the immediate container of inception-video-rest-tika and it's container id, make note of the container id.
        4. Then by this command, changes will be committed to inception-video-rest-tika container,
        *docker commit inception-video-rest-tika container_id*
        5. To check if it's worked, stop and start the inception-video-rest-tika container again, it won't download models.

        ----------------------------------------------------------------
        This is an automated message from the Apache Git Service.
        To respond to the message, please log on GitHub and use the
        URL above to go to the specific comment.

        For queries about this service, please contact Infrastructure at:
        users@infra.apache.org

        Show
        githubbot ASF GitHub Bot added a comment - ThejanW commented on issue #168: fix for TIKA-2322 contributed by msharan@usc.edu URL: https://github.com/apache/tika/pull/168#issuecomment-298511180 @smadha Could you also add this fact to wiki, when running the container, it always downloads model files. If a user needs to keep those models, without downloading them again and again, the could commit those changes to the container. Once committed those changes, running the container again won't download those models. This is how to do it, 1. First run the container by this command, * docker run -p 8764:8764 -it inception-video-rest-tika * 2. Open another terminal, without closing the terminal which is server is running, then type this command, * docker ps -l * 3. User will see the immediate container of inception-video-rest-tika and it's container id, make note of the container id. 4. Then by this command, changes will be committed to inception-video-rest-tika container, * docker commit inception-video-rest-tika container_id * 5. To check if it's worked, stop and start the inception-video-rest-tika container again, it won't download models. ---------------------------------------------------------------- This is an automated message from the Apache Git Service. To respond to the message, please log on GitHub and use the URL above to go to the specific comment. For queries about this service, please contact Infrastructure at: users@infra.apache.org
        Hide
        chrismattmann Chris A. Mattmann added a comment -

        PR #168 close. Thanks Madhav Sharan, Thejan Wijesinghe and Thamme Gowda!

        Show
        chrismattmann Chris A. Mattmann added a comment - PR #168 close. Thanks Madhav Sharan , Thejan Wijesinghe and Thamme Gowda !
        Hide
        chrismattmann Chris A. Mattmann added a comment -

        Thejan Wijesinghe what is your wiki username? I'll grant you permissions to do so yourself.

        Show
        chrismattmann Chris A. Mattmann added a comment - Thejan Wijesinghe what is your wiki username? I'll grant you permissions to do so yourself.
        Hide
        ThejanWijesinghe Thejan Wijesinghe added a comment -

        Will do, thanks. Chris A. Mattmann, it's ThejanW.

        Show
        ThejanWijesinghe Thejan Wijesinghe added a comment - Will do, thanks. Chris A. Mattmann , it's ThejanW.
        Hide
        chrismattmann Chris A. Mattmann added a comment -

        permission granted Thejan Wijesinghe

        Show
        chrismattmann Chris A. Mattmann added a comment - permission granted Thejan Wijesinghe
        Hide
        ThejanWijesinghe Thejan Wijesinghe added a comment -

        I can now edit the Wiki. thank you Chris A. Mattmann.

        Show
        ThejanWijesinghe Thejan Wijesinghe added a comment - I can now edit the Wiki. thank you Chris A. Mattmann .
        Hide
        githubbot ASF GitHub Bot added a comment -

        smadha commented on issue #168: fix for TIKA-2322 contributed by msharan@usc.edu
        URL: https://github.com/apache/tika/pull/168#issuecomment-298719232

        Can we add this inside docker file? Looks like we can add this piece of code inside docker file -https://github.com/apache/tika/blob/e141640891cd7adcfc1848b351c0db7eab00a2d2/tika-parsers/src/main/resources/org/apache/tika/parser/recognition/tf/inceptionapi.py#L189-L202

        It looks like we need to download [model tar](http://download.tensorflow.org/models/inception_v4_2016_09_09.tar.gz) [synsets.txt](https://raw.githubusercontent.com/tensorflow/models/master/inception/inception/data/imagenet_lsvrc_2015_synsets.txt) [model metadata](https://raw.githubusercontent.com/tensorflow/models/master/inception/inception/data/imagenet_metadata.txt) in `/tmp/imagenet`

        Will this not work? @thammegowda @ThejanW

        ----------------------------------------------------------------
        This is an automated message from the Apache Git Service.
        To respond to the message, please log on GitHub and use the
        URL above to go to the specific comment.

        For queries about this service, please contact Infrastructure at:
        users@infra.apache.org

        Show
        githubbot ASF GitHub Bot added a comment - smadha commented on issue #168: fix for TIKA-2322 contributed by msharan@usc.edu URL: https://github.com/apache/tika/pull/168#issuecomment-298719232 Can we add this inside docker file? Looks like we can add this piece of code inside docker file - https://github.com/apache/tika/blob/e141640891cd7adcfc1848b351c0db7eab00a2d2/tika-parsers/src/main/resources/org/apache/tika/parser/recognition/tf/inceptionapi.py#L189-L202 It looks like we need to download [model tar] ( http://download.tensorflow.org/models/inception_v4_2016_09_09.tar.gz ) [synsets.txt] ( https://raw.githubusercontent.com/tensorflow/models/master/inception/inception/data/imagenet_lsvrc_2015_synsets.txt ) [model metadata] ( https://raw.githubusercontent.com/tensorflow/models/master/inception/inception/data/imagenet_metadata.txt ) in `/tmp/imagenet` Will this not work? @thammegowda @ThejanW ---------------------------------------------------------------- This is an automated message from the Apache Git Service. To respond to the message, please log on GitHub and use the URL above to go to the specific comment. For queries about this service, please contact Infrastructure at: users@infra.apache.org
        Hide
        githubbot ASF GitHub Bot added a comment -

        smadha commented on issue #168: fix for TIKA-2322 contributed by msharan@usc.edu
        URL: https://github.com/apache/tika/pull/168#issuecomment-298719232

        Can we add this inside docker file? Looks like we can add this piece of code inside docker file -https://github.com/apache/tika/blob/e141640891cd7adcfc1848b351c0db7eab00a2d2/tika-parsers/src/main/resources/org/apache/tika/parser/recognition/tf/inceptionapi.py#L189-L202

        It looks like we need to download [model tar](http://download.tensorflow.org/models/inception_v4_2016_09_09.tar.gz) [synsets.txt](https://raw.githubusercontent.com/tensorflow/models/master/inception/inception/data/imagenet_lsvrc_2015_synsets.txt) [model metadata](https://raw.githubusercontent.com/tensorflow/models/master/inception/inception/data/imagenet_metadata.txt) in `/tmp/imagenet`

        Will this not work? @thammegowda @ThejanW

        ----------------------------------------------------------------
        This is an automated message from the Apache Git Service.
        To respond to the message, please log on GitHub and use the
        URL above to go to the specific comment.

        For queries about this service, please contact Infrastructure at:
        users@infra.apache.org

        Show
        githubbot ASF GitHub Bot added a comment - smadha commented on issue #168: fix for TIKA-2322 contributed by msharan@usc.edu URL: https://github.com/apache/tika/pull/168#issuecomment-298719232 Can we add this inside docker file? Looks like we can add this piece of code inside docker file - https://github.com/apache/tika/blob/e141640891cd7adcfc1848b351c0db7eab00a2d2/tika-parsers/src/main/resources/org/apache/tika/parser/recognition/tf/inceptionapi.py#L189-L202 It looks like we need to download [model tar] ( http://download.tensorflow.org/models/inception_v4_2016_09_09.tar.gz ) [synsets.txt] ( https://raw.githubusercontent.com/tensorflow/models/master/inception/inception/data/imagenet_lsvrc_2015_synsets.txt ) [model metadata] ( https://raw.githubusercontent.com/tensorflow/models/master/inception/inception/data/imagenet_metadata.txt ) in `/tmp/imagenet` Will this not work? @thammegowda @ThejanW ---------------------------------------------------------------- This is an automated message from the Apache Git Service. To respond to the message, please log on GitHub and use the URL above to go to the specific comment. For queries about this service, please contact Infrastructure at: users@infra.apache.org
        Hide
        githubbot ASF GitHub Bot added a comment -

        chrismattmann commented on issue #168: fix for TIKA-2322 contributed by msharan@usc.edu
        URL: https://github.com/apache/tika/pull/168#issuecomment-298719799

        @smadha @ThejanW @thammegowda let's not discuss immediate improvements to this PR and to 2322 on this request. Let's create a new JIRA ticket or talk on dev@tika.a.o to discuss design updates. I can say it won't make it into 1.15 I don't want to change features at this point. I just want to deliver an initial version of this and improve on it in 1.16-dev

        ----------------------------------------------------------------
        This is an automated message from the Apache Git Service.
        To respond to the message, please log on GitHub and use the
        URL above to go to the specific comment.

        For queries about this service, please contact Infrastructure at:
        users@infra.apache.org

        Show
        githubbot ASF GitHub Bot added a comment - chrismattmann commented on issue #168: fix for TIKA-2322 contributed by msharan@usc.edu URL: https://github.com/apache/tika/pull/168#issuecomment-298719799 @smadha @ThejanW @thammegowda let's not discuss immediate improvements to this PR and to 2322 on this request. Let's create a new JIRA ticket or talk on dev@tika.a.o to discuss design updates. I can say it won't make it into 1.15 I don't want to change features at this point. I just want to deliver an initial version of this and improve on it in 1.16-dev ---------------------------------------------------------------- This is an automated message from the Apache Git Service. To respond to the message, please log on GitHub and use the URL above to go to the specific comment. For queries about this service, please contact Infrastructure at: users@infra.apache.org
        Hide
        githubbot ASF GitHub Bot added a comment -

        smadha commented on issue #168: fix for TIKA-2322 contributed by msharan@usc.edu
        URL: https://github.com/apache/tika/pull/168#issuecomment-298722760

        Just tried this and it worked. After below changes we don't have to download model every time server starts

        Just before making command InceptionVideoRestDockerfile#L89(https://github.com/apache/tika/blob/e141640891cd7adcfc1848b351c0db7eab00a2d2/tika-parsers/src/main/resources/org/apache/tika/parser/recognition/tf/InceptionVideoRestDockerfile#L89)

        Add below lines
        ```
        ENV PYTHONPATH="$PYTHONPATH:$PATH"
        RUN python -c "import inceptionapi"
        ```

        ----------------------------------------------------------------
        This is an automated message from the Apache Git Service.
        To respond to the message, please log on GitHub and use the
        URL above to go to the specific comment.

        For queries about this service, please contact Infrastructure at:
        users@infra.apache.org

        Show
        githubbot ASF GitHub Bot added a comment - smadha commented on issue #168: fix for TIKA-2322 contributed by msharan@usc.edu URL: https://github.com/apache/tika/pull/168#issuecomment-298722760 Just tried this and it worked. After below changes we don't have to download model every time server starts Just before making command InceptionVideoRestDockerfile#L89 ( https://github.com/apache/tika/blob/e141640891cd7adcfc1848b351c0db7eab00a2d2/tika-parsers/src/main/resources/org/apache/tika/parser/recognition/tf/InceptionVideoRestDockerfile#L89 ) Add below lines ``` ENV PYTHONPATH="$PYTHONPATH:$PATH" RUN python -c "import inceptionapi" ``` ---------------------------------------------------------------- This is an automated message from the Apache Git Service. To respond to the message, please log on GitHub and use the URL above to go to the specific comment. For queries about this service, please contact Infrastructure at: users@infra.apache.org
        Hide
        githubbot ASF GitHub Bot added a comment -

        smadha commented on issue #168: fix for TIKA-2322 contributed by msharan@usc.edu
        URL: https://github.com/apache/tika/pull/168#issuecomment-298722760

        Just tried this and it worked. After below changes we don't have to download model every time server starts

        Just before making command InceptionVideoRestDockerfile#L89(https://github.com/apache/tika/blob/e141640891cd7adcfc1848b351c0db7eab00a2d2/tika-parsers/src/main/resources/org/apache/tika/parser/recognition/tf/InceptionVideoRestDockerfile#L89)

        Add below lines
        ```
        ENV PYTHONPATH="$PYTHONPATH:$PATH"
        RUN python -c "import inceptionapi"
        ```

        ----------------------------------------------------------------
        This is an automated message from the Apache Git Service.
        To respond to the message, please log on GitHub and use the
        URL above to go to the specific comment.

        For queries about this service, please contact Infrastructure at:
        users@infra.apache.org

        Show
        githubbot ASF GitHub Bot added a comment - smadha commented on issue #168: fix for TIKA-2322 contributed by msharan@usc.edu URL: https://github.com/apache/tika/pull/168#issuecomment-298722760 Just tried this and it worked. After below changes we don't have to download model every time server starts Just before making command InceptionVideoRestDockerfile#L89 ( https://github.com/apache/tika/blob/e141640891cd7adcfc1848b351c0db7eab00a2d2/tika-parsers/src/main/resources/org/apache/tika/parser/recognition/tf/InceptionVideoRestDockerfile#L89 ) Add below lines ``` ENV PYTHONPATH="$PYTHONPATH:$PATH" RUN python -c "import inceptionapi" ``` ---------------------------------------------------------------- This is an automated message from the Apache Git Service. To respond to the message, please log on GitHub and use the URL above to go to the specific comment. For queries about this service, please contact Infrastructure at: users@infra.apache.org
        Hide
        githubbot ASF GitHub Bot added a comment -

        smadha commented on issue #168: fix for TIKA-2322 contributed by msharan@usc.edu
        URL: https://github.com/apache/tika/pull/168#issuecomment-298723350

        Just saw @chrismattmann comment, will do that

        ----------------------------------------------------------------
        This is an automated message from the Apache Git Service.
        To respond to the message, please log on GitHub and use the
        URL above to go to the specific comment.

        For queries about this service, please contact Infrastructure at:
        users@infra.apache.org

        Show
        githubbot ASF GitHub Bot added a comment - smadha commented on issue #168: fix for TIKA-2322 contributed by msharan@usc.edu URL: https://github.com/apache/tika/pull/168#issuecomment-298723350 Just saw @chrismattmann comment, will do that ---------------------------------------------------------------- This is an automated message from the Apache Git Service. To respond to the message, please log on GitHub and use the URL above to go to the specific comment. For queries about this service, please contact Infrastructure at: users@infra.apache.org
        Hide
        githubbot ASF GitHub Bot added a comment -

        chrismattmann closed pull request #175: TIKA-2322: update dockerfile
        URL: https://github.com/apache/tika/pull/175

        ----------------------------------------------------------------
        This is an automated message from the Apache Git Service.
        To respond to the message, please log on GitHub and use the
        URL above to go to the specific comment.

        For queries about this service, please contact Infrastructure at:
        users@infra.apache.org

        Show
        githubbot ASF GitHub Bot added a comment - chrismattmann closed pull request #175: TIKA-2322 : update dockerfile URL: https://github.com/apache/tika/pull/175 ---------------------------------------------------------------- This is an automated message from the Apache Git Service. To respond to the message, please log on GitHub and use the URL above to go to the specific comment. For queries about this service, please contact Infrastructure at: users@infra.apache.org
        Hide
        githubbot ASF GitHub Bot added a comment -

        chrismattmann commented on issue #175: TIKA-2322: update dockerfile
        URL: https://github.com/apache/tika/pull/175#issuecomment-298925647

        Merged and tested! works great thanks @ThejanW!

        ----------------------------------------------------------------
        This is an automated message from the Apache Git Service.
        To respond to the message, please log on GitHub and use the
        URL above to go to the specific comment.

        For queries about this service, please contact Infrastructure at:
        users@infra.apache.org

        Show
        githubbot ASF GitHub Bot added a comment - chrismattmann commented on issue #175: TIKA-2322 : update dockerfile URL: https://github.com/apache/tika/pull/175#issuecomment-298925647 Merged and tested! works great thanks @ThejanW! ---------------------------------------------------------------- This is an automated message from the Apache Git Service. To respond to the message, please log on GitHub and use the URL above to go to the specific comment. For queries about this service, please contact Infrastructure at: users@infra.apache.org
        Hide
        hudson Hudson added a comment -

        SUCCESS: Integrated in Jenkins build Tika-trunk #1254 (See https://builds.apache.org/job/Tika-trunk/1254/)
        TIKA-2322: update dockerfile (thejanwijesinghe.14: https://github.com/apache/tika/commit/27f7b24b3ee2b8cc6974785c79a148c23c25b707)

        • (edit) tika-parsers/src/main/resources/org/apache/tika/parser/recognition/tf/InceptionVideoRestDockerfile
        Show
        hudson Hudson added a comment - SUCCESS: Integrated in Jenkins build Tika-trunk #1254 (See https://builds.apache.org/job/Tika-trunk/1254/ ) TIKA-2322 : update dockerfile (thejanwijesinghe.14: https://github.com/apache/tika/commit/27f7b24b3ee2b8cc6974785c79a148c23c25b707 ) (edit) tika-parsers/src/main/resources/org/apache/tika/parser/recognition/tf/InceptionVideoRestDockerfile

          People

          • Assignee:
            chrismattmann Chris A. Mattmann
            Reporter:
            msharan@usc.edu Madhav Sharan
          • Votes:
            0 Vote for this issue
            Watchers:
            8 Start watching this issue

            Dates

            • Created:
              Updated:
              Resolved:

              Development