TIKA-2298 (Tika)

Improve the object recognition parser so that it can work without an external RESTful service setup

    Details

    • Flags:
      Patch

      Description

      When ObjectRecognitionParser was built for image recognition, Java
      frameworks offered no good support for it. All the popular neural network
      toolkits were written in C++ or Python. Since nothing ran inside the JVM,
      we tried several ways to glue them to Tika (CLI, JNI, gRPC, REST).
      This is slowly changing. Deeplearning4j, one of the best-known neural
      network libraries for the JVM, now supports importing models that were
      pre-trained in Python/C++ based toolkits [5].

      Improvement:
      It would be nice to have an implementation of ObjectRecogniser that
      doesn't require any external setup (such as installing native libraries
      or starting REST services). Reasons: it is easier to distribute, and it
      cuts the IO time spent talking to an external process or service.
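
      A minimal sketch of the shape such an in-process recogniser might take.
      Every name here (ScoredLabel, InProcessRecogniser, recognise) is
      hypothetical and is not Tika's actual ObjectRecogniser interface. The
      model is stood in for by a plain function from input features to one
      score per label; in a real implementation that function would presumably
      wrap a network imported and run inside the JVM, which is the assumption
      this issue rests on.

```java
import java.util.ArrayList;
import java.util.Comparator;
import java.util.List;
import java.util.function.Function;

// Hypothetical value type: a label plus the model's score for it.
final class ScoredLabel {
    final String label;
    final float confidence;

    ScoredLabel(String label, float confidence) {
        this.label = label;
        this.confidence = confidence;
    }
}

// Hypothetical in-process recogniser: no REST call, no native library.
// The model is any function mapping input features to one score per label.
final class InProcessRecogniser {
    private final Function<float[], float[]> model;
    private final List<String> labels;
    private final int topK;

    InProcessRecogniser(Function<float[], float[]> model, List<String> labels, int topK) {
        this.model = model;
        this.labels = labels;
        this.topK = topK;
    }

    // Runs the model in the same JVM and returns the topK highest-scoring labels.
    List<ScoredLabel> recognise(float[] input) {
        float[] scores = model.apply(input);
        if (scores.length != labels.size()) {
            throw new IllegalStateException("model returned " + scores.length
                    + " scores for " + labels.size() + " labels");
        }
        List<ScoredLabel> all = new ArrayList<>();
        for (int i = 0; i < scores.length; i++) {
            all.add(new ScoredLabel(labels.get(i), scores[i]));
        }
        // Sort by confidence, highest first.
        all.sort(Comparator.comparingDouble((ScoredLabel s) -> s.confidence).reversed());
        return new ArrayList<>(all.subList(0, Math.min(topK, all.size())));
    }
}
```

      Because the model is passed in as a function, the recogniser can be
      exercised with a stub in unit tests and later pointed at a real network
      without changing the calling code.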

              People

              • Assignee:
                chrismattmann Chris A. Mattmann
                Reporter:
                asmehra95 Avtar Singh
              • Votes:
                0
                Watchers:
                6

                  Time Tracking

                  Estimated:
                  672h
                  Remaining:
                  672h
                  Logged:
                  Not Specified