Uploaded image for project: 'Tika'
  1. Tika
  2. TIKA-2298

To improve object recognition parser so that it may work without external RESTful service setup

    XMLWordPrintableJSON

Details

    • Patch

    Description

      When ObjectRecognitionParser was built to do image recognition, there wasn't
      good support for Java frameworks. All the popular neural networks were in
      C++ or python. Since there was nothing that runs within JVM, we tried
      several ways to glue them to Tika (like CLI, JNI, gRPC, REST).
      However, this game is changing slowly now. Deeplearning4j, the most famous
      neural network library for JVM, now supports importing models that are
      pre-trained in python/C++ based kits [5].

      Improvement:
      It will be nice to have an implementation of ObjectRecogniser that
      doesn't require any external setup(like installation of native libraries or
      starting REST services). Reasons: easy to distribute and also to cut the IO
      time.

      Attachments

        Issue Links

          Activity

            People

              chrismattmann Chris A. Mattmann
              asmehra95 Avtar Singh
              Votes:
              0 Vote for this issue
              Watchers:
              6 Start watching this issue

              Dates

                Created:
                Updated:
                Resolved:

                Time Tracking

                  Estimated:
                  Original Estimate - 672h
                  672h
                  Remaining:
                  Remaining Estimate - 672h
                  672h
                  Logged:
                  Time Spent - Not Specified
                  Not Specified