Details

    • Type: Sub-task
    • Status: Resolved
    • Priority: Major
    • Resolution: Fixed
    • Affects Version/s: None
    • Fix Version/s: 1.12
    • Component/s: parser
    • Labels:

      Description

      As of now tika uses lucene-geo-gazetteer CLI to extract co-ordinates of a location. CLI requires jvm and lucene to instantiate for every request. With all new REST api it will be possible to gain improvement in this space.

      Idea is to create a client of lucene-geo-gazetteer in tika and use it in GeoTopicParser

        Activity

        Hide
        githubbot ASF GitHub Bot added a comment -

        GitHub user smadha opened a pull request:

        https://github.com/apache/tika/pull/65

        fix for TIKA-1803 contributed by msharan@usc.edu

        You can merge this pull request into a Git repository by running:

        $ git pull https://github.com/smadha/tika TIKA-1803

        Alternatively you can review and apply these changes as the patch at:

        https://github.com/apache/tika/pull/65.patch

        To close this pull request, make a commit to your master/trunk branch
        with (at least) the following in the commit message:

        This closes #65


        commit a55990aa5d6a0c521358123f8d7bbd6947255174
        Author: smadha <msharan@usc.edu>
        Date: 2015-12-16T15:26:23Z

        fix for TIKA-1803 contributed by msharan@usc.edu


        Show
        githubbot ASF GitHub Bot added a comment - GitHub user smadha opened a pull request: https://github.com/apache/tika/pull/65 fix for TIKA-1803 contributed by msharan@usc.edu You can merge this pull request into a Git repository by running: $ git pull https://github.com/smadha/tika TIKA-1803 Alternatively you can review and apply these changes as the patch at: https://github.com/apache/tika/pull/65.patch To close this pull request, make a commit to your master/trunk branch with (at least) the following in the commit message: This closes #65 commit a55990aa5d6a0c521358123f8d7bbd6947255174 Author: smadha <msharan@usc.edu> Date: 2015-12-16T15:26:23Z fix for TIKA-1803 contributed by msharan@usc.edu
        Hide
        chrismattmann Chris A. Mattmann added a comment -

        Thanks Madhav Sharan this is now done! I had to fix tika-bundle BTW, please test the whole suite (mvn install at the top level) before submitting the PR - since tika-bundle usually has updates to be made too. GREAT work!

        [chipotle:~/tmp/tika1.12] mattmann% svn commit -m "Fix for TIKA-1803 Use lucene-geo-gazetteer REST API in GeoTopicParser contributed by Madhav Sharan msharan@usc.edu this closes #65"
        Sending        CHANGES.txt
        Sending        tika-bundle/pom.xml
        Sending        tika-parsers/pom.xml
        Sending        tika-parsers/src/main/java/org/apache/tika/parser/geo/topic/GeoParser.java
        Sending        tika-parsers/src/main/java/org/apache/tika/parser/geo/topic/GeoParserConfig.java
        Sending        tika-parsers/src/main/java/org/apache/tika/parser/geo/topic/GeoTag.java
        Adding         tika-parsers/src/main/java/org/apache/tika/parser/geo/topic/gazetteer
        Adding         tika-parsers/src/main/java/org/apache/tika/parser/geo/topic/gazetteer/GeoGazetteerClient.java
        Adding         tika-parsers/src/main/java/org/apache/tika/parser/geo/topic/gazetteer/Location.java
        Adding         tika-parsers/src/main/resources/org/apache/tika/parser/geo
        Adding         tika-parsers/src/main/resources/org/apache/tika/parser/geo/topic
        Adding         tika-parsers/src/main/resources/org/apache/tika/parser/geo/topic/GeoTopicConfig.properties
        Sending        tika-parsers/src/test/java/org/apache/tika/parser/geo/topic/GeoParserTest.java
        Transmitting file data ..........
        Committed revision 1721048.
        [chipotle:~/tmp/tika1.12] mattmann% 
        
        Show
        chrismattmann Chris A. Mattmann added a comment - Thanks Madhav Sharan this is now done! I had to fix tika-bundle BTW, please test the whole suite (mvn install at the top level) before submitting the PR - since tika-bundle usually has updates to be made too. GREAT work! [chipotle:~/tmp/tika1.12] mattmann% svn commit -m "Fix for TIKA-1803 Use lucene-geo-gazetteer REST API in GeoTopicParser contributed by Madhav Sharan msharan@usc.edu this closes #65" Sending CHANGES.txt Sending tika-bundle/pom.xml Sending tika-parsers/pom.xml Sending tika-parsers/src/main/java/org/apache/tika/parser/geo/topic/GeoParser.java Sending tika-parsers/src/main/java/org/apache/tika/parser/geo/topic/GeoParserConfig.java Sending tika-parsers/src/main/java/org/apache/tika/parser/geo/topic/GeoTag.java Adding tika-parsers/src/main/java/org/apache/tika/parser/geo/topic/gazetteer Adding tika-parsers/src/main/java/org/apache/tika/parser/geo/topic/gazetteer/GeoGazetteerClient.java Adding tika-parsers/src/main/java/org/apache/tika/parser/geo/topic/gazetteer/Location.java Adding tika-parsers/src/main/resources/org/apache/tika/parser/geo Adding tika-parsers/src/main/resources/org/apache/tika/parser/geo/topic Adding tika-parsers/src/main/resources/org/apache/tika/parser/geo/topic/GeoTopicConfig.properties Sending tika-parsers/src/test/java/org/apache/tika/parser/geo/topic/GeoParserTest.java Transmitting file data .......... Committed revision 1721048. [chipotle:~/tmp/tika1.12] mattmann%
        Hide
        msharan@usc.edu Madhav Sharan added a comment - - edited

        Alrighty! Will take care in future

        Show
        msharan@usc.edu Madhav Sharan added a comment - - edited Alrighty! Will take care in future
        Hide
        githubbot ASF GitHub Bot added a comment -

        Github user asfgit closed the pull request at:

        https://github.com/apache/tika/pull/65

        Show
        githubbot ASF GitHub Bot added a comment - Github user asfgit closed the pull request at: https://github.com/apache/tika/pull/65
        Hide
        hudson Hudson added a comment -

        UNSTABLE: Integrated in tika-trunk-jdk1.7 #892 (See https://builds.apache.org/job/tika-trunk-jdk1.7/892/)
        Fix for TIKA-1803 Use lucene-geo-gazetteer REST API in GeoTopicParser contributed by Madhav Sharan msharan@usc.edu this closes #65 (mattmann: http://svn.apache.org/viewvc/tika/trunk/?view=rev&rev=1721048)

        • trunk/CHANGES.txt
        • trunk/tika-bundle/pom.xml
        • trunk/tika-parsers/pom.xml
        • trunk/tika-parsers/src/main/java/org/apache/tika/parser/geo/topic/GeoParser.java
        • trunk/tika-parsers/src/main/java/org/apache/tika/parser/geo/topic/GeoParserConfig.java
        • trunk/tika-parsers/src/main/java/org/apache/tika/parser/geo/topic/GeoTag.java
        • trunk/tika-parsers/src/main/java/org/apache/tika/parser/geo/topic/gazetteer
        • trunk/tika-parsers/src/main/java/org/apache/tika/parser/geo/topic/gazetteer/GeoGazetteerClient.java
        • trunk/tika-parsers/src/main/java/org/apache/tika/parser/geo/topic/gazetteer/Location.java
        • trunk/tika-parsers/src/main/resources/org/apache/tika/parser/geo
        • trunk/tika-parsers/src/main/resources/org/apache/tika/parser/geo/topic
        • trunk/tika-parsers/src/main/resources/org/apache/tika/parser/geo/topic/GeoTopicConfig.properties
        • trunk/tika-parsers/src/test/java/org/apache/tika/parser/geo/topic/GeoParserTest.java
        Show
        hudson Hudson added a comment - UNSTABLE: Integrated in tika-trunk-jdk1.7 #892 (See https://builds.apache.org/job/tika-trunk-jdk1.7/892/ ) Fix for TIKA-1803 Use lucene-geo-gazetteer REST API in GeoTopicParser contributed by Madhav Sharan msharan@usc.edu this closes #65 (mattmann: http://svn.apache.org/viewvc/tika/trunk/?view=rev&rev=1721048 ) trunk/CHANGES.txt trunk/tika-bundle/pom.xml trunk/tika-parsers/pom.xml trunk/tika-parsers/src/main/java/org/apache/tika/parser/geo/topic/GeoParser.java trunk/tika-parsers/src/main/java/org/apache/tika/parser/geo/topic/GeoParserConfig.java trunk/tika-parsers/src/main/java/org/apache/tika/parser/geo/topic/GeoTag.java trunk/tika-parsers/src/main/java/org/apache/tika/parser/geo/topic/gazetteer trunk/tika-parsers/src/main/java/org/apache/tika/parser/geo/topic/gazetteer/GeoGazetteerClient.java trunk/tika-parsers/src/main/java/org/apache/tika/parser/geo/topic/gazetteer/Location.java trunk/tika-parsers/src/main/resources/org/apache/tika/parser/geo trunk/tika-parsers/src/main/resources/org/apache/tika/parser/geo/topic trunk/tika-parsers/src/main/resources/org/apache/tika/parser/geo/topic/GeoTopicConfig.properties trunk/tika-parsers/src/test/java/org/apache/tika/parser/geo/topic/GeoParserTest.java

          People

          • Assignee:
            chrismattmann Chris A. Mattmann
            Reporter:
            msharan@usc.edu Madhav Sharan
          • Votes:
            0 Vote for this issue
            Watchers:
            4 Start watching this issue

            Dates

            • Created:
              Updated:
              Resolved:

              Development