Uploaded image for project: 'Solr'
  1. Solr
  2. SOLR-9480

Graph Traversal for Significantly Related Terms (Semantic Knowledge Graph)

    Details

    • Type: Improvement
    • Status: Open
    • Priority: Major
    • Resolution: Unresolved
    • Affects Version/s: None
    • Fix Version/s: None
    • Component/s: None
    • Security Level: Public (Default Security Level. Issues are Public)
    • Labels:
      None

      Description

      This issue is to track the contribution of the Semantic Knowledge Graph Solr Plugin (request handler), which exposes a graph-like interface for discovering and traversing significant relationships between entities within an inverted index.

      This data model has been described in the following research paper: The Semantic Knowledge Graph: A compact, auto-generated model for real-time traversal and ranking of any relationship within a domain, as well as in presentations I gave in October 2015 at Lucene/Solr Revolution and November 2015 at the Bay Area Search Meetup.

      The source code for this project is currently available at https://github.com/careerbuilder/semantic-knowledge-graph, and the folks at CareerBuilder (where this was built) have given me the go-ahead to now contribute this back to the Apache Solr Project, as well.

      Check out the Github repository, research paper, or presentations for a more detailed description of this contribution. Initial patch coming soon.

      1. SOLR-9480.patch
        328 kB
        Trey Grainger

        Activity

        Hide
        solrtrey Trey Grainger added a comment -

        Initial patch to get the ball rolling here. Feature should now work as described in reference links in the description. Only real changes are an update from Solr 5.1.0 to master, and cleanup of most of the precommit issues.

        Still plenty of work to do, particularly in reworking some of the multi-threading code to follow Solr conventions, reducing the number of files for helper classes, and eventually getting this working correctly in distributed mode (was originally built for use cases involving a single Solr core as a "representative model"). Would also be good to make a getting started tutorial with example data so its easier get started with the feature and do something interesting out of the box.

        Will continue working on those items as I'm able. Feedback welcome.

        Show
        solrtrey Trey Grainger added a comment - Initial patch to get the ball rolling here. Feature should now work as described in reference links in the description. Only real changes are an update from Solr 5.1.0 to master, and cleanup of most of the precommit issues. Still plenty of work to do, particularly in reworking some of the multi-threading code to follow Solr conventions, reducing the number of files for helper classes, and eventually getting this working correctly in distributed mode (was originally built for use cases involving a single Solr core as a "representative model"). Would also be good to make a getting started tutorial with example data so its easier get started with the feature and do something interesting out of the box. Will continue working on those items as I'm able. Feedback welcome.

          People

          • Assignee:
            Unassigned
            Reporter:
            solrtrey Trey Grainger
          • Votes:
            0 Vote for this issue
            Watchers:
            6 Start watching this issue

            Dates

            • Created:
              Updated:

              Development