Stanbol (Retired) / STANBOL-1125

Create a lightweight EntityHub Indexing Tool for Freebase

Details

    • Type: Improvement
    • Status: Closed
    • Priority: Major
    • Resolution: Won't Fix
    • Affects Version/s: None
    • Fix Version/s: None
    • Component/s: Entityhub
    • Labels: None

    Description

      Because of the enormous size of the dumps, the current Freebase indexing tool in Stanbol can barely run on machines without several gigabytes of RAM and/or SSD disks. The Jena TDB importer has been identified as the bottleneck of the indexing process. However, an RDF database is required in order to, for instance, use LDPath programs at indexing time.

      The idea is to develop a lightweight indexing tool that streams data from the dumps and pushes it directly to Solr. Although some functionality would be lost, this would make it possible for any user to generate Freebase EntityHub indexes from any dump.
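
      The streaming idea could be sketched roughly as follows. This is only an illustration, not the tool proposed in this issue: the class name `StreamingIndexer`, the `index` method and the `sink` callback are hypothetical, and the sketch assumes the dump is a tab-separated triple file in which all lines for one subject are adjacent. A real tool would replace the sink with a Solr client and read the gzipped dump through a `GZIPInputStream`.

```java
import java.io.BufferedReader;
import java.io.StringReader;
import java.util.ArrayList;
import java.util.Iterator;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;
import java.util.function.Consumer;
import java.util.stream.Stream;

// Hypothetical sketch: group consecutive triples by subject and hand each
// finished document to a sink, without loading the dump into an RDF store.
class StreamingIndexer {

    /**
     * Reads tab-separated "subject, predicate, object" lines and emits one
     * field map per subject. Assumes lines for the same subject are adjacent.
     * Returns the number of documents handed to the sink.
     */
    static int index(Stream<String> lines, Consumer<Map<String, List<String>>> sink) {
        String currentSubject = null;
        Map<String, List<String>> doc = null;
        int emitted = 0;
        Iterator<String> it = lines.iterator();
        while (it.hasNext()) {
            String[] parts = it.next().split("\t");
            if (parts.length < 3) {
                continue; // skip blank or malformed lines
            }
            String subject = parts[0];
            if (!subject.equals(currentSubject)) {
                // Subject changed: flush the previous document, start a new one.
                if (doc != null) {
                    sink.accept(doc);
                    emitted++;
                }
                currentSubject = subject;
                doc = new LinkedHashMap<>();
                doc.computeIfAbsent("id", k -> new ArrayList<>()).add(subject);
            }
            doc.computeIfAbsent(parts[1], k -> new ArrayList<>()).add(parts[2]);
        }
        if (doc != null) { // flush the last document
            sink.accept(doc);
            emitted++;
        }
        return emitted;
    }

    public static void main(String[] args) {
        // Made-up sample data, for illustration only.
        String sample = "<m.01>\t<name>\t\"Alpha\"@en\t.\n"
                + "<m.01>\t<type>\t<topic>\t.\n"
                + "<m.02>\t<name>\t\"Beta\"@en\t.\n";
        List<Map<String, List<String>>> docs = new ArrayList<>();
        int count = index(new BufferedReader(new StringReader(sample)).lines(), docs::add);
        System.out.println(count + " documents: " + docs);
    }
}
```

      Because only one document is held in memory at a time, memory use in this sketch does not grow with the size of the dump. The trade-off is the one noted above: without an RDF store, features that need random access to the graph, such as LDPath programs, are not available.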

            People

              Assignee: rafaharo (Rafa Haro)
              Reporter: rafaharo (Rafa Haro)
              Votes: 0
              Watchers: 2
