Uploaded image for project: 'Nutch'
  1. Nutch
  2. NUTCH-2938

Use Any23's RepositoryWriter to write structured data to Rdf4j repository

    XMLWordPrintableJSON

Details

    • Improvement
    • Status: Closed
    • Major
    • Resolution: Won't Do
    • None
    • None
    • any23, plugin
    • None

    Description

      I have been running a patch which leverages Any23's RepositoryWriter (implemented as one of a number of TripleHandler's via CompositeTripleHandler) to write Any23 extractions to GraphDB. This enables us to build a content graph from data across the enterprise.
      This feature is turned off by default so will not change existing Any23 behaviour. I have concerns about the performance of this patch because right now we need to create a new repository connection for each URL. This is not great so I will definitely improve on it.
      PR coming up.

      Attachments

        Issue Links

          Activity

            People

              lewismc Lewis John McGibbney
              lewismc Lewis John McGibbney
              Votes:
              0 Vote for this issue
              Watchers:
              3 Start watching this issue

              Dates

                Created:
                Updated:
                Resolved: