Uploaded image for project: 'ManifoldCF'
  1. ManifoldCF
  2. CONNECTORS-1653

Solr ingester connector contribution

    XMLWordPrintableJSON

Details

    • New Feature
    • Status: Resolved
    • Minor
    • Resolution: Fixed
    • None
    • ManifoldCF 2.18
    • None
    • None
    • Patch

    Description

      Hi,

      We developed a new repository connector for crawling data from Solr and we would like to contribute to MCF by releasing the code into Apache v2 license.

      The goal of this connector is to crawl Solr instances and manage it in MCF rather than using DIH for instance.
      So to do it, we send requests to Solr and we manage the large number of results thanks to the cursormark. The Solr fields must be stored in order to be gathered.

      By the way we do not use any specific libraries, all the dependencies are already into MCF. We tested it so far for Solr 7 and 8 versions.

      The documentation is here : https://datafari.atlassian.net/wiki/spaces/DATAFARI/pages/673742849/Solr+ingester+crawler+connector

      The code is attached.

      Best regards,

      Olivier Tavard

      Attachments

        1. patch_solr_ingester_connector_02_12_2020.txt
          5 kB
          Olivier Tavard
        2. patch_solr_ingester_connector_03_12_2020.txt
          6 kB
          Olivier Tavard
        3. patch_solr_ingester_connector_11_12_2020.txt
          8 kB
          Olivier Tavard
        4. solr_ingester_connector_patch.txt
          121 kB
          Olivier Tavard

        Activity

          People

            kwright@metacarta.com Karl Wright
            olivierfl Olivier Tavard
            Votes:
            0 Vote for this issue
            Watchers:
            2 Start watching this issue

            Dates

              Created:
              Updated:
              Resolved: