NUTCH-923: Multilingual support for Solr-index-mapping

Details

    • Type: Improvement
    • Status: Open
    • Priority: Minor
    • Resolution: Unresolved
    • Affects Version/s: 1.2
    • Fix Version/s: None
    • Component/s: indexer
    • Labels: None
    • Patch Info: Patch Available

    Description

      It would be useful to extend the mapping possibilities when indexing to Solr.
      One useful feature would be to take the detected language of the HTML page (for example, via the language-identifier plugin) and send the content to the corresponding language-aware Solr fields.

      The mapping file could be as follows:
      <field dest="lang" source="lang"/>
      <field dest="title_${lang}" source="title" />
      so that the title field is mapped to title_en for English pages and title_fr for French pages.
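
The placeholder expansion described above could be sketched roughly as follows. This is only an illustration of the idea, not the attached patch and not existing Nutch code: the class name `DestFieldResolver`, the method `resolve`, and the choice to return null when the referenced field is missing are all assumptions made for the example.

```java
import java.util.HashMap;
import java.util.Map;
import java.util.regex.Matcher;
import java.util.regex.Pattern;

// Hypothetical helper (not part of Nutch): expands ${field} placeholders
// in a mapping "dest" attribute using values already present on the document.
public class DestFieldResolver {

    // Matches placeholders of the form ${name} and captures the name.
    private static final Pattern PLACEHOLDER = Pattern.compile("\\$\\{([^}]+)\\}");

    /**
     * Returns the dest name with every placeholder replaced by the matching
     * document field value, or null if a referenced field is absent
     * (assumption: the caller then skips the field or falls back to a default).
     */
    public static String resolve(String destTemplate, Map<String, String> docFields) {
        Matcher m = PLACEHOLDER.matcher(destTemplate);
        StringBuffer out = new StringBuffer();
        while (m.find()) {
            String value = docFields.get(m.group(1));
            if (value == null) {
                return null; // e.g. no language was detected for this page
            }
            // quoteReplacement keeps '$' and '\' in the value literal
            m.appendReplacement(out, Matcher.quoteReplacement(value));
        }
        m.appendTail(out);
        return out.toString();
    }

    public static void main(String[] args) {
        Map<String, String> doc = new HashMap<>();
        doc.put("lang", "fr");
        doc.put("title", "Bonjour");
        System.out.println(resolve("title_${lang}", doc)); // prints title_fr
    }
}
```

A dest without placeholders, such as "lang", passes through unchanged, so existing mapping files would keep working under this scheme.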

      What do you think? Could this also be useful to others?
      Or are there already other solutions out there?

      Attachments

        1. patch-923-nutch-release-1.2.txt
          7 kB
          Matthias Agethle

          People

            Assignee: Markus Jelsma (markus17)
            Reporter: Matthias Agethle (magethle)
            Votes: 3
            Watchers: 3
