NUTCH-1406: index-metadata plugin: conversion to Solr date format

Details

    • Type: Improvement
    • Status: Open
    • Priority: Minor
    • Resolution: Unresolved
    • Affects Version/s: None
    • Fix Version/s: None
    • Component/s: indexer, parser

    Description

      This improvement to the index-metadata plugin allows selected fields to be converted to the Solr date format. The main benefit of the conversion is that range facets can be created on those fields.
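
      For orientation, Solr dates are ISO 8601 timestamps in UTC, such as 2012-06-25T00:00:00Z. Range faceting only works if the target field is declared with a date type in Solr's schema.xml. The fragment below is a minimal sketch of such a declaration and is not part of this patch. It assumes that the indexed field keeps the name metatag.dcterms.modified and that a TrieDateField-based type named tdate is acceptable, as in Solr's example schema; adjust both to the schema actually in use.

      <!-- Assumption: date type modelled on Solr's example schema -->
      <fieldType name="tdate" class="solr.TrieDateField" precisionStep="6" positionIncrementGap="0"/>

      <!-- Assumption: field name matches the converted Nutch field -->
      <field name="metatag.dcterms.modified" type="tdate" indexed="true" stored="true"/>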

      To convert the values of selected metatags to the Solr date format, list them in the index.dateconvert.md property in nutch-site.xml. This can be used, for example, with Dublin Core elements such as dcterms.modified; cic.gc.ca is an example of a site whose pages carry this meta tag. dcterms.modified must also be listed in the metatags.names and index.parse.md properties.

      <property>
        <name>index.dateconvert.md</name>
        <value>metatag.dcterms.modified</value>
        <description>For plugin index-metadata: the name of the HTML meta tag whose value should be converted to Solr date format.</description>
      </property>
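
      The two companion properties mentioned above might look roughly as follows. This is a sketch under assumptions rather than a tested configuration: it assumes that parse-metatags stores the tag under the metatag. prefix, so that index.parse.md refers to it as metatag.dcterms.modified, consistent with the value used for index.dateconvert.md. Only a single tag is listed, because the list separator may differ between Nutch versions.

      <!-- Assumption: tag name as it appears in the HTML meta element -->
      <property>
        <name>metatags.names</name>
        <value>dcterms.modified</value>
      </property>

      <!-- Assumption: parse metadata key carries the metatag. prefix -->
      <property>
        <name>index.parse.md</name>
        <value>metatag.dcterms.modified</value>
      </property>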
      

            People

              Assignee: Unassigned
              Reporter: Kristof (qlumbus)