Nutch / NUTCH-59

meta data support in webdb

Details

    • Type: New Feature
    • Status: Closed
    • Priority: Minor
    • Resolution: Fixed
    • Affects Version/s: None
    • Fix Version/s: 0.8
    • Component/s: None
    • Labels: None

    Description

      Meta data support in the web db would be very useful for a new set of Nutch features that need long-lived meta data.

      Currently, page meta data needs to be regenerated or looked up every 30 days, each time a page is re-fetched. Over the long term, keeping meta data in the web db would bring a dramatic performance improvement for such tasks.
      Furthermore, storing meta data in the webdb would make a new generation of linklist generation filters possible.
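
      To illustrate the kind of filter meant here, the following is a minimal sketch in modern Java. It is not the attached patch and not the Nutch API: the class WebDbMetaDataSketch, the PageEntry type, the generateLinkList method and the "lang" key are all hypothetical names chosen for the example. It only assumes that each web db entry carries a small key/value map that survives between fetches, so a filter can decide from stored values instead of recomputing them from page content.

```java
import java.util.ArrayList;
import java.util.HashMap;
import java.util.List;
import java.util.Map;
import java.util.function.Predicate;

// Hypothetical sketch only: none of these types are Nutch classes.
public class WebDbMetaDataSketch {

    // A web db page entry that carries long-lived key/value meta data.
    static final class PageEntry {
        final String url;
        final Map<String, String> metaData = new HashMap<>();

        PageEntry(String url) {
            this.url = url;
        }
    }

    // Selects URLs for the next link list using only the stored meta data,
    // so nothing has to be regenerated from the page content.
    static List<String> generateLinkList(List<PageEntry> webDb, Predicate<PageEntry> filter) {
        List<String> urls = new ArrayList<>();
        for (PageEntry page : webDb) {
            if (filter.test(page)) {
                urls.add(page.url);
            }
        }
        return urls;
    }

    public static void main(String[] args) {
        PageEntry a = new PageEntry("http://example.com/a");
        a.metaData.put("lang", "en");
        PageEntry b = new PageEntry("http://example.com/b");
        b.metaData.put("lang", "de");

        // Keep only pages whose stored "lang" meta data equals "en".
        List<String> linkList =
            generateLinkList(List.of(a, b), p -> "en".equals(p.metaData.get("lang")));
        System.out.println(linkList);
    }
}
```

      The filter is passed in as a predicate so that different selection rules can be plugged into the same generation step without touching the stored entries.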

      Attachments

        1. webDBMetaDataPatch.txt
          16 kB
          Stefan Groschupf

          People

            Assignee: Unassigned
            Reporter: Stefan Groschupf (joa23)
            Votes: 4
            Watchers: 1

            Dates

              Created:
              Updated:
              Resolved: