Uploaded image for project: 'Nutch'
  1. Nutch
  2. NUTCH-1862

Port flexible readdb dump formatting options to 2.X

    XMLWordPrintableJSON

Details

    • New Feature
    • Status: Closed
    • Minor
    • Resolution: Auto Closed
    • 2.3
    • 2.5
    • crawldb
    • None

    Description

      Right now in 1.X we can format the crawldb dump as follows

      • -format csv - in Csv format
      • -format normal - dump in standard format (default option), and
      • -format crawldb - dump as CrawlDB

      We should port this to 2.X as it is extremely helpful for cross language support to be able to read alternative data input formats e.g. CSV

      Attachments

        Activity

          People

            Unassigned Unassigned
            lewismc Lewis John McGibbney
            Votes:
            0 Vote for this issue
            Watchers:
            1 Start watching this issue

            Dates

              Created:
              Updated:
              Resolved: