Nutch / NUTCH-1862

Port flexible readdb dump formatting options to 2.X


    Details

    • Type: New Feature
    • Status: Closed
    • Priority: Minor
    • Resolution: Auto Closed
    • Affects Version/s: 2.3
    • Fix Version/s: 2.5
    • Component/s: crawldb
    • Labels: None

      Description

      In 1.X, the crawldb dump can currently be written in any of the following formats:

      • -format csv - dump in CSV format
      • -format normal - dump in the standard format (default option)
      • -format crawldb - dump as a CrawlDB

      We should port this to 2.X. Being able to dump in alternative formats such as CSV is extremely helpful for cross-language support, because tools written in other languages can then read the data directly.
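
To illustrate the cross-language point, here is a minimal Python sketch of how a CSV dump could be consumed outside the JVM. The column names (`Url`, `Status name`, `Score`) and the sample rows are assumptions made up for this example and are not taken from real Nutch output; the helper `read_dump` is likewise hypothetical.

```python
import csv
import io


def read_dump(stream):
    """Return one dict per record, keyed by the dump's header row."""
    return list(csv.DictReader(stream))


# Hypothetical sample data: the header and rows are assumptions for
# illustration only, not the actual layout of a Nutch CSV dump.
SAMPLE = (
    "Url,Status name,Score\n"
    "http://example.com/,db_fetched,1.0\n"
    "http://example.com/about,db_unfetched,0.5\n"
)

records = read_dump(io.StringIO(SAMPLE))

# Collect the URLs of records whose status column marks them as unfetched.
unfetched = [r["Url"] for r in records if r["Status name"] == "db_unfetched"]
print(unfetched)
```

Reading by header name rather than by column position keeps such a consumer working if the dump gains or reorders columns, as long as the names it relies on are still present.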


            People

            • Assignee: Unassigned
            • Reporter: Lewis John McGibbney (lewismc)
