Uploaded image for project: 'Nutch'
  1. Nutch
  2. NUTCH-2115

Add total counts to dump stats

    XMLWordPrintableJSON

Details

    • Improvement
    • Status: Closed
    • Minor
    • Resolution: Fixed
    • 1.10
    • 1.11
    • dumpers, util
    • None

    Description

      It would be nice if the "dump" tool included total counts for the mimetype stats that it gives. Something along the lines of the following would be great when you have to deal with some larger crawls and don't want to bother doing the math yourself.

      Dumper File Stats: 
      TOTAL Stats:
      [
          {"mimeType":"application/xhtml+xml","count":"2"}
          {"mimeType":"application/octet-stream","count":"1"}
          {"mimeType":"text/html","count":"23"}
      ]
      Total count: 26
      
      FILTERED Stats:
      [
          {"mimeType":"text/html","count":"23"}
      ]
      Total filtered count: 23
      

      Attachments

        Activity

          People

            mjoyce Michael Joyce
            mjoyce Michael Joyce
            Votes:
            0 Vote for this issue
            Watchers:
            4 Start watching this issue

            Dates

              Created:
              Updated:
              Resolved: