Nutch / NUTCH-1137

LinkDb / invertlinks: command line arguments ignored

Details

    • Type: Bug
    • Status: Closed
    • Priority: Minor
    • Resolution: Fixed
    • Affects Version/s: 1.3
    • Fix Version/s: 1.4
    • Component/s: linkdb
    • Labels: None
    • Patch Info: Patch Available

    Description

      If the invertlinks tool is called with the option -dir <segmentsDir>, all
      remaining arguments are ignored:

      % $NUTCH_HOME/bin/nutch invertlinks linkdb -dir segments -noNormalize -noFilter
      LinkDb: starting at 2011-09-28 23:24:07
      LinkDb: linkdb: linkdb
      LinkDb: URL normalize: true
      LinkDb: URL filter: true
      

      (URLs are normalized and filtered despite -noNormalize/-noFilter)

      The patch also restricts the ordering of arguments according to the help text:
      Usage: LinkDb <linkdb> (-dir <segmentsDir> | <seg1> <seg2> ...) [-force] [-noNormalize] [-noFilter]
      (segments must be given before the optional flags)
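
The behaviour described above can be illustrated with a minimal sketch of an argument parser that keeps scanning after -dir and enforces the ordering from the usage text. This is not the attached patch and not Nutch's actual LinkDb code: the class LinkDbArgsSketch, the Options holder, and all field names are hypothetical, chosen only for illustration. It also assumes that -dir and explicitly listed segments are mutually exclusive, as the usage line suggests.

```java
import java.util.ArrayList;
import java.util.List;

/** Hypothetical sketch (not the NUTCH-1137 patch): parses LinkDb-style arguments. */
public class LinkDbArgsSketch {

  /** Parsed options; all names here are illustrative assumptions. */
  static final class Options {
    String linkDb;
    String segmentsDir;                               // set when -dir is used
    final List<String> segments = new ArrayList<>();  // set when segments are listed
    boolean force = false;
    boolean normalize = true;
    boolean filter = true;
  }

  static final String USAGE =
      "Usage: LinkDb <linkdb> (-dir <segmentsDir> | <seg1> <seg2> ...) "
          + "[-force] [-noNormalize] [-noFilter]";

  static Options parse(String[] args) {
    if (args.length < 2) {
      throw new IllegalArgumentException(USAGE);
    }
    Options opts = new Options();
    opts.linkDb = args[0];
    boolean flagsStarted = false;  // true once the first optional flag is seen

    for (int i = 1; i < args.length; i++) {
      String arg = args[i];
      switch (arg) {
        case "-dir":
          // -dir must precede the flags, needs a value, and excludes listed segments.
          if (flagsStarted || i + 1 >= args.length
              || opts.segmentsDir != null || !opts.segments.isEmpty()) {
            throw new IllegalArgumentException(USAGE);
          }
          opts.segmentsDir = args[++i];
          break;  // leaves the switch only; the loop continues with the next argument
        case "-force":
          opts.force = true;
          flagsStarted = true;
          break;
        case "-noNormalize":
          opts.normalize = false;
          flagsStarted = true;
          break;
        case "-noFilter":
          opts.filter = false;
          flagsStarted = true;
          break;
        default:
          // Anything else is a segment path; it must come before the flags.
          if (flagsStarted || opts.segmentsDir != null || arg.startsWith("-")) {
            throw new IllegalArgumentException(USAGE);
          }
          opts.segments.add(arg);
      }
    }
    if (opts.segmentsDir == null && opts.segments.isEmpty()) {
      throw new IllegalArgumentException(USAGE);
    }
    return opts;
  }

  public static void main(String[] args) {
    Options o = parse(new String[] {
        "linkdb", "-dir", "segments", "-noNormalize", "-noFilter"});
    System.out.println("URL normalize: " + o.normalize);
    System.out.println("URL filter: " + o.filter);
  }
}
```

With the arguments from the report, this sketch prints "URL normalize: false" and "URL filter: false". A call such as `linkdb -noFilter -dir segments` is rejected with the usage message because a flag precedes the segments.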

      Attachments

        1. NUTCH-1137-1.5.patch
          1 kB
          Sebastian Nagel

          People

            Assignee: Markus Jelsma (markus17)
            Reporter: Sebastian Nagel (snagel)