  Nutch / NUTCH-1950

"File name too long" exception when running bin/nutch dump

Details

    • Type: Bug
    • Status: Closed
    • Priority: Minor
    • Resolution: Fixed
    • Affects Version/s: 1.10
    • Fix Version/s: 1.10
    • Component/s: segment
    • Labels: None

    Description

      Running bin/nutch dump on version 1.10-trunk throws an exception saying "File name too long". During a crawl, a URL may be longer than 255 bytes, and Nutch uses the URL as the file name when saving the file. Such names can be stored in segments, but when the files are dumped to the local file system, a file name cannot be longer than 255 bytes.
      FileDumper.java needs to be changed to handle this exception.
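
      One possible way to stay under the limit is sketched below. This is not the patch that was committed for this issue and it uses no Nutch API. The class SafeFileName, the method shorten, and the constant MAX_NAME_BYTES are hypothetical names chosen for illustration. The sketch assumes UTF-8 file names and a 255-byte limit per name: names within the limit are returned unchanged, and longer names are cut on a character boundary and given an MD5 suffix of the full name so that two long URLs sharing a prefix do not collide.

```java
import java.nio.charset.StandardCharsets;
import java.security.MessageDigest;
import java.security.NoSuchAlgorithmException;

// Hypothetical helper, not part of Nutch: shortens over-long file names.
final class SafeFileName {
    // Assumed per-name limit of the local file system, in bytes.
    static final int MAX_NAME_BYTES = 255;
    // "_" plus 32 hex characters of an MD5 digest.
    private static final int SUFFIX_BYTES = 33;

    // Returns name unchanged if it fits in maxBytes of UTF-8; otherwise
    // returns a truncated prefix followed by "_" and the MD5 of the full name.
    static String shorten(String name, int maxBytes) {
        if (maxBytes < SUFFIX_BYTES) {
            throw new IllegalArgumentException("maxBytes must be at least " + SUFFIX_BYTES);
        }
        byte[] utf8 = name.getBytes(StandardCharsets.UTF_8);
        if (utf8.length <= maxBytes) {
            return name;
        }
        String suffix = "_" + md5Hex(utf8);
        int budget = maxBytes - suffix.length();
        StringBuilder prefix = new StringBuilder();
        int used = 0;
        // Copy whole code points so a multi-byte character is never split.
        for (int i = 0; i < name.length(); ) {
            int cp = name.codePointAt(i);
            int len = new String(Character.toChars(cp)).getBytes(StandardCharsets.UTF_8).length;
            if (used + len > budget) {
                break;
            }
            prefix.appendCodePoint(cp);
            used += len;
            i += Character.charCount(cp);
        }
        return prefix + suffix;
    }

    private static String md5Hex(byte[] data) {
        try {
            byte[] digest = MessageDigest.getInstance("MD5").digest(data);
            StringBuilder hex = new StringBuilder(digest.length * 2);
            for (byte b : digest) {
                hex.append(String.format("%02x", b & 0xff));
            }
            return hex.toString();
        } catch (NoSuchAlgorithmException e) {
            throw new IllegalStateException("MD5 not available", e);
        }
    }

    public static void main(String[] args) {
        StringBuilder longName = new StringBuilder("http_example.com_page_");
        for (int i = 0; i < 400; i++) {
            longName.append('x');
        }
        String safe = shorten(longName.toString(), MAX_NAME_BYTES);
        System.out.println(safe.getBytes(StandardCharsets.UTF_8).length + " bytes: " + safe);
    }
}
```

      Note that this truncation drops any file extension at the end of a long name. A dumper that relies on the extension would need to reserve room for it before the hash suffix.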

          People

            Assignee: Chris A. Mattmann (chrismattmann)
            Reporter: Chong Li (chongli)
            Votes: 0
            Watchers: 7

              Time Tracking

                Estimated: 48h
                Remaining: 48h
                Logged: Not Specified