Uploaded image for project: 'Mahout'
  1. Mahout
  2. MAHOUT-583

Loss some data when create sequence files from directory

    XMLWordPrintableJSON

Details

    • Bug
    • Status: Closed
    • Minor
    • Resolution: Fixed
    • 0.5
    • 0.5
    • classic
    • None
    • All situation

    Description

      Loss some data when create sequence files from directory. It will happen when we need more than one output chunk file. It create chunk-0 twice. The first chunk-0 file is overwrite by the second chunk-0 file. That's because the name of the second chunk file starts from 0 not 1.
      For example, it creates files in the sequence, chunk-0, chunk-0, chunk-1, chunk-2, chunk-3, chunk-*. So we loss the first chunk-0 file if we create more than one chunk files.

      Attachments

        1. abcd.patch
          0.7 kB
          yumeng

        Activity

          People

            Unassigned Unassigned
            yumegn yumeng
            Votes:
            0 Vote for this issue
            Watchers:
            0 Start watching this issue

            Dates

              Created:
              Updated:
              Resolved:

              Time Tracking

                Estimated:
                Original Estimate - 1h
                1h
                Remaining:
                Remaining Estimate - 1h
                1h
                Logged:
                Time Spent - Not Specified
                Not Specified