Uploaded image for project: 'Mahout'
  1. Mahout
  2. MAHOUT-456

RowSimilarityJob should not produce SequentialAccessSparseVectors

    XMLWordPrintableJSON

Details

    • Improvement
    • Status: Closed
    • Major
    • Resolution: Fixed
    • None
    • 0.4
    • Math
    • None

    Description

      RowSimilarityJob currently produces SequentialAccessSparseVectors with cardinality Integer.MAX_VALUE wrapped inside VectorWritables.

      It should better produce RandomAccessSparseVectors as some methods like assign(Vector) are very slow on such SequentialAccessSparseVectors.

      Attachments

        1. MAHOUT-456.patch
          3 kB
          Sebastian Schelter
        2. MAHOUT-456-2.patch
          1 kB
          Sebastian Schelter
        3. MAHOUT-456.patch
          1 kB
          Jake Mannix
        4. MAHOUT-456.patch
          1 kB
          Jake Mannix

        Activity

          People

            Unassigned Unassigned
            ssc Sebastian Schelter
            Votes:
            0 Vote for this issue
            Watchers:
            0 Start watching this issue

            Dates

              Created:
              Updated:
              Resolved: