Uploaded image for project: 'Mahout'
  1. Mahout
  2. MAHOUT-565

Features incorrectly hashed in Minhash

    XMLWordPrintableJSON

Details

    • Bug
    • Status: Closed
    • Major
    • Resolution: Fixed
    • 0.4
    • 0.5
    • None
    • None

    Description

      Given a feature vector for which minhash signature is desired, each feature id (an integer) is converted to a byte array through a series of bit shift operations. Current implementation of these operations doesn't mask the bits being shifted resulting in sign bit being shifted.

      Attachments

        1. jira-565.v1.patch
          14 kB
          Ankur Bansal

        Activity

          People

            ankur Ankur Bansal
            ankur Ankur Bansal
            Votes:
            0 Vote for this issue
            Watchers:
            0 Start watching this issue

            Dates

              Created:
              Updated:
              Resolved: