Details

    • Type: Bug
    • Status: Closed
    • Priority: Major
    • Resolution: Fixed
    • Affects Version/s: 0.3
    • Fix Version/s: 0.4
    • Component/s: Clustering
    • Labels:
      None

      Description

      Minhash clustering performs probabilistic dimension reduction of high dimensional data. The essence of the technique is to hash each item using multiple independent hash functions such that the probability of collision of similar items is higher. Multiple such hash tables can then be constructed to answer near neighbor type of queries efficiently.
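The technique described above can be made concrete with a small, self-contained sketch. Everything below (the class name, the seed constants, the Random-based hash) is illustrative only and is not Mahout's implementation:

```java
import java.util.Arrays;

// Minimal MinHash sketch: each of k hash functions maps every item id to an
// integer, and the signature keeps the minimum hash per function. Similar
// sets then agree on signature positions with probability roughly equal to
// their Jaccard similarity. All constants here are arbitrary illustrations.
public class MinHashExample {
  static int[] signature(int[] items, long[] seeds) {
    int[] sig = new int[seeds.length];
    Arrays.fill(sig, Integer.MAX_VALUE);
    for (int i = 0; i < seeds.length; i++) {
      for (int item : items) {
        // One cheap deterministic "hash function" per seed.
        int h = new java.util.Random(seeds[i] * 31 + item).nextInt();
        sig[i] = Math.min(sig[i], h);
      }
    }
    return sig;
  }

  public static void main(String[] args) {
    long[] seeds = {11, 13, 17, 19, 23, 29};
    int[] a = {1, 2, 3, 4, 5};
    int[] b = {1, 2, 3, 4, 6};  // high overlap with a
    int[] sa = signature(a, seeds);
    int[] sb = signature(b, seeds);
    int agree = 0;
    for (int i = 0; i < seeds.length; i++) {
      if (sa[i] == sb[i]) agree++;
    }
    System.out.println("agreeing signature positions: " + agree + "/" + seeds.length);
  }
}
```

Signatures for near-identical sets agree in most positions, which is what lets hash-table lookups answer near-neighbor queries without pairwise comparison.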

      1. MAHOUT-344-v1.patch
        17 kB
        Ankur
      2. MAHOUT-344-v2.patch
        34 kB
        Cristi Prodan
      3. MAHOUT-344-v3.patch
        41 kB
        Cristi Prodan
      4. MAHOUT-344-v4.patch
        39 kB
        Ankur
      5. MAHOUT-344-v5.patch
        28 kB
        Ankur
      6. MAHOUT-344-v6.patch
        42 kB
        Ankur
      7. MAHOUT-344-v7.patch
        48 kB
        Ankur

        Activity

        Grant Ingersoll added a comment -

        Ankur, do you have a reference for this implementation? Trying to compare this with the original Broder paper.

        Ankur added a comment -

        Grant, The idea behind keyGroups is to concatenate hashes from multiple hash functions to reduce the probability of collision between 2 users that agreed on 1 or more individual hash values. This essentially improves the average similarity of users in a cluster.

        About documentation, I am caught up with a few urgent issues at work and will need more time. Hope to get some free cycles before end of this week.
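Ankur's keyGroups idea (concatenating several consecutive minhash values into one bucket key, so users must agree on all of them before sharing a cluster) can be sketched as follows. The names here are hypothetical and not the patch's actual mapper code:

```java
// Illustrative only: with keyGroups = 2, pairs of minhash values are
// concatenated into a single cluster key, so two users must agree on both
// hashes in a group (an AND condition) before they land in the same bucket.
// This trades recall for higher average intra-cluster similarity.
public class KeyGroupExample {
  static String[] clusterKeys(int[] minHashes, int keyGroups) {
    int numKeys = minHashes.length / keyGroups;
    String[] keys = new String[numKeys];
    for (int g = 0; g < numKeys; g++) {
      StringBuilder key = new StringBuilder();
      for (int j = 0; j < keyGroups; j++) {
        key.append(minHashes[g * keyGroups + j]).append('-');
      }
      keys[g] = key.toString();
    }
    return keys;
  }

  public static void main(String[] args) {
    // 4 minhash values grouped in pairs yield 2 emitted cluster keys.
    for (String k : clusterKeys(new int[]{7, 42, 7, 99}, 2)) {
      System.out.println(k);
    }
  }
}
```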

        Hudson added a comment -

        Integrated in Mahout-Quality #1150 (See https://builds.apache.org/job/Mahout-Quality/1150/)
        MAHOUT-344: added minhash to build-asf-email.sh and to driver.classes.props

        gsingers : http://svn.apache.org/viewcvs.cgi/?root=Apache-SVN&view=rev&rev=1197803
        Files :

        • /mahout/trunk/examples/bin/build-asf-email.sh
        • /mahout/trunk/src/conf/driver.classes.props
        Grant Ingersoll added a comment -

        Also, Ankur, do you have a reference to what your implementation is based off of? I'm mostly wondering about the purpose of the keyGroups option. (I get what it does at the syntax level, wondering about the purpose of it)

        Ankur added a comment -

        I will surely get it done by 08/Nov/11. Apologies for the delay.

        Grant Ingersoll added a comment -

        Ankur, any luck on documenting this stuff?

        Ankur added a comment -

        Sure, will do it in the next 2-3 days.

        Grant Ingersoll added a comment -

        Ankur, could you doc this at https://cwiki.apache.org/confluence/display/MAHOUT/Minhash+Clustering
        Cristi Prodan added a comment -

        I sure do. I've been silently watching the set of changes. I had a lot to learn while working on it, as well as when seeing others contribute. Glad to see this making it into the trunk, and hoping to contribute more in the future.

        Ankur added a comment -

        I am glad this was accepted. Thank you folks.

        Also, thanks to Cristi Prodan for making progress on this. Cristi, hope you see this.

        Hudson added a comment -

        Integrated in Mahout-Quality #358 (See https://hudson.apache.org/hudson/job/Mahout-Quality/358/) MAHOUT-344
        Sean Owen added a comment -

        I'm gonna submit my flavor of Ankur's patches, with "new Random(11)" left in place. We can iterate from there. Cool with all?

        Ted Dunning added a comment -

        Sorry to chime in late,

        >Random - java.util.Random
        Each mapper should get an exact copy of the hash functions for constructing the minhash signatures, or else the chances of collision are quite low even for highly similar items. That is the reason for hard-coding the seed for Random(). The reason for the test failure is not that; it is that the linear hash function has a higher false positive rate than the murmur and polynomial hash functions. The remedy is to concatenate more hashes per key group. For testLinearMinHashMRJob() I changed the number of hash functions to 20 and the number of key groups to 4, and now the test passes successfully.

        This sounds like we should be using murmurHash instead. It is almost as fast as j.u.Random and has MUCH better properties. This kind of change is fine to defer to 0.5.

        Ankur added a comment -

        Sean,
        Thanks for agreeing to merge the style and latest code changes. I'll study your style changes and hopefully do better next time. Code changes should only be in 'LastfmDataConverter' & 'LastfmClusterEvaluator'.

        Updated patch with fixed Javadoc comments and added support for converting the LastFM 1K users dataset http://www.dtic.upf.edu/~ocelma/MusicRecommendationDataset/lastfm-1K.html into vector format for running MinHash.

        Sadly the changes in seed generation we discussed did not help much, and the addition of RandomUtils.getRandom(11) was causing testLinearMinHashMRJob() to fail consistently, so I reverted the code change in HashFactory.java.

        Sean Owen added a comment -

        Got it, I understand the point of this class now. Well I might only humbly suggest "RandomUtils.getRandom(11)" for sheer consistency in that line.

        I also have a modified version of your patch locally where I tried to adjust the items that were giving off minor style warnings. I am happy to bear the burden of merging your latest changes. If they're small, it might be simple too to commit and let you add them in too.

        Ankur added a comment -

        Having the seeds as fixed parameters does not give us a family of hash functions; it will be the same hash function repeated K times. What we need is a family of hash functions F where each function h(s) is defined as follows:-
        h(s) = index of the first element in a random permutation of all possible values that is also present in set s
        Then the probability that two sets agree on a hash value is equal to their Jaccard coefficient, i.e.
        P[h(s1) = h(s2)] = |s1 intersect s2| / |s1 union s2|

        Having multiple different hash functions is an implementation trick that simulates the random permutation of all possible values. Having said that, there is nothing that prevents our seed generator from generating duplicate seeds, essentially making 2 or more hash functions identical. Also, I think it will make a difference if we accept only prime seeds. What do you think?
        Let me make the change of discarding duplicate seeds and accepting only primes to see what happens.

        BTW I completed the javadoc comments and also added support for the http://mtg.upf.edu/static/datasets/last.fm/lastfm-dataset-1K.tar.gz dataset, which I think is denser.
        I will post a new patch shortly.
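The Jaccard property Ankur states can be checked empirically. The following is a rough simulation with arbitrary mixing constants, not the code under review: each seed plays the role of one random permutation, and the fraction of seeds on which the two sets' minimum hashes agree approaches their Jaccard coefficient.

```java
import java.util.HashSet;
import java.util.Random;
import java.util.Set;

// Empirical check of P[h(s1) = h(s2)] ~= |s1 ^ s2| / |s1 v s2|. Each "hash
// function" is a random permutation of the universe, simulated by mixing a
// per-trial seed with each element and taking the minimum hash value.
public class JaccardCheck {
  static int minHash(Set<Integer> s, long seed) {
    int min = Integer.MAX_VALUE;
    for (int x : s) {
      // Arbitrary 64-bit mixing constants to decorrelate consecutive seeds.
      long mixed = (seed * 0x9E3779B97F4A7C15L) ^ (x * 0xC2B2AE3D27D4EB4FL);
      min = Math.min(min, new Random(mixed).nextInt());
    }
    return min;
  }

  static double estimate(Set<Integer> s1, Set<Integer> s2, int trials) {
    int agree = 0;
    for (long seed = 0; seed < trials; seed++) {
      if (minHash(s1, seed) == minHash(s2, seed)) agree++;
    }
    return (double) agree / trials;
  }

  public static void main(String[] args) {
    Set<Integer> s1 = new HashSet<>();
    Set<Integer> s2 = new HashSet<>();
    for (int i = 0; i < 80; i++) s1.add(i);    // {0..79}
    for (int i = 40; i < 120; i++) s2.add(i);  // {40..119}; Jaccard = 40/120
    System.out.println("estimated Jaccard: " + estimate(s1, s2, 2000)
        + " (exact: " + (40.0 / 120) + ")");
  }
}
```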

        Sean Owen added a comment -

        I see. So really we might as well take the first 6 integers generated by "new Random(11)" and stick them in as the fixed parameters for these hash functions. If we do that, might we not make sure to pick good values rather than leave it to the RNG? For example, it's bad if "seedA" in the linear hash is 0. I'm sure it isn't. But if they're both even, that's not so great I think?

        That is, what happens if I just stick in some arbitrary primes here?
        Would that remove the need to divide modulo a large prime at the end?

        (Also, does it matter that 'byteVal' values can be negative? It doesn't really seem so, from the math, but I stopped to wonder at it for a moment.)
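For context, the linear hash under discussion is generally of the form h(x) = (a*x + b) mod p. The sketch below uses illustrative constants, not the patch's HashFactory, and shows why a zero multiplier is degenerate and why prime parameters are a reasonable pick:

```java
// A linear hash family h(x) = (a * x + b) mod p over a prime p. If a == 0
// the hash is constant (everything collides); if a shares a factor with the
// modulus, the effective range shrinks. Drawing a from [1, p-1] with p prime
// avoids both problems. Constants below are illustrative only.
public class LinearHash {
  static final long PRIME = 2147483647L;  // 2^31 - 1, a Mersenne prime
  final long a, b;

  LinearHash(long a, long b) {
    this.a = a;
    this.b = b;
  }

  int hash(int x) {
    // Double mod keeps the result non-negative even for negative inputs.
    return (int) (((a * x + b) % PRIME + PRIME) % PRIME);
  }

  public static void main(String[] args) {
    LinearHash degenerate = new LinearHash(0, 5);  // a == 0: constant hash
    LinearHash ok = new LinearHash(1299721, 7);    // a prime multiplier
    System.out.println("constant hash collides: "
        + (degenerate.hash(1) == degenerate.hash(2)));
    System.out.println("prime-multiplier hash collides: "
        + (ok.hash(1) == ok.hash(2)));
  }
}
```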

        Ankur added a comment -

        > Catching OutOfMemoryError
        LastfmClusterEvaluator - I was getting OOME as I was trying to run this on simply concatenated sequence files. I don't think we can proceed in this case, as simply concatenated sequence files essentially mean corrupt data. Once this was fixed there was no problem, so we can safely remove the OOME catch block.
        LastfmDataConverter - The code works fine with -Xmx512m heap settings, which should be reasonable for 1.5G of uncompressed data. This is what is suggested. Can't think of a reasonable in-memory approach.

        >Random - java.util.Random
        Each mapper should get an exact copy of the hash functions for constructing the minhash signatures, or else the chances of collision are quite low even for highly similar items. That is the reason for hard-coding the seed for Random(). The reason for the test failure is not that; it is that the linear hash function has a higher false positive rate than the murmur and polynomial hash functions. The remedy is to concatenate more hashes per key group. For testLinearMinHashMRJob() I changed the number of hash functions to 20 and the number of key groups to 4, and now the test passes successfully.

        Sean Owen added a comment -

        I'm ready to commit after adjusting the code for project code style. However there are two outstanding issues IMHO:

        Catching OutOfMemoryError – this looks bad. Is it really possible to proceed in this case versus just crashing?
        Random – java.util.Random is used directly and hard-coded to a seed. Should it not use an un-seeded RNG? and use RandomUtils.getRandom() in particular so that tests are repeatable? But then the tests fail for me.

        Don't need a new patch, perhaps just targeted advice on how we might adjust for these as necessary.

        Ankur added a comment -

        Updated (and hopefully) penultimate patch with

        1. All the code issues fixed after another round of cleanup and testing.
        2. Example code for conversion of LastFM dataset into Mahout vector format written to sequenceFiles.
        3. Cluster quality evaluation code for analyzing the precision at various similarity thresholds.

        Here are the steps to run the clustering code on Last FM dataset after applying the patch and building mahout jars :-

        1. Download http://mtg.upf.edu/static/datasets/last.fm/lastfm-dataset-360K.tar.gz and uncompress and untar into a local dir.

        2. Run the following command to create data converted into vector format, dumped into sequence files:-

        • java -Xms512m -Xmx512m -cp /path/to/commons-logging-1.0.4.jar:/path/to/log4j-1.2.15.jar:/path/to/hadoop-0.20.2-core.jar:/path/to/mahout-math-0.4-SNAPSHOT.jar:/path/to/mahout-examples-0.4-SNAPSHOT.jar:/path/to/mahout-core-0.4-SNAPSHOT.jar org.apache.mahout.clustering.minhash.LastfmDataConverter /path/to/lastfm-dataset-360K/usersha1-artmbid-artname-plays.tsv /path/to/lastfm-vector-formated.seq

        3. Upload file 'lastfm-vector-formated.seq' to DFS dir 'lastfm'

        4. Before running the clustering MR job do the following on the shell prompt:-

        • export HADOOP_CLASSPATH=/path/to/mahout-math-0.4-SNAPSHOT.jar:/path/to/mahout-core-0.4-SNAPSHOT.jar:/path/to/commons-cli-2.0-mahout.jar.

        5. Run the clustering MR job on the data. Here is a sample command line

        • hadoop jar /path/to/mahout-core-0.4-SNAPSHOT.jar org.apache.mahout.clustering.minhash.MinHashDriver -Dio.sort.mb=256 -Dio.sort.factor=20 -libjars /path/to/mahout-math-0.4-SNAPSHOT.jar --input lastfm --output lastfm-out --minClusterSize 10 --minVectorSize 10 --hashType polynomial --numHashFunctions 60 --keyGroups 2 --debugOutput true --numReducers 1
        • Note:- debugOutput is set to true so that entire vectors are clustered which later on will be used for similarity computation in quality evaluation.

        6. Download the file under 'lastfm-out' to local dir and run the evaluation code as follows:-

        • java -cp /path/to/commons-logging-1.0.4.jar:/path/to/log4j-1.2.15.jar:/path/to/hadoop-0.20.2-core.jar:/path/to/mahout-math-0.4-SNAPSHOT.jar:/path/to/mahout-examples-0.4-SNAPSHOT.jar:/path/to/mahout-core-0.4-SNAPSHOT.jar org.apache.mahout.clustering.minhash.LastfmClusterEvaluator lastfm-cluster-data.seq 0.2 0.5

        Here are some of the results I got with different threshold parameters and sampling parameters

        Test Results
        =============
        (A) Listeners in same cluster with similarity above threshold (0.2): 4997
        (B) All listeners: 15564
        Average cluster precision: A/B = 32.11%

        (A) Listeners in same cluster with similarity above threshold (0.3): 1872
        (B) All listeners: 15564
        Average cluster precision: A/B = 12.03%

        The only task remaining here is updating Javadoc comments and incorporating any review comments. Apart from those 2 this should be good to go in.

        Ankur added a comment -

        Updated patch with complete minhash implementation along with Unit test case. However I did not get enough time to finish

        1. Javadoc comments to my satisfaction.
        2. Example code re-write for Last.fm dataset.

        While I'll certainly be completing the javadoc comments within a day's time, the example code rewrite might stretch this to a couple of days.

        Ted/Sean, can you please take a look and provide review comments?

        Ankur added a comment -

        Finally some action from my side

        1. HashFunction is now an interface with a single method - hash().
        2. Implementations of the different hash functions are now moved to a HashFactory that also provides a factory method for fetching hash functions of a requested type (linear, polynomial, murmur).
        3. Minhash mapper/reducer code cleaned up quite a bit.
        4. Added options for minimum vector size and hashType.
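The shape described in points 1 and 2 might look roughly like the sketch below. The signatures and names are assumptions for illustration, not the actual patch; only the linear type is sketched, and the toy hash body is not a real minhash-quality function:

```java
// Sketch of a single-method HashFunction interface plus a factory that hands
// back a family of functions of a requested type, seeded deterministically
// (the fixed seed mirrors the "new Random(11)" discussed on this issue).
interface HashFunction {
  int hash(byte[] bytes);
}

class HashFactorySketch {
  enum HashType { LINEAR, POLYNOMIAL, MURMUR }

  static HashFunction[] createHashFunctions(HashType type, int num) {
    HashFunction[] functions = new HashFunction[num];
    java.util.Random seeds = new java.util.Random(11);  // fixed seed: every
    // mapper builds the identical family, as required for minhash.
    for (int i = 0; i < num; i++) {
      final int a = seeds.nextInt(Integer.MAX_VALUE - 1) + 1;  // avoid a == 0
      final int b = seeds.nextInt();
      switch (type) {
        case LINEAR:
          functions[i] = bytes -> {
            long h = 0;
            for (byte x : bytes) {
              h = a * h + b + x;  // toy linear mix, illustrative only
            }
            return (int) h;
          };
          break;
        default:
          throw new UnsupportedOperationException(type + " not sketched here");
      }
    }
    return functions;
  }
}
```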

        Pending tasks
        1. Fix the Unit test case.
        2. Fix example code over Last FM dataset.
        3. Add Javadoc documentation.

        I hope to complete the above tasks by EOD tomorrow and submit a new patch.

        Ted Dunning added a comment -

        Sounds like end of the week is a good time to decide.

        Ankur added a comment -

        Hi Ted,
        There is a bit of work left in this, specifically:-

        1. Adding murmurhash as an option for the available hash functions. Looks like I can use the checkin of MAHOUT-503.
        2. Unit test case completion demonstrating correctness.
        3. Example code completion and cleanup. After a bit of thought I feel that the average item similarity across all clusters might not be a good criterion for a probabilistic clustering technique like this. Instead I am planning to write a unit test to calculate precision as follows:-

        • Set a minimum similarity threshold for intra-cluster items, e.g. 0.4
        • For each cluster out of a randomly selected subset of all clusters, run a pairwise similarity test to count the true positives (TP) that pass the threshold.
        • Precision = (TP / Total-items-in-clusters)

        Do you see any problems ? Any other suggestions ?

        4. Documentation and cleanup.

        I should be able to provide an updated patch by the end of this week, and with one more round of review and changes this should be good to go in by the end of next week. If the timeline sounds acceptable for 0.4 then we're good; else we'll have to push this one out to 0.5.
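One reading of the precision procedure proposed in point 3, with hypothetical types (a cluster as a list of items, similarity as a callback) rather than the patch's actual evaluator, could be sketched as:

```java
import java.util.List;
import java.util.function.BiFunction;

// For each sampled cluster, run pairwise similarity tests and count the
// pairs that clear the threshold; precision is the passing fraction over
// all tests run. Types and names here are illustrative assumptions.
public class ClusterPrecision {
  static <T> double precision(List<List<T>> clusters,
                              BiFunction<T, T, Double> similarity,
                              double threshold) {
    long passed = 0;
    long tested = 0;
    for (List<T> cluster : clusters) {
      for (int i = 0; i < cluster.size(); i++) {
        for (int j = i + 1; j < cluster.size(); j++) {
          tested++;
          if (similarity.apply(cluster.get(i), cluster.get(j)) >= threshold) {
            passed++;
          }
        }
      }
    }
    return tested == 0 ? 0.0 : (double) passed / tested;
  }

  public static void main(String[] args) {
    // Two toy clusters of integers, with distance turned into a similarity.
    List<List<Integer>> clusters = List.of(List.of(1, 2, 3), List.of(10, 50));
    double p = precision(clusters,
        (a, b) -> 1.0 / (1 + Math.abs(a - b)), 0.4);
    System.out.println("precision: " + p);
  }
}
```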

        Ted Dunning added a comment -

        Ankur?

        How does this look for committing?

        Ankur added a comment -

        Marking as fix for version 0.4. I will review Cristi's patch in a day to see if we need any changes. Besides that, can anyone take a cursory look to suggest what else this patch needs to make it to 0.4?

        Cristi Prodan added a comment -

        See above comment.

        Cristi Prodan added a comment -

        Sorry for the big delay - I had to finish my dissertation and other stuff. Anyway, here are some things I managed to do and for which I commit a patch:

        • Converted the existing code to work with hadoop 0.20+
        • Converted the input of the algorithm to RandomAccessSparseVector
        • Converted the LastFM db into RandomAccessSparseVector format
        • Added command options using the DefaultOptionCreator mechanism
          Running the MinHash clustering algorithm can be done using a configuration like this:

        i(-input) lastfm/med_db_seq
        o(-output) lastfm/med_db_clusters
        mcs(-minClusterSize) 5
        nh(-numHashFunctions) 2
        kg(-keyGroups) 2
        ow(-overwriteOutput)

        • Evaluating the clustering results with the above configuration using the metric suggested by Ankur yields a value of 0.20303965982542901, which is not too good IMO. I will still run tests with other parameters and see what happens.

        The next steps are the following:
        1. write tests for the current code (doing this now);
        2. refactor the code so that it uses "points" and "vectors" instead of "items" and "users". The algorithm will also cluster text files, for finding very similar files.
        3. Write some documentation on how to use the algorithm.
        4. Investigate a more general format for the output algorithm (Vectors or something like that).

        If you have any suggestions I would very much like to hear them.

        Sean Owen added a comment -

        Just cleaning house and pinging this issue. Since it's possibly a 'bug' want to make sure it's not dropped.

        Hide
        Ankur added a comment -

        Just back from a vacation. I am catching up with a lot of things, so I won't be able to review Cristi's changes for the next 3-4 days, but I am hoping that any further changes will be minor.

        Cristi, do you have some results to share from your testing on the last.fm dataset?

        Once this is in we can start working towards using this to generate recommendations.

        Hide
        Sean Owen added a comment -

        Just cleaning house here – is this suitable for committing? I don't see any comments to the contrary. Maybe you are in the best position to comment Ankur?

        Hide
        Cristi Prodan added a comment -

        See comment above for this patch.

        Hide
        Cristi Prodan added a comment -

        Thank you guys for all the encouragement and advice.

        I'm committing my first patch for MinHash clustering. The patch contains the following things:

        • in core - minhash:
        • MinHashMapRed - removed the need for the distributed cache; each mapper generates the same hash functions using the same seed (as per instructions from Ankur).
        • RandomLinearHashFunction - added another random linear hash function of the form h(x) = (ax + b) mod p. p should be as large as possible (> 10000000) and should be prime (not done yet, but committing in this form due to some time restrictions).
        • in examples - minhash directory:
        • DisplayMinHash - contains an example of running min-hash, with the options commented. It's basically the main function from MinHashMapRed.
        • PrepareDataset - this class offers the ability to convert the last.fm database suggested above into a format readable by the MinHash algorithm. It also shows a progress bar with the percent done.
          For the future, I believe that all the code in the algorithm should take a more generalized form and use the Vector classes used by Mahout; then users could either write their own version against a Vector interface or create a tool that converts their dataset to the vector format the code expects.
          MurmurHash is used by PrepareDataset to hash the strings denoting users (in the original last_fm dataset) to integers.
        • TestClusterQuality - takes a clustered file generated by the minhash algorithm and computes the average similarity for each cluster, aggregated over all clusters.
          In each cluster the mean is computed as:
          SUM(similarity(item_i, item_j)) / TOTAL_SIMILARITIES, for i != j, where TOTAL_SIMILARITIES = n! / (k! * (n - k)!), with n = total number of items in the cluster and k = 2 (i.e., the number of unordered pairs).
          The aggregated mean is the mean of all these values.
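        The per-cluster mean described above can be sketched in plain Java. This is a minimal illustration assuming Jaccard similarity between item sets; the class and method names are hypothetical, not the patch's actual code:

        ```java
        import java.util.*;

        public class ClusterQualitySketch {

            // Jaccard similarity between two item sets: |A ∩ B| / |A ∪ B|
            static double jaccard(Set<Integer> a, Set<Integer> b) {
                Set<Integer> inter = new HashSet<>(a);
                inter.retainAll(b);
                Set<Integer> union = new HashSet<>(a);
                union.addAll(b);
                return union.isEmpty() ? 0.0 : (double) inter.size() / union.size();
            }

            // Mean pairwise similarity within one cluster: sum over all
            // C(n, 2) = n! / (k! * (n - k)!) unordered pairs (k = 2),
            // divided by C(n, 2).
            static double clusterMean(List<Set<Integer>> cluster) {
                int n = cluster.size();
                if (n < 2) {
                    return 0.0;
                }
                double sum = 0.0;
                for (int i = 0; i < n; i++) {
                    for (int j = i + 1; j < n; j++) {
                        sum += jaccard(cluster.get(i), cluster.get(j));
                    }
                }
                return sum / (n * (n - 1) / 2.0);
            }

            public static void main(String[] args) {
                // The two items of the first example cluster: users 1 and 2,
                // which share 4 of 6 distinct items.
                List<Set<Integer>> cluster = Arrays.asList(
                    new HashSet<>(Arrays.asList(1, 2, 3, 4, 5)),
                    new HashSet<>(Arrays.asList(1, 2, 3, 4, 6)));
                System.out.println(clusterMean(cluster)); // prints 0.6666666666666666
            }
        }
        ```

        The aggregated mean is then just the average of clusterMean over all clusters (whether the patch uses Jaccard or another similarity is an assumption here).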

        As an example, given the following input:

        1 1 2 3 4 5
        2 1 2 3 4 6
        3 7 6 3 8 9
        4 7 8 9 6 1
        5 5 6 7 8 9
        6 8 7 5 6

        The first column is the user ID. For each user, the rest of the line lists the items preferred (browsed, listened to) by him. The contents of each cluster below follow the same format.

        We get the following output (parameters: 20 hash functions, 4 keygroups (hash indices in a bucket), 2 - minimum items within a cluster):

        CLUSTER ID – 2359983695385880352354530253637788 (items=2)
        =========================================
        2 1 2 3 4 6
        1 1 2 3 4 5

        CLUSTER ID – 236643825172184878353970117486898894 (items=2)
        =========================================
        4 7 8 9 6 1
        3 7 6 3 8 9

        CLUSTER ID – 35606006580772015548743126287496777 (items=2)
        =========================================
        6 8 7 5 6
        5 5 6 7 8 9

        CLUSTER ID – 38797144231157365543316465389702468 (items=2)
        =========================================
        6 8 7 5 6
        5 5 6 7 8 9

        The aggregated average over these clusters is 0.6793650793650793.

        I'm now testing on the last_fm dataset. The problem I currently encounter is that the clustered file is quite big, but I'm working on tuning the parameters.
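        The cluster IDs in the output above arise from concatenating groups of min-hash values into bucket keys. A compact, self-contained sketch of that idea (the hash coefficients, prime, and method names here are illustrative, not the patch's exact code):

        ```java
        import java.util.*;

        public class MinHashSketch {

            // Build cluster keys for one user's item set: take the minimum hash
            // per hash function h(x) = (ax + b) mod p, then concatenate each
            // group of keyGroups consecutive minima into one bucket key.
            static List<String> clusterKeys(int[] items, long[][] ab, int keyGroups) {
                long p = 2147483647L; // a large prime modulus (2^31 - 1)
                long[] min = new long[ab.length];
                Arrays.fill(min, Long.MAX_VALUE);
                for (int h = 0; h < ab.length; h++) {
                    for (int item : items) {
                        min[h] = Math.min(min[h], (ab[h][0] * item + ab[h][1]) % p);
                    }
                }
                List<String> keys = new ArrayList<>();
                for (int g = 0; g + keyGroups <= ab.length; g += keyGroups) {
                    StringBuilder key = new StringBuilder();
                    for (int i = g; i < g + keyGroups; i++) {
                        key.append(min[i]).append('-');
                    }
                    keys.add(key.toString());
                }
                return keys;
            }

            public static void main(String[] args) {
                Random rnd = new Random(42);  // shared seed, as in the patch's approach
                long[][] ab = new long[4][2]; // 4 hash functions h(x) = (ax + b) mod p
                for (long[] f : ab) {
                    f[0] = 1 + rnd.nextInt(1 << 20);
                    f[1] = rnd.nextInt(1 << 20);
                }
                // Users 1 and 2 from the example input share 4 of 6 items, so
                // they are likely to collide on at least one bucket key.
                System.out.println(clusterKeys(new int[]{1, 2, 3, 4, 5}, ab, 2));
                System.out.println(clusterKeys(new int[]{1, 2, 3, 4, 6}, ab, 2));
            }
        }
        ```

        Users whose keys collide land in the same cluster; grouping keyGroups hashes per key lowers the collision probability for dissimilar users, which is the effect Ankur describes above.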

        Hide
        Ankur added a comment -

        Drew, thanks for pitching in as I've been running super busy with some crap

        @Cristi
        That's right, but it's totally unnecessary, as each of the mappers can do its own initialization of the hash functions. They will be the same hash functions if they use the same seed for java.util.Random(). So the distributed cache can be removed altogether with that change. The code will be shorter and simpler.

        What is the min-cluster size you are using? How many hash functions? How many hashes are grouped together?
        We will need some tests to show how good the clusters are. As a start we can compute a simple metric like the average similarity of items within a cluster, aggregated over all clusters.
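        The seeded-initialization point can be sketched in plain Java: two mappers that build their hash functions from the same java.util.Random seed get identical functions, with no distributed cache needed. The class name, seed, and prime below are illustrative assumptions:

        ```java
        import java.util.Random;

        public class SeededHashFunctions {

            // Modulus for h(x) = (a * x + b) mod P, with P a large prime.
            static final long P = 2147483647L; // 2^31 - 1, a Mersenne prime

            // Same seed => same sequence from Random => same (a, b) coefficients.
            static long[][] makeFunctions(int count, long seed) {
                Random rnd = new Random(seed);
                long[][] ab = new long[count][2];
                for (long[] f : ab) {
                    f[0] = 1 + rnd.nextInt(Integer.MAX_VALUE - 1);
                    f[1] = rnd.nextInt(Integer.MAX_VALUE);
                }
                return ab;
            }

            static long hash(long[] f, long x) {
                return (f[0] * x + f[1]) % P;
            }

            public static void main(String[] args) {
                // Two "mappers" each independently build their own copies
                // from the shared seed, instead of reading a distributed file.
                long[][] mapper1 = makeFunctions(20, 12345L);
                long[][] mapper2 = makeFunctions(20, 12345L);
                for (int i = 0; i < 20; i++) {
                    if (hash(mapper1[i], 42) != hash(mapper2[i], 42)) {
                        throw new AssertionError("functions differ");
                    }
                }
                System.out.println("all 20 hash functions agree"); // prints this line
            }
        }
        ```

        In the MapReduce job, the shared seed would simply be passed through the job configuration, so every mapper reconstructs the same family of functions.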

        Hide
        Drew Farris added a comment -

        Hi Cristi,

        Sounds like a great start. Answers for a couple of your questions:

        Is there a standard formatting for the input of each clustering algorithm, or does the input format follow the same rules for all algorithms, with users writing conversion tools?

        Take a look at the various Vector classes in the math module and the VectorWritable wrapper. Most of the clustering algorithms take vectors of one kind or another as input, and the assumption is that users will write tools to convert their data to these common formats. The wiki page http://cwiki.apache.org/MAHOUT/creating-vectors-from-text.html is a good place to start.

        would it be ok if I attach the code which does an example of running min-hash clustering in the examples dir? (it would first convert the dataset format accordingly)

        Go for it, code is good, patches are even better, see: http://cwiki.apache.org/MAHOUT/howtocontribute.html#HowToContribute-Creatingthepatchfile and simply attach it to this issue.

        Hide
        Cristi Prodan added a comment -

        I ran the code on the last.fm data set (2.). Due to the nature of the data set, I had to write a small program that converts it to a format used by the algorithm. I have also clustered similar users instead of songs (the transformation I mentioned above was easier to do), as I wanted to see how the algorithm runs. I've used MurmurHash for mapping artist strings to integers, which can be used by the min-hash algorithm.

        Related to this, I would like to ask how Mahout usually deals with this kind of situation:

        • Is there a standard formatting for the input of each clustering algorithm, or does the input format follow the same rules for all algorithms, with users writing conversion tools?
        • would it be ok if I attach the code which does an example of running min-hash clustering in the examples dir? (it would first convert the dataset format accordingly)

        @Ankur: you are using a DistributedCache for sharing the hash functions. That requires the distributed file to be on HDFS, as far as I know. I believe it would be nice to have a flag or something that allows storing the hashes on a "normal" file system, for testing purposes. What do you think?

        Sorry, everybody, if I'm doing something wrong; it's the first time I'm contributing to an open source project.

        Hide
        Ankur added a comment -

        Appreciate your interest in this. I'd suggest that we pick a dataset to try this on and then make changes as required.
        For ideas:
        1. Clustering a p2p dataset like this - http://warsteiner.db.cs.cmu.edu/db-site/Datasets/graphData/eDoneky-p2p/ - to find nodes close to each other.
        2. Clustering similar items/songs in this - http://www.iua.upf.es/~ocelma/MusicRecommendationDataset/index.html for recommendations.

        Talking about missing things in the implementation:
        1. Option of more hash functions for users to experiment with.
        2. Code for cluster goodness evaluation (Precision/Recall tests?)
        3. Unit tests for completeness.

        Maybe other Mahout folks can take a quick look at the patch and suggest more ideas.

        Hide
        Cristi Prodan added a comment -

        I've studied the min-hash algorithm these past few days, along with your implementation a little bit. I've also been looking through Mahout's code, the wiki, how to contribute, etc.

        I'm thinking of trying my hand at submitting a patch to Mahout before submitting my proposal for GSoC. I would like to extend/improve this implementation. Could you please point out a way/idea on how I might do this? (I would leave its integration with Taste as a second task for me.)

        Thank you.

        Hide
        Ankur added a comment -

        As per "Yonik's law of patches" submitting my implementation. Please feel free to provide ideas for improvement or even submit an improved patch.


          People

          • Assignee:
            Ankur
            Reporter:
            Ankur
          • Votes:
            0 Vote for this issue
            Watchers:
            1 Start watching this issue

            Dates

            • Created:
              Updated:
              Resolved:

              Development