Progress: 59 of 59 issues have been resolved

First Minor Release

| Issue | Status | Summary |
|---|---|---|
| MAHOUT-75 | FIXED | asFormatString tests fail |
| MAHOUT-69 | FIXED | 0.1 RELEASE TODO |
| MAHOUT-53 | FIXED | Add documentation for Taste |
| MAHOUT-110 | FIXED | Ant script for building Taste web app |
| MAHOUT-82 | FIXED | Canopy map intermediate file structure should be keyed by canopyId. |
| MAHOUT-88 | FIXED | Convert to doubles, other changes |
| MAHOUT-23 | FIXED | Getting a row or column from a matrix view gives a row or column from the wrapped matrix. |
| MAHOUT-99 | FIXED | Improving speed of KMeans |
| MAHOUT-79 | FIXED | Improving the speed of Fuzzy K-Means by optimizing data transfer between map and reduce tasks |
| MAHOUT-15 | FIXED | Investigate Mean Shift Clustering |
| MAHOUT-112 | FIXED | Maven jetty plugin has been relocated |
| MAHOUT-20 | FIXED | Migrate Canopy and KMeans Implementations to Vectors |
| MAHOUT-6 | FIXED | Need a matrix implementation |
| MAHOUT-47 | FIXED | Point class is now redundant and should be removed |
| MAHOUT-58 | FIXED | Remove deprecated distance(Float[], Float[]), AbstractDistanceMeasure? |
| MAHOUT-37 | FIXED | Tarball for Mahout-ified Taste code |
| MAHOUT-80 | FIXED | Taste build fix |
| MAHOUT-89 | FIXED | To test hudson, I just made a 'Patch Available' issue. |
| MAHOUT-38 | FIXED | Ant job jar task |
| MAHOUT-27 | FIXED | Canopy/KMeans unit tests failing |
| MAHOUT-34 | FIXED | Iterator interface for Vectors |
| MAHOUT-118 | FIXED | Mahout needs to respect the file system type when getting a FileSystem for an input or output path |
| MAHOUT-26 | FIXED | Matrix implementation bug fix and little addition |
| MAHOUT-33 | FIXED | Matrix tests share code that can be placed in an abstract class |
| MAHOUT-49 | FIXED | ParameterEnumerable |
| MAHOUT-42 | FIXED | Tanimoto coefficient distance measure |
| MAHOUT-50 | FIXED | Vector extends Writable |
| MAHOUT-39 | FIXED | Vector improvements |
| MAHOUT-41 | FIXED | VectorWritable |
| MAHOUT-36 | FIXED | WeightedDistanceMeasure |
| MAHOUT-86 | FIXED | A New Vector Assignment Operator |
| MAHOUT-92 | FIXED | BayesFeatureMapper doesn't properly extract features |
| MAHOUT-3 | FIXED | Build initial canopy clustering prototype |
| MAHOUT-60 | FIXED | MAHOUT-9 Complementary Naive Bayes |
| MAHOUT-74 | FIXED | Fuzzy K-Means clustering |
| MAHOUT-9 | FIXED | Implement MapReduce BayesianClassifier |
| MAHOUT-5 | FIXED | Implement a k-means clustering prototype |
| MAHOUT-25 | FIXED | Minor bugs/issues from code inspection |
| MAHOUT-104 | FIXED | Move to Maven for Build, drop Ant |
| MAHOUT-10 | FIXED | Replace fall-through exception handlers with propagated unchecked exception. |
| MAHOUT-72 | FIXED | Separate out Examples from Core |
| MAHOUT-52 | FIXED | Standardize on java.util.logging, Commons Logging, log4j? |
| MAHOUT-55 | FIXED | Update to Hadoop 0.16.4 |
| MAHOUT-102 | FIXED | Use Watchmaker 0.5.0 instead of 0.4.3 |
| MAHOUT-95 | FIXED | UserSimilarity-based NearestNNeighborhood |
| MAHOUT-56 | FIXED | Watchmaker Integration |
| MAHOUT-91 | FIXED | Wikipedia Example has incorrect input Key |
| MAHOUT-62 | FIXED | generate html test results |
| MAHOUT-17 | FIXED | Maven support |
| MAHOUT-44 | FIXED | Override zSum and dot for SparseVector |
| MAHOUT-22 | FIXED | Several matrix exceptions are checked exceptions, but should be unchecked |
| MAHOUT-13 | FIXED | Investigate Mahout jar loading |
| MAHOUT-57 | FIXED | Mahout Project Logo |
| MAHOUT-1 | FIXED | Mahout site doesn't link to mailing list archives. |
| MAHOUT-12 | FIXED | Point formatting and parsing improved (StringBuilder, no need for trailing comma). |
| MAHOUT-111 | FIXED | Redirect Test output to file |
| MAHOUT-48 | FIXED | isConverged() and converge flag OK? |
| MAHOUT-51 | FIXED | Upgrade to Hadoop 0.16.3 |
| MAHOUT-87 | FIXED | Upgrade to Hadoop 0.18.1 |

Progress: 61 of 61 issues have been resolved

| Issue | Status | Summary |
|---|---|---|
| MAHOUT-159 | FIXED | SparseVector and DenseVector hashCode does not conform to the Java standard |
| MAHOUT-35 | FIXED | Benchmark performance of Vector.iterator() when reusing Element instances. |
| MAHOUT-161 | FIXED | Add Vector.norm to compute k-norms of vectors |
| MAHOUT-162 | FIXED | Added support for mapping String to long IDs in CF code |
| MAHOUT-136 | FIXED | Change Canopy MR Implementation to use Vector Writable |
| MAHOUT-186 | FIXED | Classifier PriorityQueue returns erroneous results |
| MAHOUT-188 | FIXED | Cleanup of Bayes/CBayes for 0.2 |
| MAHOUT-198 | FIXED | Cleanup pom, remove lib dependencies, etc. |
| MAHOUT-160 | FIXED | ClusterDumper utility to output all the clusters in all sequence files and points |
| MAHOUT-148 | FIXED | Convert Classification Algs to use richer Writable syntax |
| MAHOUT-137 | FIXED | Convert Clustering Algs to use Vector Writable |
| MAHOUT-181 | FIXED | DistanceMeasure is broken: iteration is done over nonZeroElements of v1.plus(v2), not v1.minus(v2) |
| MAHOUT-170 | FIXED | Enable Java compile optimize flag during build |
| MAHOUT-157 | FIXED | Frequent Pattern Mining using Parallel FP-Growth |
| MAHOUT-123 | FIXED | Implement Latent Dirichlet Allocation |
| MAHOUT-115 | FIXED | Interpolated Knn and SVD Recommender |
| MAHOUT-7 | FIXED | Lucene indexes should act as matrix factories |
| MAHOUT-118 | FIXED | Mahout needs to respect the file system type when getting a FileSystem for an input or output path |
| MAHOUT-139 | FIXED | Make use of Vector Iterator capabilities where appropriate |
| MAHOUT-171 | FIXED | Move deployment to repository.apache.org |
| MAHOUT-124 | FIXED | Online Classification using HBase |
| MAHOUT-126 | FIXED | Prepare document vectors from the text |
| MAHOUT-122 | FIXED | Random Forests Reference Implementation |
| MAHOUT-187 | FIXED | RandomUtils>>isNotPrime throws IllegalArgumentException when argument is less than 2. |
| MAHOUT-154 | FIXED | Reduce memory usage with smarter data structures |
| MAHOUT-158 | FIXED | Replace all ID values with long |
| MAHOUT-121 | FIXED | Speed up distance calculations for sparse vectors |
| MAHOUT-149 | FIXED | The Great User/Item Removal Phase 1: de-generify implementations |
| MAHOUT-150 | FIXED | The Great User/Item Removal Phase 2 |
| MAHOUT-151 | FIXED | The Great User/Item Removal Phase 3 |
| MAHOUT-172 | FIXED | When running on a Hadoop cluster LDA fails with Caused by: java.io.IOException: Cannot open filename /user/*/output/state-*/_logs |
| MAHOUT-183 | FIXED | WikipediaXmlSplitter spits one chunk per line |
| MAHOUT-134 | FIXED | [PATCH] Cluster decode error handling |
| MAHOUT-133 | FIXED | [PATCH] Kmeans Clustering Example Tidy Up |
| MAHOUT-132 | FIXED | [PATCH] Push magic names into public constants |
| MAHOUT-108 | WON'T FIX | Implementation of Association Rules learning by Apriori algorithm |
| MAHOUT-164 | FIXED | "Potpourri": a collection of small possible bugs and improvements |
| MAHOUT-135 | FIXED | Allow FileDataModel to transpose users and items |
| MAHOUT-113 | FIXED | CDInfosToolTest.testGatherInfos failure in Mahout examples |
| MAHOUT-184 | FIXED | Code tweaks for .df.* code |
| MAHOUT-138 | FIXED | Convert main() methods to use Commons CLI for argument processing |
| MAHOUT-177 | FIXED | Fix for "java.lang.ClassNotFoundException Exception" |
| MAHOUT-140 | FIXED | In-memory mapreduce Random Forests |
| MAHOUT-146 | FIXED | Make Wikipedia Example Classifier more generic |
| MAHOUT-199 | FIXED | Parent POM missing in public maven repository |
| MAHOUT-145 | FIXED | PartialData mapreduce Random Forests |
| MAHOUT-166 | FIXED | Potpourri 2 |
| MAHOUT-178 | FIXED | Rationalize 'utils' and 'common' stuff |
| MAHOUT-114 | FIXED | Release Process Needs to sign published dependencies such as Hadoop, etc. |
| MAHOUT-176 | FIXED | Remove VectorIterable in favor of just using Iterable<Vector> |
| MAHOUT-127 | FIXED | Remove warnings |
| MAHOUT-179 | FIXED | Taste Demo Help |
| MAHOUT-174 | FIXED | Unify Pair implementation, Random number generation |
| MAHOUT-200 | FIXED | Update information on Mahout site |
| MAHOUT-142 | FIXED | Upgrade to Hadoop 0.20.0 |
| MAHOUT-175 | FIXED | Use IOUtils, FileLineIterable/Iterator across the project |
| MAHOUT-131 | FIXED | Vector improvements |
| MAHOUT-147 | FIXED | Wikipedia Example improvements |
| MAHOUT-107 | FIXED | make MySQLJDBCDataModel not final |
| MAHOUT-77 | WON'T FIX | DistanceMeasure calculation slow for SparseVector |
| MAHOUT-93 | DUPLICATE | Refactor Bayes and CBayes to share more common code |

Progress: 8 of 21 issues have been resolved

| Issue | Status | Summary |
|---|---|---|
| MAHOUT-185 | UNRESOLVED | Add mahout shell script for easy launching of various algorithms |
| MAHOUT-204 | UNRESOLVED | Better integration of Mahout matrix capabilities with Colt Matrix additions |
| MAHOUT-163 | UNRESOLVED | Get (better) cluster labels using Log Likelihood Ratio |
| MAHOUT-206 | UNRESOLVED | Separate and clearly label different SparseVector implementations |
| MAHOUT-193 | UNRESOLVED | SparseVectors created with the wrong cardinality in SparseMatrix |
| MAHOUT-11 | UNRESOLVED | Static fields used throughout clustering code (Canopy, K-Means). |
| MAHOUT-165 | UNRESOLVED | Using better primitives hash for sparse vector for performance gains |
| MAHOUT-208 | UNRESOLVED | Vector.getLengthSquared() is dangerously optimized |
| MAHOUT-207 | UNRESOLVED | AbstractVector.hashCode() should not care about the order of iteration over elements |
| MAHOUT-155 | UNRESOLVED | ARFF VectorIterable |
| MAHOUT-209 | UNRESOLVED | Add aggregate() methods for Vector |
| MAHOUT-205 | UNRESOLVED | Pull Writable (and anything else hadoop dependent) out of the matrix module |
| MAHOUT-106 | UNRESOLVED | PLSI/EM in pig based on hofmann's ACM 04 paper. |
| MAHOUT-192 | FIXED | RMSRecommenderEvaluator does not catch NoSuchItemException |
| MAHOUT-201 | DUPLICATE | OrderedIntDoubleMapping / SparseVector is unnecessarily slow |
| MAHOUT-190 | FIXED | Make all instance fields private |
| MAHOUT-182 | FIXED | New helper methods for Matrix: times(Vector), timesSquared(Vector), numRows() and numCols() |
| MAHOUT-125 | FIXED | Remove Deprecated Ant builds |
| MAHOUT-189 | FIXED | Standardize use of assert keyword |
| MAHOUT-196 | FIXED | bounded values for RecommenderEvaluator |
| MAHOUT-195 | FIXED | doubt about SlopeOneRecommender |