Spark / SPARK-20072

Clarify ALS-WR documentation

Details

    • Type: Improvement
    • Status: Resolved
    • Priority: Trivial
    • Resolution: Incomplete
    • Affects Version/s: 2.1.0
    • Fix Version/s: None
    • Component/s: Documentation

    Description

      https://www.mail-archive.com/user@spark.apache.org/msg62590.html

      The documentation for collaborative filtering is as follows:

      ===
      Scaling of the regularization parameter

      Since v1.1, we scale the regularization parameter lambda in solving
      each least squares problem by the number of ratings the user generated
      in updating user factors, or the number of ratings the product
      received in updating product factors.
      ===

      I find this description confusing, probably because I lack a detailed
      understanding of ALS. The wording suggests that the number of ratings
      changes ("generated", "received") while the least squares problem is
      being solved.

      This is how I think I should be interpreting the description:

      ===
      Since v1.1, we scale the regularization parameter lambda when solving
      each least squares problem. When updating the user factors, we scale
      the regularization parameter by the total number of ratings from the
      user. Similarly, when updating the product factors, we scale the
      regularization parameter by the total number of ratings for the
      product.
      ===
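
To make that interpretation concrete, here is a minimal NumPy sketch of a single user-factor update as I understand it. It is not Spark's actual code: the function name `update_user_factor`, its arguments, and the toy data are hypothetical, and it assumes the usual explicit-feedback normal equations, with lambda multiplied by the fixed count of ratings from that user.

```python
import numpy as np


def update_user_factor(item_factors, ratings, reg_param):
    """Solve one user's regularized least squares problem (illustrative only).

    item_factors: (n_u, k) factors of the products this user has rated
    ratings:      (n_u,)   the user's ratings for those products
    reg_param:    the unscaled regularization parameter lambda
    """
    # n_u is the total number of ratings from this user. It is a fixed count
    # taken from the data and does not change while the problem is solved.
    n_u, k = item_factors.shape
    # Normal equations with lambda scaled by n_u (assumed form).
    a = item_factors.T @ item_factors + reg_param * n_u * np.eye(k)
    b = item_factors.T @ ratings
    return np.linalg.solve(a, b)


# Toy data: one user who rated three products, with rank-2 factors.
item_factors = np.array([[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]])
ratings = np.array([5.0, 3.0, 4.0])
user_factor = update_user_factor(item_factors, ratings, reg_param=0.1)
```

Under the same assumption, the product-factor update would be symmetric: swap in the factors of the users who rated the product and scale lambda by the total number of ratings for that product.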

          People

            Assignee: Unassigned
            Reporter: chris snow (snowch)