Uploaded image for project: 'Spark'
  1. Spark
  2. SPARK-22449

Add BIC for GLM

    XMLWordPrintableJSON

Details

    • Improvement
    • Status: Resolved
    • Minor
    • Resolution: Later
    • 2.2.0
    • None
    • ML
    • None

    Description

      Currently, we only have AIC for GLM. BIC is another "similar" criterion widely used and implemented in all major statical tools.

      Postive reasons:
      1. Completeness.
      2. Useful for some users.

      Negative reasons:
      1. Not sure how many users would actually use BIC.

      Possible Implementation:
      1. Duplicate AIC's methods. Calculate penalty term independently. Pros: safe & consistent. Cons: duplication.
      2. Let AIC & BIC share the log likelihood by a same method. Calculate penalty term independently.
      Pros: similar to scikit learn. No duplication. Cons: less safe & consistent.

      Reference:
      1. https://stats.stackexchange.com/questions/577/is-there-any-reason-to-prefer-the-aic-or-bic-over-the-other
      2.http://users.stat.umn.edu/~yangx374/papers/Pre-Print_2003-10_Biometrika.pdf

      Thoughts?

      Attachments

        Activity

          People

            Unassigned Unassigned
            Teng Peng Teng Peng
            Votes:
            0 Vote for this issue
            Watchers:
            0 Start watching this issue

            Dates

              Created:
              Updated:
              Resolved: