Apache MADlib / MADLIB-1384

Change default n_components for SVM to max(100, 2*num_features)

Details

    Description

      Currently, http://madlib.apache.org/docs/latest/group__grp__svm.html#kernel_params says:

      n_components
      Default: 2*num_features. The dimensionality of the transformed feature space. A larger value lowers the variance of the estimate of the kernel but requires more memory and takes longer to train.
      

      However, a default of 2*num_features produces poor decision boundaries when num_features is small. I suggest we change the default to:

      n_components
      Default: max(100, 2*num_features). The dimensionality of the transformed feature space. A larger value lowers the variance of the estimate of the kernel but requires more memory and takes longer to train.
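
To make the effect of the proposal concrete, here is a minimal sketch comparing the documented default with the proposed one. The function names `current_default`, `proposed_default`, and `compare_defaults` are hypothetical helpers written for this illustration; they are not part of MADlib, and the actual change would live in the SVM module itself.

```python
def current_default(num_features: int) -> int:
    """Default documented today: 2 * num_features."""
    return 2 * num_features


def proposed_default(num_features: int, floor: int = 100) -> int:
    """Default proposed in this issue: max(100, 2 * num_features)."""
    return max(floor, 2 * num_features)


def compare_defaults(feature_counts):
    """Return (num_features, current, proposed) rows for each count."""
    return [(n, current_default(n), proposed_default(n)) for n in feature_counts]


if __name__ == "__main__":
    # Hypothetical feature counts chosen to straddle the crossover point.
    for n, cur, new in compare_defaults([2, 10, 50, 51, 200]):
        print(f"num_features={n:>3}  current={cur:>3}  proposed={new:>3}")
```

The two defaults coincide once num_features reaches 50, so the proposal only changes behavior for lower-dimensional inputs, which is where the poor decision boundaries were observed. In the meantime, setting n_components explicitly in the kernel parameters should work around the small default.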
      

          People

            Assignee: Unassigned
            Reporter: Frank McQuillan (fmcquillan)