Uploaded image for project: 'Apache MADlib'
  1. Apache MADlib
  2. MADLIB-1332

DL: Support mini-batched validation data for fit/evaluate

    XMLWordPrintableJSON

Details

    • Improvement
    • Status: Closed
    • Major
    • Resolution: Fixed
    • None
    • v1.16
    • Deep Learning
    • None

    Description

      Currently, keras_evaluate() is implemented by calling internal_keras_evaluate() as a UDF.  This requires the validation table passed to keras_fit() to be in a format with only 1 image per row, even though the training table is in a different format, with a batch of images in every row.  This is potentially confusing and cumbersome for users to deal with, and based on some preliminary testing it seems that passing only 1 image at a time to keras_evaluate() is also slowing down performance.

      We can solve this by converting internal_keras_evaluate() into a UDA, so that it runs on a minibatched validation table in the same form as the training table.

       

      Tasks:

      • Convert the internal_keras_evaluate UDF to a UDA and perform weighted averaging of loss and accuracy.
      • Since x and y will now be minibatched, we don't need to add another dimension to x and y np arrays in internal_keras_evaluate.
      • Compare UDF to UDA and verify that the UDA results in a speed improvement

       

      Attachments

        Activity

          People

            jingyimei Jingyi Mei
            dvaldano Domino Valdano
            Votes:
            0 Vote for this issue
            Watchers:
            2 Start watching this issue

            Dates

              Created:
              Updated:
              Resolved: