Uploaded image for project: 'Mahout'
  1. Mahout
  2. MAHOUT-1802

Capture attached checkpoints (if cached)

    XMLWordPrintableJSON

Details

    • Improvement
    • Status: Closed
    • Major
    • Resolution: Fixed
    • 0.11.1
    • 0.11.2
    • None
    • None

    Description

      Currently, the optimizer generates checkpoints and attaches them to actual logical elements of the DAG via CheckpointAction$cp.

      the way it worsk today is as follows:

      drmC = drmA+ drmB
      
      val cp1 = drmC.checkpoint() // checkpoint
      val cp2 = drmC.checkpoint() // cp2 == cp1
      
      drmD = cp1 + drmE // cp1 + drmE
      

      but, in:

      drmD = drmC + drmE // computes drmA + drmB + drmC all over
      

      drmC already has cp1 attached to it so we should assume the common computational path is the intent here regardless and should be used, instead of building plans that recompute it. That is,

      drmD = drmC + drmE should imply cp1 + drmE as well even if checkpoint is not used explicitly.

      Attachments

        Activity

          People

            Andrew_Palumbo Andrew Palumbo
            Andrew_Palumbo Andrew Palumbo
            Votes:
            0 Vote for this issue
            Watchers:
            3 Start watching this issue

            Dates

              Created:
              Updated:
              Resolved: