Uploaded image for project: 'Beam'
  1. Beam
  2. BEAM-2326

Verbose INFO logging with stateful DoFns and Dataflow

Details

    • Bug
    • Status: Resolved
    • P2
    • Resolution: Fixed
    • 0.6.0
    • 2.0.0
    • runner-dataflow

    Description

      I am seeing a lot of INFO level logging:

       jsonPayload: {
        logger: "com.google.cloud.dataflow.worker.runners.worker.BatchModeUngroupingParDoFn" 
        message: "Processing timers for key {} for stateful DoFn"    
       }
       jsonPayload: {
        message: "Processing key KV{one of my keys} for stateful DoFn"    
        logger: "com.google.cloud.dataflow.worker.runners.worker.BatchModeUngroupingParDoFn"    
      }
      

      out of one of my stateful DoFn. There is one such group of logs for each key I process, which leads to a very large amount of logs and possibly to a significant slowdown.

      Also, not sure if the Processing timers log message is missing some string interpolation or if the empty key is on purpose.

      At any rate, this seems more like something for DEBUG than INFO given the large volume.

      Attachments

        Activity

          People

            kenn Kenneth Knowles
            peay peay
            Votes:
            0 Vote for this issue
            Watchers:
            3 Start watching this issue

            Dates

              Created:
              Updated:
              Resolved: