Uploaded image for project: 'Samza'
  1. Samza
  2. SAMZA-572

SamzaContainer checkpoints and windows immediately on startup

    XMLWordPrintableJSON

Details

    • Bug
    • Status: Resolved
    • Major
    • Resolution: Fixed
    • 0.9.0
    • 0.10.0
    • container

    Description

      I noticed that the SamzaContainer checkpoints immediately on startup. This is kind of useless since it has either processed only 0 or 1 messages. The issue is that last window and last commit are both set to 0:

        private var lastWindowMs = 0L
        private var lastCommitMs = 0L
      

      The logic that's triggered after the RunLoop starts is:

            if (commitMs >= 0 && lastCommitMs + commitMs < clock()) {
      

      Of course this is always true, since lastCommitMs + commitMs will always be < clock() when lastCommitMs starts at 0. The same logic is triggered for window.

      It seems like we should set lastCommitMs/lastWindowMs to clock() if they're 0 (or just set them to clock() on instantiation).

      I noticed this by running a kafka-console-consumer against hello-samza. When the job started, I saw:

        {"type":"changelog-partition-mapping"}	{"Partition 0":0}
        {"systemstreampartition-grouper-factory":"org.apache.samza.container.grouper.stream.GroupByPartitionFactory","taskName":"Partition 0","type":"checkpoint"}	{}
        {"systemstreampartition-grouper-factory":"org.apache.samza.container.grouper.stream.GroupByPartitionFactory","taskName":"Partition 0","type":"checkpoint"}	{"SystemStreamPartition [kafka, wikipedia-raw, 0]":{"system":"kafka","partition":"0","offset":"179","stream":"wikipedia-raw"}}
      

      The first empty checkpoint happened immediately on startup, before any messages had been processed. Since the job was being started for the first time, OffsetManager.lastProcessedOffsets was empty, and it immediately checkpointed an empty map. This shouldn't result in data loss, it's just annoying and useless.

      Attachments

        1. SAMZA-572.0.patch
          4 kB
          József Márton Jung
        2. SAMZA-572.1.patch
          4 kB
          József Márton Jung

        Activity

          People

            jjung József Márton Jung
            criccomini Chris Riccomini
            Votes:
            0 Vote for this issue
            Watchers:
            4 Start watching this issue

            Dates

              Created:
              Updated:
              Resolved: