  1. Apache Storm
  2. STORM-960

Hive-Bolt can lose tuples when flushing data

Details

    • Type: Improvement
    • Status: Resolved
    • Priority: Minor
    • Resolution: Fixed
    • Affects Version/s: None
    • Fix Version/s: 1.0.0
    • Component/s: storm-hive
    • Labels: None

    Description

      In HiveBolt's execute method, tuples are ack'd as they are received. Once a full batch (batchsize tuples) has been received, the writers are flushed. However, if the flush fails, only the most recent tuple is marked as failed; all earlier tuples in the batch have already been ack'd, so Storm will not replay them. This creates a window for data loss.
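
One way to close the window described above is to hold tuples in a pending list and ack them only after the flush succeeds, failing the whole batch otherwise. The following is a minimal sketch of that idea, not the actual storm-hive code or the patch applied for this issue. `AckSink`, `BatchWriter`, and `DeferredAckBuffer` are hypothetical stand-ins for Storm's collector and the Hive writers, introduced only for illustration.

```java
import java.util.ArrayList;
import java.util.List;

// Hypothetical stand-in for the ack/fail half of a Storm output collector.
interface AckSink<T> {
    void ack(T tuple);
    void fail(T tuple);
}

// Hypothetical stand-in for a writer that buffers records until flushed.
interface BatchWriter<T> {
    void write(T tuple) throws Exception;
    void flush() throws Exception;
}

// Defers acks until the batch containing the tuple has been flushed.
final class DeferredAckBuffer<T> {
    private final int batchSize;
    private final BatchWriter<T> writer;
    private final AckSink<T> sink;
    private final List<T> pending = new ArrayList<>();

    DeferredAckBuffer(int batchSize, BatchWriter<T> writer, AckSink<T> sink) {
        this.batchSize = batchSize;
        this.writer = writer;
        this.sink = sink;
    }

    void execute(T tuple) {
        // Track the tuple before writing so a write failure also fails it.
        pending.add(tuple);
        try {
            writer.write(tuple);
            if (pending.size() >= batchSize) {
                writer.flush();
                // Flush succeeded: every tuple in the batch is now durable.
                for (T t : pending) {
                    sink.ack(t);
                }
                pending.clear();
            }
        } catch (Exception e) {
            // Write or flush failed: fail the whole batch so all of it is replayed.
            for (T t : pending) {
                sink.fail(t);
            }
            pending.clear();
        }
    }
}
```

The trade-off in this sketch is that tuples stay un-acked for as long as a batch is filling, so the topology's message timeout would need to be longer than the time a batch can take to fill, or a partial batch would need to be flushed periodically.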

          People

            Assignee: Aaron Blake Niskode-Dossett
            Reporter: Aaron Dossett
            Votes: 0
            Watchers: 4