ATLAS-4602

Report Lag for consuming from ATLAS_HOOK

Details

    • Type: Improvement
    • Status: Open
    • Priority: Major
    • Resolution: Unresolved
    • Affects Version/s: 2.2.0
    • Fix Version/s: trunk
    • Component/s: atlas-core
    • Labels: None

    Description

      Currently, the 'Stats' page in the web UI shows some details about consumption from the ATLAS_HOOK Kafka topic, where changes from the Hive Metastore arrive.

      However, by far the most important metric is missing: the lag of the Atlas server's consumer group in consuming Hive updates.

      Monitoring the lag is very important because trust in Atlas is greatly undermined when changes are not reflected in Atlas within seconds. On numerous occasions I have seen ATLAS_HOOK consumption slow down silently, leaving Atlas behind by tens of thousands of messages (or 2 days' worth).

      The Stats page should show a new lag metric so that a likely cause of slow Atlas updates can be identified quickly.
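
      To make the requested metric concrete, here is a minimal sketch of how the lag could be computed, assuming the per-partition end offsets of ATLAS_HOOK and the committed offsets of the Atlas consumer group have already been fetched (for example through Kafka's admin API). The function names and the sample numbers are hypothetical and not part of Atlas; treating a partition without a committed offset as fully unread is also an assumption.

```python
from typing import Dict, Optional, Tuple

# (topic name, partition number); a hypothetical alias used only in this sketch
TopicPartition = Tuple[str, int]


def partition_lags(
    end_offsets: Dict[TopicPartition, int],
    committed_offsets: Dict[TopicPartition, Optional[int]],
) -> Dict[TopicPartition, int]:
    """Return the lag per partition: end offset minus committed offset."""
    lags = {}
    for tp, end in end_offsets.items():
        committed = committed_offsets.get(tp)
        # Assumption: no committed offset means nothing was consumed yet,
        # so the whole partition counts as lag.
        consumed = 0 if committed is None else committed
        # Clamp at zero in case the two snapshots were taken at different times.
        lags[tp] = max(end - consumed, 0)
    return lags


def total_lag(
    end_offsets: Dict[TopicPartition, int],
    committed_offsets: Dict[TopicPartition, Optional[int]],
) -> int:
    """Return the lag summed over all partitions of the topic."""
    return sum(partition_lags(end_offsets, committed_offsets).values())


if __name__ == "__main__":
    # Made-up offsets for a two-partition ATLAS_HOOK topic.
    end = {("ATLAS_HOOK", 0): 1500, ("ATLAS_HOOK", 1): 900}
    committed = {("ATLAS_HOOK", 0): 1200, ("ATLAS_HOOK", 1): None}
    print(partition_lags(end, committed))
    print(total_lag(end, committed))
```

      A single summed value like this would probably be enough for the Stats page; the per-partition breakdown would help to spot one stuck partition.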

      Attachments

        1. image-2022-05-11-17-42-12-250.png
          101 kB
          Jasper Knulst

          People

            Assignee: Unassigned
            Reporter: Jasper Knulst (jasperknulst)