  1. James Server
  2. JAMES-3107

Log request when P99 is exceeded

    Details

    • Type: New Feature
    • Status: Closed
    • Priority: Major
    • Resolution: Done
    • Affects Version/s: None
    • Fix Version/s: 3.5.0
    • Component/s: Metrics
    • Labels:
      None

      Description

      Given our current tooling I struggle to correctly review slow requests from James.

      My current procedure is:

      • In Grafana, identify the timestamp of a spike
      • Grok through the logs in Kibana until I find something that could correspond
      • Pray and hope my analysis stands.

      This is time-consuming, hard to do, and unreliable.

      Identifying slow queries is important, as it can point us to the critical paths to optimize.

      Hence I propose logging an INFO message whenever the p99 is exceeded for a high-level function (JMAP methods, IMAP processors, matchers, mailets and overall mail processing, mailbox listeners, and remote delivery).

      To avoid log spam, I propose logging only when a per-function threshold (defaulting to 100 ms) is also exceeded.
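
      As a rough illustration of the proposed rule, here is a minimal sketch in which a call is logged only when it is slower than both the p99 of recent calls and the per-function threshold. The class name SlowCallDetector, the sliding window of samples, and the Consumer<String> log sink are all assumptions made for this example; they are not the James implementation, which would more plausibly read the percentile from its existing metrics.

```java
import java.util.ArrayDeque;
import java.util.Deque;
import java.util.function.Consumer;

// Hypothetical helper, not part of James: flags a call that is slower than
// both the p99 of recent calls and a fixed per-function threshold.
final class SlowCallDetector {
    private final String name;
    private final long thresholdMs;
    private final int windowSize; // assumed to be at least 1
    private final Consumer<String> log;
    private final Deque<Long> window = new ArrayDeque<>();

    SlowCallDetector(String name, long thresholdMs, int windowSize, Consumer<String> log) {
        this.name = name;
        this.thresholdMs = thresholdMs;
        this.windowSize = windowSize;
        this.log = log;
    }

    // Records one call duration. Returns true when a message was logged.
    synchronized boolean record(long elapsedMs) {
        // The p99 is computed from earlier calls only, so nothing is
        // reported until at least one sample has been seen.
        boolean slow = !window.isEmpty()
            && elapsedMs > p99()
            && elapsedMs > thresholdMs;
        if (slow) {
            log.accept(String.format("%s took %d ms (p99=%d ms, threshold=%d ms)",
                name, elapsedMs, p99(), thresholdMs));
        }
        window.addLast(elapsedMs);
        if (window.size() > windowSize) {
            window.removeFirst(); // keep only the most recent samples
        }
        return slow;
    }

    // Nearest-rank p99 over the samples currently in the window.
    // Only called when the window is non-empty.
    private long p99() {
        long[] sorted = window.stream().mapToLong(Long::longValue).sorted().toArray();
        int rank = (99 * sorted.length + 99) / 100; // ceil(0.99 * n) in integer math
        return sorted[rank - 1];
    }
}
```

      One detector would be created per high-level function, for example new SlowCallDetector("IMAP-FETCH", 100, 1000, sink), where sink could forward to a logger's info method. Sorting the window on every call keeps the sketch short but is wasteful; a real implementation would likely rely on a histogram that is already maintained for metrics.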

      I believe it will help us come up with more meaningful performance analyses and better fixes, for the greater good of our production platforms.

            People

            • Assignee:
              Unassigned
              Reporter:
              btellier Benoit Tellier
            • Votes:
              0
              Watchers:
              2
