Performance Metrics Language
6 exercises — describe latency, throughput, error rates, and resource utilization with precise before/after comparisons, improvement factors, and business translations.
Performance metrics narration pattern
- Structure: "reduced [metric] from [before] to [after] — a [N×] improvement — by [change]"
- Improvement factor: before ÷ after (latency: 1,400ms ÷ 320ms ≈ 4.4×)
- Percentage reduction: (before − after) / before × 100 (latency: (1,400 − 320) / 1,400 × 100 ≈ 77% reduction)
- Percentile: always specify P50 / P95 / P99 — never just "latency"
- Throughput units: rps / qps / tpm / mps depending on context
- Business translation: Technical metric → user experience → revenue/cost impact
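The two formulas in the pattern above can be checked with a short Python sketch. The function names `improvement_factor` and `percent_reduction` are illustrative choices, not part of the lesson; the inputs are the 1,400ms and 320ms figures used in the pattern.

```python
def improvement_factor(before: float, after: float) -> float:
    """Improvement factor: before divided by after."""
    if after <= 0:
        raise ValueError("after must be positive")
    return before / after


def percent_reduction(before: float, after: float) -> float:
    """Percentage reduction: (before - after) / before * 100."""
    if before <= 0:
        raise ValueError("before must be positive")
    return (before - after) / before * 100


factor = improvement_factor(1400, 320)    # 4.375
reduction = percent_reduction(1400, 320)  # about 77.14
print(f"{factor:.1f}x improvement, {reduction:.0f}% reduction")
# prints: 4.4x improvement, 77% reduction
```

Note that the two numbers describe the same change in different ways: a 4.4× improvement corresponds to a 77% reduction, not a 440% one.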
1 / 6
You want to describe a latency improvement. Which statement is most precise?
Option C — identifies the specific metric (P99), the specific endpoint (search), the before and after values, the improvement factor, and the cause.
Latency improvement narrative structure:
"We reduced [metric] on [endpoint/service] from [before] to [after] — a [factor]× improvement — by [change made]."
Calculating improvement factor:
1,400ms ÷ 320ms = 4.375× ≈ 4.4× improvement
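As a rough illustration, the narrative structure and the factor calculation can be combined in one small helper. This is only a sketch: the function name `narrate` and the change text "adding a result cache" are made-up placeholders, since the lesson does not say what change produced the improvement.

```python
def narrate(metric: str, target: str, before_ms: float,
            after_ms: float, change: str) -> str:
    """Fill the narrative template with before/after values in milliseconds."""
    if after_ms <= 0:
        raise ValueError("after_ms must be positive")
    factor = before_ms / after_ms  # improvement factor: before / after
    dash = "\u2014"  # the dash character used in the template
    return (
        f"We reduced {metric} on {target} "
        f"from {before_ms:,.0f}ms to {after_ms:,.0f}ms "
        f"{dash} a {factor:.1f}× improvement {dash} by {change}."
    )


# "adding a result cache" is a hypothetical change, used only for illustration.
sentence = narrate("P99 latency", "the search endpoint", 1400, 320,
                   "adding a result cache")
print(sentence)
```

Rounding the factor to one decimal place keeps the sentence readable; the unrounded value (4.375) is rarely worth quoting.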
Latency vocabulary:
• "reduced latency from X to Y" — the core phrase; "reduced" is active, clear
• "an N× improvement" — improvement factor (divide before by after)
• "cut latency by X%" — percentage reduction: (1 − 320/1,400) × 100 ≈ 77% reduction
• "latency came down from X to Y" — neutral, passive-ish
• "moved from X to Y on P99" — acceptable; less active than "reduced"
Specifying which percentile:
Always name the percentile (P50, P95, or P99); never say just "latency" in a performance review:
• "P50 improved from 80ms to 45ms — median user experience improved"
• "P99 dropped from 1.4s to 320ms — tail latency is no longer a concern"
• "P95 is still at 800ms — we haven't addressed the slow /report queries yet"
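To see why the percentile matters, here is a minimal Python sketch that computes P50, P95, and P99 from raw samples using the nearest-rank method. The nearest-rank definition is one common convention among several, and the sample latencies are invented for illustration; monitoring tools may interpolate and report slightly different values.

```python
import math


def percentile(samples: list[float], p: float) -> float:
    """Nearest-rank percentile: the smallest sample such that at least
    p percent of all samples are less than or equal to it."""
    if not samples:
        raise ValueError("samples must not be empty")
    if not 0 < p <= 100:
        raise ValueError("p must be in the range (0, 100]")
    ordered = sorted(samples)
    rank = math.ceil(p * len(ordered) / 100)  # 1-based rank
    return ordered[rank - 1]


# Made-up latencies in ms: most requests are fast, one is very slow.
latencies_ms = [40, 42, 45, 45, 47, 50, 55, 60, 120, 900]

for p in (50, 95, 99):
    print(f"P{p}: {percentile(latencies_ms, p)}ms")
# prints:
# P50: 47ms
# P95: 900ms
# P99: 900ms
```

With these samples the median looks healthy at 47ms while the tail sits at 900ms, which is exactly the gap that a bare "latency" figure hides.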