Help me design coverage metrics

hewerlin · May 27, 2026, 6:48pm

I’m building a Threat Modeling Tool and I am facing the following problem:

There are a bunch of threat models. Each has threats. Threats have their associated mitigations.

I see all of those threat models in an overview.

I want to answer: How far along are they? Then show, in a red-yellow-green gradient, whether threat models are mature / have good progress.

Threat models that suffer from admiration for the problem (few mitigations) shall have a bad rank.

How would you compute that?

=> I’m looking for coverage metrics.

agota.daniel · May 30, 2026, 11:13pm

Check out The Metrics Manifesto by Richard Seiersen — tough but worth it. It introduces BOOM (Burndown, Arrival, Wait, Escape):

• Burndown — time‑to‑remediate. Track the distribution (median, percentiles, survival curve). Example: median patch time 72h → 24h after a process change. Time = risk.
• Arrival — rate/timestamps of new risks (vulns, alerts, new assets). Example: arrivals spike after a big feature/asset onboarding. Why: rising inflow can overwhelm capacity.
• Wait — time from arrival to work start (triage latency). Example: average wait 4h → 16h on Mondays. Why: long waits increase exposure and backlog.
• Escape — fraction of risky events that become incidents. Example: 0.5% of phishing attempts lead to compromise. Why: the ultimate KPI for control effectiveness.

You need timestamped events to build a life table (survival analysis) and a versioned history of your threat models to track trends. I thought about adding BOOM support to Risquanter (risquanter/register on GitHub). It tracks model versions, but implementing full BOOM metrics would still be non-trivial.
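
A minimal sketch of such a life table, assuming each threat record carries a discovery day and an optional remediation day. The record layout and the name `survival_curve` are illustrative only; they are not taken from the book or from Risquanter. It computes a Kaplan-Meier style estimate of the fraction of threats still open after t days, treating threats that are not yet remediated as censored at the observation day.

```python
from collections import Counter

def survival_curve(records, today):
    """Kaplan-Meier estimate of P(threat still open after t days).

    records: list of (discovered_day, fixed_day_or_None), days as integers.
    today:   observation day used to censor threats that are still open.
    Returns a list of (t, survival) pairs, one per duration with a fix.
    """
    fixes = Counter()     # duration -> threats fixed after that many days
    censored = Counter()  # duration -> threats still open at that age
    for discovered, fixed in records:
        if fixed is None:
            censored[today - discovered] += 1
        else:
            fixes[fixed - discovered] += 1

    at_risk = len(records)
    survival = 1.0
    curve = []
    for t in sorted(set(fixes) | set(censored)):
        if fixes[t]:
            survival *= 1 - fixes[t] / at_risk
            curve.append((t, survival))
        # every threat observed for exactly t days leaves the risk set
        at_risk -= fixes[t] + censored[t]
    return curve
```

The first t at which the survival value drops to 0.5 or below can serve as a median time-to-remediate for the Burndown metric.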

Johan_Sydseter · June 3, 2026, 6:20am

It may help to have a connection to an issue tracking system where the various dev teams keep their backlogs. You would get quite interesting stats from looking at the relationship between threats, backlog issues, status and time. To connect a model with an issue you could use labels; using the issue ID in the threat model will help too.

hewerlin · June 6, 2026, 3:14pm

Thank you for the inspiration, @agota.daniel and @Johan_Sydseter. Those sound like opportunities for interesting life cycle metrics.

The Arrival metric is closest to what I am looking for.

Let’s say I have 20 scoped threat models, some of them have some threats, some of them mitigations. How would I know how “far” / “complete” they are?

Back to Arrival… when nobody has spotted the threat yet, how would I know its arrival time? I need to somehow incorporate the known/unknown unknowns also…

hewerlin · June 6, 2026, 3:28pm

I’ve been thinking about the following:

For each threat model, estimate its final size (Small, Medium, Large) and set a parameter M accordingly (10, 20, 40).

For N threats,
set ThreatCoverage(N, M) = N / (N+M).

Problem: rewards quantity over quality. How would I know M? Will never be 100%.

For N threats where O have at least one mitigation,
set MitigationCoverage(N, O) = O / N.

Problem: rewards quantity over quality. Does not reward richness in mitigations or consider mitigation effectiveness.
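
A minimal sketch of the two ratios above, assuming a threat model is available as a mapping from threat ID to its list of mitigation IDs. That layout, the function names, and the behaviour for an empty model are assumptions made for illustration.

```python
def threat_coverage(n_threats, m):
    """N / (N + M): saturating score in [0, 1); equals 0.5 when N == M."""
    if n_threats < 0 or m <= 0:
        raise ValueError("need n_threats >= 0 and m > 0")
    return n_threats / (n_threats + m)

def mitigation_coverage(threats):
    """O / N: share of threats that have at least one mitigation.

    threats: dict mapping threat id -> list of mitigation ids.
    Returns 0.0 for a model without threats (a choice made for this sketch).
    """
    if not threats:
        return 0.0
    mitigated = sum(1 for mitigations in threats.values() if mitigations)
    return mitigated / len(threats)

# Hypothetical model: four threats, two of them with mitigations.
model = {"T1": ["M1"], "T2": [], "T3": ["M2", "M3"], "T4": []}
print(threat_coverage(len(model), m=10))
print(mitigation_coverage(model))
```

Note that T3 with two mitigations counts the same as T1 with one, which is the "does not reward richness" problem named above.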

What are your thoughts?

agota.daniel · June 6, 2026, 4:36pm

What is M supposed to mean here? Number of Mitigations? Or number of Mitigated threats? Something else?

If M is the number of mitigated threats in N / (N+M) and N is the number of threats, then N + M counts all threats once and then counts those that have mitigations a second time.

What do Low, Medium, High refer to? The threat model or the threats? Their impact or likelihood? I am not really sure based on the description… but basically this is a good example of why I avoid qualitative representation wherever I can: you will have a very hard time figuring out what the other person meant.

agota.daniel · June 6, 2026, 5:08pm

First of all I would bucket your threats so that you are tracking Low, Medium, High categories separately.

Each threat has a "discovery date". In most cases this is what you realistically know, unless you are able to tie it to a commit, a version, or a specific update which introduced it… then you would have a real "date of exposure". Each threat should also have its own ID. Then there is the date when it got mitigated. Depending on how much you want to complicate it, this can be the date when your team committed a patch or the date when that patch got deployed… I recommend the latter.

At this point you have for each threat an ID, when it was introduced (discovered) and fixed.

Ideally you have an SLA for fixing things, say 30 days for category M.

You iterate over your data and check for each day how many threats are open and how many got mitigated within the SLA. The proportion is what is of interest to you, I think. It will be a running number.

The data structure you need to track this is a "life table", and if you dump this on ChatGPT I think it will be able to produce PoC code for calculating it.

The book I linked has working (but a bit buggy) code in R. I would use an agent to analyse what it’s doing and translate it to your favourite language. You can clean up the result as a learning exercise
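
The day-by-day iteration described above could be sketched as follows. This is not the book's R code. The field names are hypothetical, only the 30 days for category M come from the post (the L and H values are placeholders), and the ratio used here, fixes within SLA divided by all fixes so far, is just one reading of "the proportion".

```python
from datetime import date, timedelta

# Only M = 30 comes from the post; L and H are placeholder values.
SLA_DAYS = {"L": 90, "M": 30, "H": 7}

def daily_sla_table(threats, start, end):
    """For each day, count open threats and the share of fixes that met the SLA.

    threats: list of dicts with keys id, severity, discovered, fixed
             (fixed is a date, or None while the threat is still open).
    Returns rows of (day, open_count, fixed_count, within_sla, ratio_or_None).
    """
    rows = []
    day = start
    while day <= end:
        open_count = fixed_count = within_sla = 0
        for t in threats:
            if t["discovered"] > day:
                continue  # not yet known on this day
            if t["fixed"] is None or t["fixed"] > day:
                open_count += 1
                continue
            fixed_count += 1
            if (t["fixed"] - t["discovered"]).days <= SLA_DAYS[t["severity"]]:
                within_sla += 1
        ratio = within_sla / fixed_count if fixed_count else None
        rows.append((day, open_count, fixed_count, within_sla, ratio))
        day += timedelta(days=1)
    return rows
```

Running it separately per severity bucket keeps the Low, Medium and High categories apart, as suggested at the start of the post.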

hewerlin · June 7, 2026, 6:55am

I think metrics should be between 0 (worst) and 1 (best). The M in ThreatCoverage(N,M) = N/(N+M) is just a parameter that helps turn ever-growing threat count into something that is in [0, 1[.

For M=1 and N = 0, 1, 2, …, this would be 0, 1/2, 2/3, 3/4, 4/5, 5/6, 6/7, …

M is the value of N that will result in 50%.

See also the Wolfram|Alpha plot of n/(n+10) from 0 to 50.

Example: Two threat models.
One with small scope => “There should be some threats”. There are 5. => ThreatCoverage(5, 5=“small”) = 5/(5+5) = 50%
One with large scope => “There should be a lot of threats”. There are 5. => ThreatCoverage(5, 20=“large”) = 5 / (5+20) = 20%

hewerlin · June 7, 2026, 7:00am

Yes, I think that metric is a good idea, too.

SLARespecting(threat) = min{1, SLA(severity(threat)) / [TimeFixed(threat) - TimeDiscovered(threat)] }

If TimeFixed(threat) is unset, treat the threat as part of a separate group and assume TimeFixed(threat) = today.
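
A minimal sketch of SLARespecting, assuming dates are `datetime.date` values and the SLA is given in days. The function signature and the handling of a zero-day duration are assumptions added here; the formula itself does not define that case.

```python
from datetime import date

def sla_respecting(discovered, fixed, sla_days, today):
    """min(1, SLA / (TimeFixed - TimeDiscovered)); open threats run up to today.

    Returns (score, is_open) so open threats can be reported as their own group.
    """
    is_open = fixed is None
    end = today if is_open else fixed
    elapsed = (end - discovered).days
    if elapsed <= 0:
        # Fixed (or observed) on the discovery day: avoid dividing by zero.
        return 1.0, is_open
    return min(1.0, sla_days / elapsed), is_open

# Hypothetical threat with a 30-day SLA, fixed after 60 days.
print(sla_respecting(date(2026, 1, 1), date(2026, 3, 2), 30, today=date(2026, 6, 7)))
```

For an open threat the score keeps decaying every day it stays unfixed, so the same threat will score lower tomorrow than today.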

agota.daniel · June 10, 2026, 7:30pm

Ah cool, now I understand it. So you basically want a metric that tells you “for this big TM X threats were found. How plausible is that?”

I think that’s genuinely useful. I would think about the following though:

• Why not simply use a property of the TM instead of fixing a specific M value per category, such as M = 10 for “small”? You could use the number of components to set M. I am not sure it is easy to say what a “component” is, but when confronted with the question of whether this is a small or medium model, you would probably fall back on counting something for orientation anyway.
• I think this metric is inherently not linear: an increase from 0.5 → 0.6 means something different than 0.8 → 0.9. Put differently: how many additional threats you need for a bump of 0.1 depends on both your M and the number of threats you start from.
• I am not sure I would make this user-facing: one tends to get results for what gets measured. If you measure the number of threats, people will come up with more threats…

hewerlin · June 10, 2026, 9:20pm

Re: M ~ Components

Agree!

Re: Nonlinear

I think the non-linear aspect is a good thing. We can interpret N / (N+M) as: “How much gain would M more create, compared to the N we have now?”. Going from 3 to 10 is a totally different story than going from 100 to 107, although both are just +7.

The alternative [0, 1] metric would be Coverage2(N, M) = min{1, N/M}. That would not reward any growth beyond N = M.
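
To make the difference between the two candidates concrete, here is a small comparison sketch. The function names and the choice of M = 10 are assumptions for illustration only.

```python
def saturating(n, m):
    """N / (N + M): keeps growing toward 1 but never reaches it."""
    return n / (n + m)

def capped(n, m):
    """min(1, N / M): linear up to N = M, then flat at 1."""
    return min(1.0, n / m)

def compare(m, counts):
    """Return (n, saturating, capped) rows, rounded for display."""
    return [(n, round(saturating(n, m), 2), round(capped(n, m), 2)) for n in counts]

for row in compare(10, [0, 5, 10, 20, 40]):
    print(row)
```

With M = 10, going from 3 to 10 threats raises the saturating score by about 0.27, while going from 100 to 107 raises it by only about 0.005. The capped variant already sits at 1.0 from N = 10 onward and cannot tell 20 threats from 40.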

Re: Visible?

I need something so that users can quickly see how far along the TMs are and where work remains to be done. How else would you solve that without a coverage/progress metric?

hewerlin · June 23, 2026, 5:39pm

New approach:

I’m trying to answer…

How many questions were answered?

How many questions are still unanswered?

Estimate: How many questions are likely to appear as follow-up questions? (e.g. answering “What can go wrong?” will spawn follow-up questions for each threat)

A question is something that a user will have to answer.

That should give some good percentages…
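
One possible way to turn these three counts into a percentage, as a rough sketch. The formula, the parameter `followups_per_answer`, and the result for a model with no questions at all are assumptions made here, not something settled in the thread.

```python
def question_progress(answered, unanswered, followups_per_answer=0.0):
    """answered / (answered + unanswered + expected follow-ups).

    followups_per_answer is a hypothetical estimate of how many new questions
    each still-unanswered question spawns once it gets answered.
    """
    if answered < 0 or unanswered < 0 or followups_per_answer < 0:
        raise ValueError("counts and rate must be non-negative")
    expected_followups = unanswered * followups_per_answer
    total = answered + unanswered + expected_followups
    # A model with no questions at all is scored 0.0 (a choice for this sketch).
    return 0.0 if total == 0 else answered / total

# Hypothetical model: 6 answered, 2 open, each open one likely spawning 1 more.
print(question_progress(6, 2, followups_per_answer=1.0))
```

Unlike N / (N+M), this score can reach 100% once every known question is answered and no follow-ups are expected.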
