Applications in Sales & Business

Why sales is the ecosystem most exposed to Goodhart

Every function in a company is measured, but sales combines three traits that make it the #1 risk zone for Goodhart:

Fine measurability: every call, every email, every deal is tracked to the second in the CRM.
Indexed bonuses: 30 to 60% of compensation depends on KPIs.
Information asymmetry: the SDR knows what was said on the call; the manager only sees the counter.

Result: as soon as a KPI becomes a bonus, optimization starts within 30 days. That is why an experienced rep reads your new commission plan the way a developer reads the spec of a security flaw — looking for the shortest path.

Case #1: the "number of calls" KPI

Typical context: sales leadership who believes "more calls = more pipe". Sets 80 calls/day per SDR as a KPI.

What happens in 4 weeks:

Week	Observed behavior	Goodhart variant
W1	SDRs call their priority accounts	None
W2	They start hanging up after 5 sec on dubious accounts	Adversarial
W3	They dial then hang up immediately ("calls 0:01")	Adversarial
W4	They call their own personal number to finish the quota	Adversarial

How to diagnose:

Call duration distribution — abnormal spike on 0–10 seconds
Call/booked-meeting correlation collapsing (from 0.7 to 0.1)
Unique-calls/total ratio dropping (same numbers redialed)

Antidote:

Main KPI = qualified meetings booked; secondary KPI = calls (capped at 100/day to discourage pure gaming)
Quality audit: 5 calls listened to per SDR/week, blind-scored by a coach

Case #2: the "MRR" KPI

Typical context: B2B SaaS that sets signed MRR as the AEs' main KPI.

What happens in 6 months:

graph LR
    A[KPI: signed MRR] --> B[AEs push annual contracts]
    B --> C[Aggressive discounts T-1 day from month-end]
    C --> D[Customers not really qualified]
    D --> E[Massive churn at renewal]
    E --> F[NRR collapses 6-12 months later]
    style F fill:#ffcdd2

The temporal trap: the MRR KPI is immediate, churn is delayed. The AE who games the system gets promoted before the result lands.

Antidote:

Co-main KPI: MRR at 6 months post-signature (excludes fast churners)
Clawback: 50% of commission if churn within 90 days
Qualitative tracking: 3 deals/AE/quarter reviewed by RevOps with a quality score

Case #3: the "NPS" KPI

Typical context: CS leadership that turns NPS into a monthly bonus.

What happens:

CSMs select customers surveyed (hostile customers "not available")
They brief customers: "if all is well, give 9 or 10"
They trigger the survey right after a success, never after an incident
NPS goes from 32 to 71 in 3 quarters — without any product improvement

Diagnosis:

Abnormally high response rate (>40%)
Bimodal distribution flattening
Decorrelation NPS / actual churn

Antidote:

Automated random sampling on 100% of base (no cherry-picking possible)
Measure delayed NPS (90 days after contact, not immediately)
Cross-reference with an external qualitative score (quarterly call review)

Case #4: "MQLs"

Typical context: marketing paid on the number of MQLs delivered to sales.

What happens:

The MQL definition silently widens
Scoring thresholds drop
Most leads are actually junk
Sales waste 40% of their time on unqualified MQLs
The marketing/sales relationship deteriorates silently

Antidote:

Marketing KPI = SQL accepted + not rejected within 7 days, not MQL
MQL definition frozen by a tripartite committee (marketing, sales, RevOps)
Quarterly audit: 50 MQLs blind-reviewed, true-positive rate tracked

Case #5: NRR (Net Revenue Retention)

Typical context: SaaS steering on NRR as North Star KPI.

What happens — extremal variant:

In normal range, NRR ≈ customer health
At extremes (large accounts), correlation breaks
CSMs concentrate 80% of their time on the top 10 accounts (NRR explodes) while the mid-market churns massively
18 months later, average NRR remains high but concentration risk has become deadly

This is Extremal Goodhart: the metric works at the center of the distribution, not at the extremes — exactly where optimization pushes.

Antidote:

Decompose NRR by segment (logo, mid-market, enterprise)
Cap enterprise NRR in the global calculation (cap at 130% per account)
Track the distribution of NRRs, not just the mean

The inverted KPI: the "anti-Goodhart" metric by construction

A powerful technique (popularized by Andy Grove at Intel, High Output Management, 1983): for each offensive KPI, create a defensive KPI.

Offensive KPI	Defensive KPI (anti-Goodhart)
Number of SDR calls	Meeting-booked rate per call
Signed MRR	NRR at 6 months
NPS	Delayed NPS + response rate
MQLs delivered	Acceptance rate by AEs
Closed tickets (CS)	Reopen rate within 14 days
Jira velocity	Bug rate post-release

The defensive KPI is structurally immune to gaming the offensive KPI — gaming one degrades the other.

The "no bonuses on metrics < 6 months" rule

Empirical conclusion of multiple studies (Larkin, Strategic Management Journal, 2014): any metric with a feedback loop shorter than 6 months is massively gamed. Metrics that resist have long feedback loops (12-month retention, LTV, NRR, reorder rate).

For a VP Sales, this means a hard rule:

80% of bonus on long-loop metrics, 20% on quick pipeline metrics.

This inverts most teams' practice (90% short-term).

Diagnostic: is your organization infected by Goodhart?

Diagnostic test in 7 questions. Count your "yes":

Do you have a single North Star KPI communicated each month?
Does more than 50% of bonus depend on fewer than 3 metrics?
Have the metrics been stable for the past 18 months?
Do seniors explain in meetings how they hit their numbers?
Has an "input/output" ratio of a team changed abruptly after a KPI change?
Are metrics self-reported (CRM filled by the person being evaluated)?
Is there at least one monthly qualitative audit on the reported numbers?

5+ "yes" score: your organization is probably already infected at level 2-3.

Synthesis for a sales leader

Three actions this week:

List your 3 main KPIs and identify the most likely Goodhart variant for each
Create a defensive KPI for each (the anti-gaming mirror)
Schedule a monthly qualitative audit: 5 calls/CSM, 5 deals/AE, 50 MQLs blind-reviewed

The next chapter addresses the AI version of the problem — reward hacking in modern LLMs.