Applications in Sales & Business
Why sales is the ecosystem most exposed to Goodhart
Every function in a company is measured, but sales combines three traits that make it the #1 risk zone for Goodhart:
- Fine measurability: every call, every email, every deal is tracked to the second in the CRM.
- Indexed bonuses: 30 to 60% of compensation depends on KPIs.
- Information asymmetry: the SDR knows what was said on the call; the manager only sees the counter.
Result: as soon as a KPI becomes a bonus, optimization starts within 30 days. That is why an experienced rep reads your new commission plan the way a developer reads the spec of a security flaw — looking for the shortest path.
Case #1: the "number of calls" KPI
Typical context: sales leadership who believes "more calls = more pipe". Sets 80 calls/day per SDR as a KPI.
What happens in 4 weeks:
| Week | Observed behavior | Goodhart variant |
|---|---|---|
| W1 | SDRs call their priority accounts | None |
| W2 | They start hanging up after 5 sec on dubious accounts | Adversarial |
| W3 | They dial then hang up immediately ("calls 0:01") | Adversarial |
| W4 | They call their own personal number to finish the quota | Adversarial |
How to diagnose:
- Call duration distribution — abnormal spike on 0–10 seconds
- Call/booked-meeting correlation collapsing (from 0.7 to 0.1)
- Unique-calls/total ratio dropping (same numbers redialed)
Antidote:
- Main KPI = qualified meetings booked; secondary KPI = calls (capped at 100/day to discourage pure gaming)
- Quality audit: 5 calls listened to per SDR/week, blind-scored by a coach
Case #2: the "MRR" KPI
Typical context: B2B SaaS that sets signed MRR as the AEs' main KPI.
What happens in 6 months:
graph LR
A[KPI: signed MRR] --> B[AEs push annual contracts]
B --> C[Aggressive discounts T-1 day from month-end]
C --> D[Customers not really qualified]
D --> E[Massive churn at renewal]
E --> F[NRR collapses 6-12 months later]
style F fill:#ffcdd2
The temporal trap: the MRR KPI is immediate, churn is delayed. The AE who games the system gets promoted before the result lands.
Antidote:
- Co-main KPI: MRR at 6 months post-signature (excludes fast churners)
- Clawback: 50% of commission if churn within 90 days
- Qualitative tracking: 3 deals/AE/quarter reviewed by RevOps with a quality score
Case #3: the "NPS" KPI
Typical context: CS leadership that turns NPS into a monthly bonus.
What happens:
- CSMs select customers surveyed (hostile customers "not available")
- They brief customers: "if all is well, give 9 or 10"
- They trigger the survey right after a success, never after an incident
- NPS goes from 32 to 71 in 3 quarters — without any product improvement
Diagnosis:
- Abnormally high response rate (>40%)
- Bimodal distribution flattening
- Decorrelation NPS / actual churn
Antidote:
- Automated random sampling on 100% of base (no cherry-picking possible)
- Measure delayed NPS (90 days after contact, not immediately)
- Cross-reference with an external qualitative score (quarterly call review)
Case #4: "MQLs"
Typical context: marketing paid on the number of MQLs delivered to sales.
What happens:
- The MQL definition silently widens
- Scoring thresholds drop
- Most leads are actually junk
- Sales waste 40% of their time on unqualified MQLs
- The marketing/sales relationship deteriorates silently
Antidote:
- Marketing KPI = SQL accepted + not rejected within 7 days, not MQL
- MQL definition frozen by a tripartite committee (marketing, sales, RevOps)
- Quarterly audit: 50 MQLs blind-reviewed, true-positive rate tracked
Case #5: NRR (Net Revenue Retention)
Typical context: SaaS steering on NRR as North Star KPI.
What happens — extremal variant:
- In normal range, NRR ≈ customer health
- At extremes (large accounts), correlation breaks
- CSMs concentrate 80% of their time on the top 10 accounts (NRR explodes) while the mid-market churns massively
- 18 months later, average NRR remains high but concentration risk has become deadly
This is Extremal Goodhart: the metric works at the center of the distribution, not at the extremes — exactly where optimization pushes.
Antidote:
- Decompose NRR by segment (logo, mid-market, enterprise)
- Cap enterprise NRR in the global calculation (cap at 130% per account)
- Track the distribution of NRRs, not just the mean
The inverted KPI: the "anti-Goodhart" metric by construction
A powerful technique (popularized by Andy Grove at Intel, High Output Management, 1983): for each offensive KPI, create a defensive KPI.
| Offensive KPI | Defensive KPI (anti-Goodhart) |
|---|---|
| Number of SDR calls | Meeting-booked rate per call |
| Signed MRR | NRR at 6 months |
| NPS | Delayed NPS + response rate |
| MQLs delivered | Acceptance rate by AEs |
| Closed tickets (CS) | Reopen rate within 14 days |
| Jira velocity | Bug rate post-release |
The defensive KPI is structurally immune to gaming the offensive KPI — gaming one degrades the other.
The "no bonuses on metrics < 6 months" rule
Empirical conclusion of multiple studies (Larkin, Strategic Management Journal, 2014): any metric with a feedback loop shorter than 6 months is massively gamed. Metrics that resist have long feedback loops (12-month retention, LTV, NRR, reorder rate).
For a VP Sales, this means a hard rule:
80% of bonus on long-loop metrics, 20% on quick pipeline metrics.
This inverts most teams' practice (90% short-term).
Diagnostic: is your organization infected by Goodhart?
Diagnostic test in 7 questions. Count your "yes":
- Do you have a single North Star KPI communicated each month?
- Does more than 50% of bonus depend on fewer than 3 metrics?
- Have the metrics been stable for the past 18 months?
- Do seniors explain in meetings how they hit their numbers?
- Has an "input/output" ratio of a team changed abruptly after a KPI change?
- Are metrics self-reported (CRM filled by the person being evaluated)?
- Is there at least one monthly qualitative audit on the reported numbers?
5+ "yes" score: your organization is probably already infected at level 2-3.
Synthesis for a sales leader
Three actions this week:
- List your 3 main KPIs and identify the most likely Goodhart variant for each
- Create a defensive KPI for each (the anti-gaming mirror)
- Schedule a monthly qualitative audit: 5 calls/CSM, 5 deals/AE, 50 MQLs blind-reviewed
The next chapter addresses the AI version of the problem — reward hacking in modern LLMs.