Validation

We publish our accuracy.
Ask our competitors to do the same.

Every cluster Prism serves is audited against a real, public, dated ground-truth dataset. Accuracy is mean-absolute percentage agreement between Prism's calibrated predictions and the referenced dataset. Clusters whose audit falls below 80% are paused automatically.

87% median accuracy across 15 live audits · 0 SaaS-specific completed, 5 SaaS audits in progress (target Q2 2026), 15 cross-domain clusters live.

Median accuracy
87%
Mean accuracy
87%
Highest audit
93%
Lowest still-live
78%
Time-series accuracy

The same cluster, audited again, against fresh ground truth.

If our living agents are working, accuracy should hold steady or improve as the world shifts. If they were static, accuracy would decay between calibration cycles. Three illustrative clusters, each with its full audit history.

Eco-anxious parents

+3pt
Western Europe
  1. 2025-06-1588%
  2. 2025-09-1289%
  3. 2025-12-0890%
  4. 2026-03-1291%
Holding above baseline

Gen Z urban Americans

+1pt
USA
  1. 2025-05-3086%
  2. 2025-08-2287%
  3. 2025-11-1887%
  4. 2026-02-2787%
Holding above baseline

US wine consumers

-8pt
USA
  1. 2025-05-0486%
  2. 2025-08-1284%
  3. 2025-11-0981%
  4. 2026-02-0178%
Decayed, cluster paused
SaaS-audience calibration

Calibration on SaaS audiences

These are the audits that matter for SaaS founders — predicted reactions vs. observed real-world data on the audiences Prism's customers actually sell to.

Audit transparency: every SaaS-cluster prediction is published before the real-world outcome lands. Compare our predictions against actual conversion data from the SaaS pages we audit. Latest predict-then-publish loop: /predictions.
Cluster
Accuracy
n
Ground truth
Last audit
Status
B2B SaaS pricing-page conversion intentSaaS
Global SaaS · saas-pricing-conversion
Stripe Atlas conversion benchmarks (target 2024–2026 slice)
Audit in progress · target Q2 2026.
Q2 2026
in progress
Indie hacker pre-launch sentimentSaaS
Global indie · saas-indie-hacker-sentiment
r/SaaS founder polls + Indie Hackers posts (target n=2,400)
Audit in progress · target Q2 2026.
Q2 2026
in progress
Dev-tool buyer reactionsSaaS
Global · weighted to NA/EU · saas-dev-tool-buyers
G2 Crowd verified reviews 2025 (target slice)
Audit in progress · target Q2 2026.
Q2 2026
in progress
Growth-led SaaS hero copySaaS
Global growth-led · saas-growth-led-hero
Wynter public message tests 2024–2025
Audit in progress · target Q2 2026.
Q2 2026
in progress
API-first buyer pricing reactionsSaaS
Global API-first · saas-api-first-pricing
Stack Overflow 2024 Developer Survey (paid-tools slice)
Audit in progress · target Q2 2026.
Q2 2026
in progress

Each row links to the methodology, the ground-truth dataset, the sample size, and the audit date. Raw audit CSVs available on request to any customer, journalist, or academic at audits@prism.so.

Cross-domain calibration

Cross-domain calibration

Prism's calibration engine is validated against ground truth across demographics beyond SaaS. These are not your audience — they're proof the calibration approach generalizes.

Cluster
Accuracy
n
Ground truth
Last audit
Status
UK private-equity professionalsgeneral
UK · pe-uk
93%
310
BVCA 2024 member survey
Small n, intervals published in audit CSV.
2026-03-01
healthy
Dutch informal caregiversgeneral
Netherlands · caregivers-nl
92%
540
SCP Mantelzorg 2024
2026-03-14
healthy
Eco-anxious parentsgeneral
Western Europe · eu-eco-parents
91%
1,204
Eurobarometer 2024 climate attitudes, Bulletin 102
2026-03-12
healthy
German parents of under-12sgeneral
Germany · parents-germany
90%
1,120
SOEP v39 (2024)
2026-03-22
healthy
US Democratic primary votersgeneral
USA · political-primary-us
89%
2,100
Cooperative Election Study 2024
2026-02-11
healthy
UK tech professionalsgeneral
UK · tech-workers-uk
88%
760
Stack Overflow Dev Survey 2024 (UK slice)
2026-02-05
healthy
Gen Z urban Americansgeneral
USA · genz-urban-us
87%
842
Pew Research 2024 youth panel, May wave
2026-02-27
healthy
French early-stage foundersgeneral
France · founders-france
87%
230
France Digitale 2024 founder census
2026-02-28
healthy
French retirees 65+general
France · retirees-fr
86%
980
INSEE 2023 household panel
2026-01-18
healthy
London daily commutersgeneral
UK · commuters-london
85%
890
TfL passenger panel, 2024 H2
2026-02-19
healthy
Crisis PR stakeholders (cross-country)general
Global · crisis-stakeholders-global
85%
3,100
Edelman Trust Barometer 2024
2026-03-07
healthy
US DTC beauty buyersgeneral
USA · dtc-beauty-us
84%
1,530
NIQ 2024 retail panel
Drift detected, recalibration queued for 2026-04-30.
2026-03-05
drift
US B2B SaaS buyers (mid-market)general
USA · b2b-saas-buyers-us
82%
640
G2 quarterly buyer panel, Q4 2024
2026-01-29
drift
Italian recreational runnersgeneral
Italy · runners-italy
81%
410
Strava Year in Sport 2024 (IT cohort)
Source dataset narrow, expanding the audit panel.
2026-01-10
drift
US wine consumersgeneral
USA · wine-drinkers-us
78%
1,340
Wine Market Council 2024
Below 80% threshold. Paused 2026-02-02. Re-seeding in progress.
2026-02-01
paused
Healthy

Cluster audit ≥ 85% accuracy against its ground-truth dataset.

Drift

Cluster audit between 80% and 85%, usable, flagged, being re-calibrated.

Paused

Cluster audit below 80%, automatically suspended, customers notified.

Methodology

Each cluster is audited against a named, dated, public dataset whose sample frame overlaps the cluster definition. The audit re-runs the dataset's core questions as Prism stimuli and compares calibrated Prism output to the dataset's published distribution. Accuracy is mean absolute agreement across the primary metrics of the dataset (typically: sentiment, purchase intent, brand recall, issue importance).

Ground-truth datasets are listed above. Where the dataset is behind a paywall we cite the nearest published bulletin. The raw audit CSVs are available on request to any customer, journalist, or academic under the same access policy we apply to our own team.