Ascentra -- Intelligence Profile -- The Caliper Lab

Intelligence Profile

Ascentra

AI-native primary research platform for private markets. Built for PE deal teams and management consulting firms who need to design, field, and synthesise customer surveys on commercial due diligence projects -- in hours rather than weeks, without rebuilding the same survey infrastructure from scratch every time.

Primary Research AI Voice Agent Fielding Survey Design AI CDD and VoC Top-5 Consulting Firms Pre-Seed -- NAP

Rich coverage

Q1 2026 -- Run #1
180 studies evaluated -- CaliperSurvey-v1

Methodology note: Survey quality evaluation combines automated scoring (question logic, skip patterns, quota structure) with expert panel assessment from two former MBB survey researchers rating questionnaire craft and insight synthesis quality. Frontier baseline is GPT-5.4 prompted with detailed survey design instructions and asked to produce equivalent output. Voice interview quality is assessed separately via transcript analysis.

Q3 2025

Q4 2025

Q1 2026

Q2 2026

Capability Assessment Independent -- Q1 2026

Ascentra is the most narrowly scoped product in the Lab's current coverage -- and deliberately so. It automates a specific slice of PE consulting work: survey design, voice interview fielding, and insight synthesis on commercial due diligence projects. The narrowness is the strategy, and the benchmark reflects it.

Where the product leads

On survey questionnaire design quality -- the first and most consequential step in primary research -- Ascentra outperforms GPT-5.4 prompted with detailed survey design instructions by 11.2 points on the Lab's expert rubric. The AI agent trained on senior-consultant survey craft produces screener logic, quota structures, and skip patterns that are consistently closer to professional research standards than general-purpose prompting. The structural advantage is institutional knowledge baked into the model: Ascentra has trained on how MBB survey guides are actually constructed, which frontier prompting does not replicate without equivalent institutional context.

Questionnaire design quality score: 86.4 vs. 75.2 for GPT-5.4 with explicit instructions -- an 11.2 point lead on the expert panel rubric.
Survey logic accuracy -- screeners, skip patterns, quota structure: 91.8% correct on the benchmark study set, above the category average of 82%.
Insight synthesis accuracy -- do dashboard outputs correctly represent underlying response data: 88.3%, above the 79% category average. Open-response theme clustering is the strongest individual component.

The frontier question

The frontier is improving at 2.4 points per quarter on structured questionnaire generation tasks -- slower than on document extraction or synthesis tasks. Ascentra's 11.2 point lead on survey design quality gives a compression timeline of approximately four to five quarters at current velocity. The more durable differentiator is the institutional knowledge layer: the model trained on how private equity commercial diligence surveys are actually constructed. That training data does not exist in frontier model pretraining and cannot be replicated with prompting alone. Voice interview fielding at scale also has no frontier equivalent at the cost point Ascentra operates at.

Frontier velocity on survey generation: +2.4 pts per quarter. Slowest compression rate across the Lab's current coverage.
Voice agent fielding infrastructure has no frontier model equivalent. Frontier models can generate survey questions; they cannot call respondents.

Decision implication

For PE deal teams and consulting firms, the relevant question is whether Ascentra makes commercial due diligence surveys materially faster and better without sacrificing quality. The combination of above-average questionnaire design, accurate survey logic, and conversational insight synthesis suggests it does on the dimensions the Lab can test. The adoption signal -- three of the world's top five consulting firms at pre-seed stage -- is unusually strong commercial validation for the product's maturity. The 30+ hours per project time saving claim is consistent with the automation scope: design, programming, fielding coordination, and analysis are all steps that take significant time manually and are all addressed by the platform.

What the data does not yet cover

Voice interview quality -- transcript accuracy, probe follow-up quality, respondent experience -- requires a separate live fielding evaluation protocol. The benchmark covers insight synthesis from transcripts, not the transcript generation itself.
Complex research designs -- conjoint analysis, max-diff, advanced quotas -- are not represented in the current benchmark study set. Standard CDD survey types are the basis for current scores.
The 5x faster claim and 60-80% time saving claims are efficiency claims not verifiable through quality benchmarking. They are plausible given the automation scope and consistent with practitioner testimonials on the website.
Panel signal is based on 14 practitioners, all management consulting. PE deal team users require a separate panel cohort.

Benchmark Scorecard vs. GPT-5.4 (survey-prompted) -- 180 studies evaluated

Expert panel rubric from two former MBB survey researchers + automated logic scoring. Higher score = closer to professional research standards.

Ascentra

Frontier (GPT-5.4)

Formula generation from natural language L1

91.4vs93.8-2.4

Error detection -- logical correctness L2

94.2vs95.1-0.9

Scenario and sensitivity build L3

82.7vs89.4-6.7

Cross-sheet model restructuring L4

67.3vs81.4-14.1

Analytical judgment and assumption-setting L5

54.1vs73.2-19.1

Vendor Claim Verification Source: ascentralabs.ai and public statements

"30+ hours saved per project"

plausible -- not independently timed The automation scope -- survey design (1-2 days manual), programming queue (1-2 days), fielding coordination, and analysis -- totals well over 30 hours on a standard commercial due diligence survey. The claim is directionally consistent with the automation scope. Independent time-motion verification against a controlled manual baseline would be required to confirm the specific figure.

"5x faster vs. legacy solutions"

plausible -- not independently timed Consistent with the 2-4x faster execution reported by early adopters in public statements and the automation scope described above. "Legacy solutions" is not defined -- comparison against manual process versus traditional survey platforms produces different multipliers. Directionally credible; specific multiplier requires defined comparison basis.

"Three of the world's top five consulting firms"

not independently verified This is the strongest commercial signal in Ascentra's public positioning. The claim is plausible given the founder profiles (ex-McKinsey QuantumBlack) and the product's specific fit with MBB commercial due diligence workflows. Not independently verifiable at this stage -- the Lab has no mechanism to confirm which firms are active users.

Frontier intelligence

Frontier baseline -- GPT-5.4

78.5

Weighted avg -- survey design and synthesis rubric

Frontier velocity

+2.4 pts / qtr

Survey generation tasks -- slowest in coverage

Design quality lead runway

4 to 5 qtrs

Longest frontier gap runway in current Lab coverage

Frontier compression is slowest for survey design tasks. Ascentra's durable advantage is institutional knowledge: the model trained on how MBB CDD surveys are actually constructed. That training context does not exist in frontier pretraining and cannot be replicated through prompting alone.

Practitioner signal n=14 -- management consulting

Output acceptance rate

84% +12pp

Verify before use

38% -11pp

Workflow abandonment

4% flat

Trust trajectory

Strong

Top correction type

Strategic synthesis depth

84% acceptance is the highest in the Lab's current coverage. Sharply declining verification rate -- practitioners accept survey outputs with minimal review -- signals strong trust in the design and logic quality. Consistent with the questionnaire design score lead over the frontier.

Score trajectory Ascentra weighted avg score

Higher bar = stronger performance vs. frontier

----Q1 26

71.4Q3 2025

76.8Q1 2026

Methodology

Dataset

CaliperSurvey-v1 -- 180 studies

Baseline

GPT-5.4 survey-prompted (Mar 2026)

Scoring L1-L2

Automated logic scoring + F1

Scoring L3-L5

2 ex-MBB survey researchers -- structured rubric

Ground truth

Expert-constructed -- kappa 0.84

Run date

29 March 2026

Representative profile for discussion -- all scores and findings are illustrative, based on the Lab's published methodology applied to Ascentra's publicly stated capabilities. Voice interview quality requires a live fielding evaluation protocol currently in development. Full benchmark data will be published upon completion of the formal evaluation programme. thecaliperlab.com