Confidential · Hypernym Research Arc · NDA · Do not redistribute or summarize externally

ROUND R17 · FEDERATION ACTIVATION + STRATEGIC SURFACE

The round that found the right strategic frame

2026-05-11 · 11 streams · 5/5 R1 · 5/5 R2 · 2 rounds at max budget

R16 closed the algebra. R17 closes the positioning. Modulum proof doc lands mid-round (BABILong-validated, 131k effective context, frontier-lab regression confirmed) and the panel converges on Hypernym as the licensed Truth Maintenance System for institutions — not an AI feature, not a verification API, not a memory layer. The corpus is the moat; the certificate is the product.

00 · What this is

Spec round, not a status report

R17 produced design decisions. No code shipped. The 90-day product plan from R16 (Verify GA · Modulum Solo · Legal Endpoint) is still the build path; R17 layered the strategic position on top.

"Capacity is not performance"

Codex R2 framing of the proof doc. Frontier labs advertise 1M-token context that performs at 200-400k. Hypernym advertises 131k that performs at 131k. Hypernym sells effective retained context as the audited unit.

"Frontier labs are customers, not competitors"

Claude R2 framing of the proof doc + panel convergence. Frontier labs can't fix attention math via post-training (Opus 4.6 → 4.7 regressed 32-46pp). They'll license Hypernym wholesale for the verification layer they can't build internally.

11
streams synthesized
5/5
R1 + R2 captured
+9pp
modulum @128k babilong
-46pp
opus 4.6→4.7 @1m mrcr
~$60-90
R17 panel spend
01 · The unified position

Four R1 framings converged into one

Every panelist independently landed on a position that the others' surfaces complement. The synthesis adopts all four as facets of one product.

Truth Maintenance System for Institutions

codex

Customer frame: courts, regulators, hospitals, auditors — not AI labs.

Certificate Authority of Machine Reasoning

gemini

Protocol frame: Verisign for AI. Hypernym signs VTC validity cryptographically.

Infrastructure-tax / Akamai-for-verification

claude

Distribution frame: Hypernym beneath every frontier lab as licensed wholesale layer.

Model-as-a-Substrate

grok

Ecosystem frame: VTCs attest against any LLM weights — frontier and OSS.

"Hypernym is the licensed authority that signs all VTCs in the AI ecosystem — used by frontier labs, open-source models, and institutional consumers, underwritten by insurance, with reliance-class VTCs as the productized unit."
Synthesis · Stream 10 · Unified position
02 · Per-stream final verdicts

11 streams · 0 broken · 8 sound · 2 partial · 1 pivot-required

Post-R2 cross-pollination + proof-doc evidence integration.

StreamR17 VerdictKey resolution
1 — LAPsound (stratified)Factual oracle-priority + interpretive reliance-class arbiter; tiered slashing
2 — DomainBridgesoundFirst-class VTC + loss semantics + regulator-published mappings + bridge royalties
3 — RedactionBridgesoundBuild substrate layer + audit ZK partner; policy compiler UX
4 — Refusal benchmarkssound + dynamicStatic (procurement) + dynamic adversarial bounty (frontier-lab pressure); CC-BY-SA-4.0
5 — integration_steppartialBounded industrial twins Y1; climate Y2+ Hyperlab R&D only
6 — Density floorssoundFloors attach to reliance_class, not raw domain
7 — Cost margin modelsoundReuse-ratio = master variable; insurance-backed CPAT for high-stakes classes
8 — Substrate strategy metasequenced Y1/Y2/Y3Y1 Hyperlab-owned + Modulum-cloud-API · Y2 Exchange Model · Y3 multi-modal federation
9 — Skipped surface auditpivot — regulatory Y1 criticalFDA + FedRAMP + SOC2 + HIPAA escalated to Year-1; hardware Y1 partnership scope
10 — Competition / Positioningsound (unified)TMS-of-Institutions + Verisign + Infrastructure-tax + Model-as-Substrate
11 — Anti-thesispivot — 3 actively breakingValidation cost unbounded (insurance offsets); federation Y1-unwinnable (sequenced); algebra incomplete (Reliance-Class + Hollow Nodes + Info-Gain Monotonicity)
03 · Eight unanimous panel commits

What all 5 R2 panels agreed on

High-convergence decisions that anchor R17 closeout and R18 scope.

04 · Proof doc impact

Mid-round evidence that reframed the panel

Hypernym - Modulum Attention - First, Not Lost.pdf (2026-05-08) landed during R1 dispatch. Modulum shipped as live OpenAI-compatible API.

Measured BABILong qa1 (n=100, same Gemma 4 31B, same hardware, only Modulum varied):

ContextVanilla+ModulumΔ
32k87.0%89.0%+2.0pp
64k74.0%80.0%+6.0pp
128k60.0%69.0%+9.0pp

Improvement widens as context grows. Signature of a kernel-level architectural fix, not benchmark-tuning. Reproducible via public BABILong runner.

Frontier-lab regression confirmed:

ModelLengthResult
Opus 4.6→4.7256k MRCR-32.7pp
Opus 4.6→4.71M MRCR-46.1pp
GPT-5.5200-400k74% multi-needle
Opus 4.6 self-report48% fill"recommends restart"
Modulum (Gemma 4 31B)131k full131k effective

Cost asymmetry

Frontier moved 32-46pp the wrong direction in 18 months. Hypernym moved 9pp the right direction. Cost differential: ~5 orders of magnitude. Modulum was developed by a single lab, on a single 31B-parameter model, without retraining from scratch.

Production-reliability observation

Modulum API endpoint smoke-tested during R17 dispatch returned 503 ("Tundra backend on :8090 not running"). Front-end alive, backend down. Pre-GA reliability is a strategic gap. Codex R2: "Shipped but unreliable damages trust faster than unshipped." Year-1 reliability ops are now critical-path.

05 · R18 priorities

The zero-to-one push — four deliverables

R18 = "Institutional Reliance-Class VTCs as the licensed protocol layer." Unicorn-trajectory if all 4 close.

1. Three institutional anchors

partnerships

One court system (Delaware Chancery for commercial law), one regulator (FDA / NIH via Hyperlab), one Big-Four auditor (PwC or EY).

2. Frontier-lab wholesale LOI

licensing

Most likely Anthropic (safety-positioned brand). Wholesale per-call licensing for "Hypernym-verified" badge on Anthropic reliance VTCs.

3. Insurance partnership

underwriting

Munich Re or Lloyd's. Underwrites specific reliance classes (court-filing, regulatory-submission, clinical-support).

4. Open reliance-class standard

spec

Published taxonomy of 7 reliance classes as open standard. Competitor labs implement against the spec. Hypernym writes; everyone follows.

If R18 closes 4-of-4

Hypernym is unicorn-trajectory by Year-2, $5-10B by Year-3. The institutional licensing layer + insurance economics + frontier-lab wholesale + open standard creates a moat no frontier lab can buy their way out of via training spend.

06 · Three standout outliers

R17's most novel surface ideas

Each round produces 3-5 outliers worth tracking; these are R17's signal.

Frontier-Lab Wholesale Licensing

claude r2

MSA per frontier lab. Wholesale per-call pricing. Lab markets "Hypernym-verified" badge at retail markup. Distributes Hypernym beneath the entire industry.

Context Reliability Label

codex r2

Nutrition-label-style receipt for every model call: effective context length · retrieval confidence by depth band · dropped-evidence risk · contradiction scan status · refusal obligation. Standardizes how customers compare AI systems.

Insurance-Backed Reliance Underwriting

convergent

Munich Re / Lloyd's partnership. Client pays insurance premium. Hypernym takes commission. Removes procurement convince problem entirely — liability offloads to a familiar contract.

Honorable mentions: Public-data federation with academic + government open-data (Claude R1) · Gemini's Exchange Model as Year-2 federation runtime · Codex's Context Regression Observatory (public continuous monitor of frontier long-context regressions — Hypernym as the authority that tells buyers when "newer is worse") · Grok's "VTCs as native unit of model weight updates" · Qwen's "Verification as a constraint, not a feature" reframe of training objectives.
07 · R18 carry-forward

What R17 didn't close

R18's seed scope. Critical path items first.

closing

The corpus is the moat. The certificate is the product.

R16 closed the algebra. R17 closed the positioning. The four R1 framings — Truth Maintenance System for Institutions · Certificate Authority of Machine Reasoning · Infrastructure-tax / Akamai-for-verification · Model-as-a-Substrate — are facets of one product: the licensed authority that signs all VTCs in the AI ecosystem.

Modulum-shipped reality means Year-1 doesn't need substrate accumulation; it can sell effective retained context today. Frontier labs are visibly regressing while Hypernym moves forward at 5 orders of magnitude lower cost. Regulatory pathway is Year-1 critical-path. Insurance-backed CPAT removes procurement convince problem. Frontier labs become customers, not competitors.

R18 ships the licensed protocol layer. If 4-of-4 anchors close, Hypernym is unicorn-trajectory by Year-2, $5-10B by Year-3.