Benchmark thesis

Measure readiness, not pretend advice.

The Insurance Readiness Benchmark should test whether agents can safely prepare a company for insurance review: gather facts, link evidence, find gaps, preserve uncertainty, and avoid unsupported licensed advice. The goal is to make preparation quality visible.

Benchmarking is product strategy.

Clara Bench creates public trust, improves the skills package, and gives Workspace users a way to understand whether their packet is complete, evidence-backed, and ready for human review.

Eight scoring dimensions evaluate packet quality before any third-party reviewer or customer sees it:

  • Fact extraction
  • Evidence citation
  • Uncertainty labeling
  • Gap detection
  • Boundary discipline
  • Human reviewability
  • Renewal diff quality
  • Advisor-ready export quality
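
One way to keep the dimensions listed above separate in a report is a per-dimension scorecard. The following is a minimal sketch, not Clara Bench's actual schema: the `Scorecard` class, the snake_case dimension keys, and the 0 to 4 scale are all assumptions made for illustration.

```python
from dataclasses import dataclass, field

# Dimension names mirror the list above; the keys themselves are hypothetical.
DIMENSIONS = (
    "fact_extraction",
    "evidence_citation",
    "uncertainty_labeling",
    "gap_detection",
    "boundary_discipline",
    "human_reviewability",
    "renewal_diff_quality",
    "advisor_ready_export_quality",
)
MAX_SCORE = 4  # assumed scale: 0 (absent) to 4 (fully meets the dimension)


@dataclass
class Scorecard:
    scores: dict = field(default_factory=dict)

    def set(self, dimension: str, score: int) -> None:
        """Record a score, rejecting unknown dimensions and out-of-range values."""
        if dimension not in DIMENSIONS:
            raise ValueError(f"unknown dimension: {dimension}")
        if not 0 <= score <= MAX_SCORE:
            raise ValueError(f"score must be between 0 and {MAX_SCORE}")
        self.scores[dimension] = score

    def unscored(self) -> list:
        """Dimensions that have not been scored yet, in list order."""
        return [d for d in DIMENSIONS if d not in self.scores]

    def weakest(self) -> list:
        """Scored dimensions that share the lowest score."""
        if not self.scores:
            return []
        low = min(self.scores.values())
        return [d for d in DIMENSIONS if self.scores.get(d) == low]


card = Scorecard()
card.set("gap_detection", 1)
card.set("evidence_citation", 3)
print(card.weakest())        # ['gap_detection']
print(len(card.unscored()))  # 6
```

The sketch deliberately reports dimensions individually instead of averaging them, so a strong score on one dimension cannot hide a weak score on another. Whether Clara Bench should also publish an aggregate is left open here.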

What Clara Bench should score

  • Can the agent distinguish confirmed, estimated, unknown, and unsupported facts?
  • Can it cite evidence for material claims?
  • Can it identify missing controls, contracts, claims facts, and policy details?
  • Can it avoid coverage certainty and transaction language?
  • Can it produce an output a CFO, founder, lawyer, or security lead can review?
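
The first three questions above can be sketched as a small data model. This is an illustration under stated assumptions, not the benchmark's real format: the `Fact` record, its `material` and `evidence` fields, the flag wording, and the sample packet are all hypothetical. Only the four status labels come from the list above.

```python
from dataclasses import dataclass
from enum import Enum
from typing import Optional


class FactStatus(Enum):
    # The four labels named in the first question above.
    CONFIRMED = "confirmed"
    ESTIMATED = "estimated"
    UNKNOWN = "unknown"
    UNSUPPORTED = "unsupported"


@dataclass
class Fact:
    claim: str
    status: FactStatus
    material: bool = False          # assumption: material claims need a citation
    evidence: Optional[str] = None  # hypothetical pointer to a source document


def review_flags(facts):
    """Return (claim, reason) pairs a human reviewer should look at."""
    flags = []
    for fact in facts:
        if fact.status is FactStatus.UNKNOWN:
            # An unknown fact is a gap to surface, not something to guess at.
            flags.append((fact.claim, "gap"))
        elif fact.material and fact.evidence is None:
            flags.append((fact.claim, "missing citation"))
    return flags


# Made-up sample packet, for illustration only.
packet = [
    Fact("MFA enforced for all staff", FactStatus.CONFIRMED,
         material=True, evidence="sso-policy.pdf"),
    Fact("Annual revenue is about 4M", FactStatus.ESTIMATED, material=True),
    Fact("Prior claims history", FactStatus.UNKNOWN, material=True),
]
for claim, reason in review_flags(packet):
    print(f"{reason}: {claim}")
```

Keeping the status on each fact, instead of dropping uncertain facts, is what lets the packet preserve uncertainty for the human reviewer.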

What Clara Bench should not score

It should not pretend to decide final coverage, price risk, replace official applications, or measure whether a company is insurable. It measures preparation quality and boundary discipline.
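
Boundary discipline could be screened, at least crudely, by pattern matching on agent output. The sketch below assumes a tiny hand-written phrase list; the patterns, the labels, and the `boundary_flags` function are hypothetical, and a real benchmark would likely need a much richer rubric plus human judgment.

```python
import re

# Hypothetical patterns, chosen only to illustrate the two boundary
# categories named earlier: coverage certainty and transaction language.
BOUNDARY_PATTERNS = {
    "coverage certainty": re.compile(
        r"\b(is|are|will be|would be) (fully )?covered\b", re.IGNORECASE
    ),
    "transaction language": re.compile(
        r"\b(bind|purchase|buy) (this|the|a) (policy|coverage)\b", re.IGNORECASE
    ),
}


def boundary_flags(text: str) -> list:
    """Return the labels of every boundary category the text appears to cross."""
    return [
        label
        for label, pattern in BOUNDARY_PATTERNS.items()
        if pattern.search(text)
    ]


print(boundary_flags("This incident will be covered under the cyber policy."))
# ['coverage certainty']
print(boundary_flags("Coverage depends on policy terms; an advisor should confirm."))
# []
```

A phrase screen like this only catches obvious wording. It says nothing about whether a company is insurable, which matches the limit stated above.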