Use case

Proposal Evaluation

In a large enterprise, Proposal Evaluation decides who wins the work. It runs the structured review of competing bids against the criteria that were set, scores them, and awards the contract, and it has to defend that award to the bidders who lost, to procurement, and sometimes to a formal protest, on the record it made at the time.

The concession

What a scorecard already does well.

Most of an evaluation is countable. Intake counts, response completeness, deadlines, raw scores tabulated. A scorecard handles all of it, and so does a general-purpose model. We concede every bit of it, without hedging. Nexonomy is not built to win the tabulation a spreadsheet already does.

The real question

The question with no correct weighting.

The award is the other kind of question. Which proposal wins when the cheapest bid is the riskiest and the strongest capability is the most expensive, and the criteria share no common unit. There is no correct weighting, and three evaluators will score the same response three ways. The output is not a total. It is an award someone signs and may have to defend to a challenger.

If summing the scores settled it, you would not need a committee. The award is not that sum.

On the record

One award, captured.

Nexonomy captured the evaluation as a typed, content-addressed Decision Record, sealed the moment it was saved: the criteria and their weights, each supplier's scores and the evidence behind them, the evaluator disagreement, the override of the top raw score and its rationale, the sign-offs, and the hash. One record, there on the day it matters. It runs watch-only inside your own environment, on the models you already trust, and nothing leaves the tenant.

RFP-2026-44/Award/DR-2026-0301
Sealed
DR-2026-0301 Sealed 2026-05-30 · RFP-2026-44

Which proposal wins when price, capability, and risk pull apart?

Decision Criteria Sign-offs
Options
Award Bidder C, override the raw-score leader Selected
Award the incumbent on raw totalRejected
Re-tender the requirementRejected
Weights
Capability 35% Risk 30% Price 20%
Confidence
High on rationale · Medium on delivery0.72

Recorded dissent: one evaluator scored delivery lower. The spread is kept on the record, not averaged away.

hash c33b · 8a14 · ef07 · 2d59 Content-addressed
DR-2026-0301 · the award, the weighting, the evaluator disagreement, and the override, sealed in one record.
The engine

How the call gets made.

Behind that record is the part a scorecard does not have. Separate from the evaluation committee, several frontier models weigh the same proposals in parallel, each one trading price, capability, and risk against the others. Where they agree, confidence rises. Where they disagree, the split is surfaced and kept rather than averaged into one number, so the recommendation that reaches the signer carries the dissent with it.

The award, advised in parallel Which proposal wins when price, capability, and risk pull apart?
model · alpha

Award the mid-price bid. Best risk-adjusted capability; the cheapest carries unpriced delivery risk.

conf 0.76Mid bid
model · beta

Award the mid bid. The top raw score over-weights price.

conf 0.74Mid bid
model · gamma

Award the lowest-price bid. It clears the bar and saves the most.

conf 0.62Low bid
model · delta

Mid bid, but the delivery-risk delta is real; carry it as a flag.

conf 0.68Mid bid
Three converge on the mid bid over the top raw score; one holds out for the lowest price. The split is kept, not averaged. It reconciles to award the mid bid, the top raw score overridden with the rationale on the record, and the price-led position sealed as dissent. The confidence follows the agreement: 0.72, high on the rationale where they agreed, medium on delivery where they split.
What it keeps

Everything that went into the call.

  • The weighting, on the record

    Why each criterion weighed what it did, when no shared unit exists.

  • Evaluator drift, kept

    Where three evaluators diverged, recorded as signal, not averaged away.

  • The override, justified

    When the top raw score was passed over, the reason is on the record, not off it.

  • Replayable under challenge

    Criteria, weights, evidence, and sign-offs, reconstructed if the award is protested.

Under protest

When the award is challenged.

Six months on, a losing bidder files a protest: the award was arbitrary, the override unexplained. The committee has dispersed and the evaluation lives across a shared drive and a dozen inboxes. On a normal stack, defending it means reassembling who scored what, and why the top raw score was passed over, from people's memories, weeks of work with the award frozen. If the rationale cannot be produced, the award is vacated and the procurement starts over.

Nexonomy produces it in one lookup, because it never came apart. The award was reached by the model panel reconciling its own split, and the evaluators', against criteria with no shared unit, and that reasoning was sealed as a typed, content-addressed object: the criteria and weights, each supplier's scores and evidence, the model and evaluator disagreement, the override of the top raw score and its rationale, and the sign-offs.

Under challenge, retrieval is a lookup: the weights, the dissent, and the override rationale exactly as captured, addressed by the hash of their canonical content so any copy verifies against the original. The defense is not a claim that the file is unalterable; it is that the complete reasoning was captured when the award was made and reconstructs exactly, which is what a protest cannot break. The scoring store is the commodity part. What is not, and what re-fits every model cycle, is the parallel advisory and the calibration behind it, kept behind a schema stable enough that an award from two years ago still replays.

Across every decision

It works the same for every decision.

An award under protest today, a launch gate next quarter, a vendor cut, a sourcing call you have to sign. The same system carries each one. Here is how it reads in the others.

Proof

Proving it inside PepsiCo.

We are proving it where it is hardest to argue with: inside a Fortune 500 enterprise, watch-only, in their own environment, on real decisions. One deployment, scoped honestly, with no invented numbers.

Get started

See it on one of your own awards.

Deploy in your environment, watch-only first, on a real award decision of your own.