Eval Scores Chart
Eval pass rates per release as a shadcn line chart, including the regression we published.
Eval pass rates per release as a shadcn line chart, including the regression we published.
The Marketing Collection unlocks the source for every Marketing block. All Access unlocks every Collection.
Already purchased? Log in
Eval Scores Chart treats AI quality like a metric instead of an adjective. Two lines track releases, the share of drafts editors accept unchanged rising, the share flagged by fact checking falling, and the footnote owns the one release that regressed. Publishing the bad release is the move, it makes the good releases believable.
The chart is built on the shadcn chart primitives, so the data array at the top of the file is the only thing to replace when wiring your real eval history. Config labels and grayscale chart tokens come from chartConfig.
Reach for this block on AI pages aimed at buyers who have heard every accuracy claim already. It assumes you actually run evals per release, do not fake this one.
A natural flow around it on a Marketing Pro page:
Before
After
One strong use is acceptance rate and factual flag rate per model release. Other evals:
Tip: include the regression release. A monotonically improving chart reads as marketing, a dented one reads as measurement.