Sebastián Sarmiento
← The Notebook

Translating IRT for people who build products

Item Response Theory is not a reporting feature — it’s a way of deciding what a score is allowed to mean. Here’s the working translation for product teams.

Updated May 14, 2026

Most product teams meet Item Response Theory as a request: “can we add an IRT score?” That framing is already lost. IRT is not a number you bolt onto a report. It is a decision about what a score is allowed to mean — and that decision shapes the item bank, the UI, and the roadmap long before anyone sees a chart.

The one idea that transfers

Strip away the logistic curves and IRT says something simple and radical for a product person: difficulty belongs to the item, ability belongs to the person, and the two are estimated on the same scale. A right answer to a hard item and a right answer to an easy item are not the same evidence, and a percent-correct score pretends they are.

A percentage treats every question as equal. IRT is the refusal to do that.

What it changes upstream

Once difficulty is a property of items, your item bank becomes an instrument that has to be calibrated, not just authored. That has product consequences:

  • New items need exposure before their difficulty is trustworthy.
  • A score is only as defensible as the items underneath it.
  • “Add a question” is no longer a free action — it changes the measurement.

This is usually the unwelcome part. It means the content pipeline and the measurement are the same system, and you cannot move fast on one while ignoring the other.

What it changes downstream

A calibrated scale lets you say honest things you otherwise can’t: this student is here on the ability scale; this item sits there on the difficulty scale; this is the probability they answer it correctly. That is the raw material for knowledge tracing done honestly and for a mastery model that survives an audit.

The translation, in one line

For a product team, IRT is not a feature — it is the agreement that a score means the same thing for every student, and the discipline required to keep that promise. Build for that promise and the charts take care of themselves.