$0 United States Evaluation Request Letter Template

Age Equivalent, Standard Score, Percentile Rank, and Confidence Interval: What They Mean in an IEP Report

One of the most disorienting aspects of reading a psychoeducational evaluation report is the variety of score types that appear on different pages — sometimes on the same page. A child might receive a standard score of 78, a percentile rank of 7, an age equivalent of 5-6, and a 95% confidence interval of 72–84 all for the same skill. Each number describes the same performance differently, but they are not interchangeable — and some are far more useful than others for advocacy.

Standard Scores: The Most Reliable Metric

The standard score (SS) is the primary metric in any psychoeducational evaluation. Most educational assessments use a standard score scale with a mean (average) of 100 and a standard deviation of 15. This creates a consistent framework for comparing any score to the national normative sample.

The ranges work like this:

Standard Score Classification
130 and above Very Superior
120–129 Superior
110–119 High Average
90–109 Average
80–89 Low Average
70–79 Low
69 and below Very Low

The average range (90–109) represents the middle portion of the bell curve where most same-age peers perform. A standard score of 85 is at the 16th percentile — at the lower edge of the normative range but still within the broad average band. A standard score of 70 is at the 2nd percentile — genuinely below average by any clinical standard.

Standard scores are the most reliable metric for tracking progress over time and for making eligibility determinations. A score of 78 today compared to a score of 82 in a re-evaluation two years from now represents a measurable change against a stable reference point.

Percentile Ranks: Useful Context, Not a Grade

A percentile rank tells you how your child's performance compares to same-age peers in the normative sample. A percentile of 25 means 25% of peers scored at or below your child's score — and 75% scored higher. A percentile of 50 is exactly average.

The most important thing to understand about percentile ranks: they are not linear. The distance between the 50th and 60th percentile is much smaller in absolute terms than the distance between the 5th and 15th percentile. The extremes of the distribution are where the real clinical differences live.

A second important thing: percentile ranks are not grades. This confusion is extremely common. A parent who sees a percentile of 7 thinks their child "got a 7 out of 100" — as if 7 is the score on a test. A percentile of 7 corresponds to a standard score of approximately 79 (Low range). It means 7% of same-age peers performed at or below this level — not that the child answered 7% of questions correctly.

This is specifically relevant for school eligibility decisions. A child scoring at the 7th percentile in reading fluency is significantly below average by any clinical standard. Districts that say "scores are within normal limits" at the 7th percentile are applying a broader definition of "normal" than most parents would recognize as appropriate.

Age Equivalents and Grade Equivalents: Misleading by Design

Age equivalents and grade equivalents are the scores that appear most intuitively understandable — and that are most likely to mislead parents and IEP teams.

An age equivalent of "6-4" on a reading test doesn't mean your 10-year-old reads like a 6-year-4-month-old in any meaningful sense. It means your child achieved the same raw score as the average 6-year-4-month-old who took that particular test. The relationship between raw scores and developmental age is not linear, and the same performance at different points on a test's scale can produce very different age equivalent numbers.

Grade equivalents are even more problematic. A grade equivalent of 3.2 on a math test does not mean the child is ready to do third-grade math. It means they achieved the same raw score as the average third-grader, second month of school, on that specific test. A very bright fourth-grader who scores at the 3.2 grade equivalent has not told you they can do third-grade work — they've told you they scored about average on one specific math measure.

Psychometricians, the Association for Test Publishers, and most major testing companies explicitly recommend against using age equivalents and grade equivalents for eligibility decisions or program placement. These metrics are statistically fragile, mislead parents about severity, and don't allow meaningful comparisons across different tests. Standard scores and percentile ranks are almost always more accurate and actionable.

If a school psychologist's report relies heavily on age equivalents or grade equivalents to support an eligibility decision — whether finding a disability or finding no disability — ask them to restate the conclusion using standard scores and percentile ranks instead. The picture may look different.

Free Download

Get the United States Evaluation Request Letter Template

Everything in this article as a printable checklist — plus action plans and reference guides you can start using today.

Confidence Intervals: Acknowledging Measurement Error

No standardized test produces a perfectly accurate score. A child who is having an off day, who is anxious, or who is fighting a cold will perform somewhat differently than the same child in optimal conditions. The confidence interval accounts for this built-in measurement uncertainty.

A 95% confidence interval of 72–84 around a standard score of 78 means that there is a 95% probability that the child's "true" score — their stable underlying ability level, if perfectly measured — falls somewhere in that range. The single number (78) is the best estimate; the interval is the honest acknowledgment that all test scores carry measurement error.

Confidence intervals matter for eligibility decisions in two ways. When a standard score is very close to a cutoff (such as a school that uses a score of 70 as the threshold for a service), the confidence interval reveals whether the child's true ability might reasonably be above or below that cutoff. A score of 73 with a 95% CI of 67–79 brackets the 70 threshold — the child's true score could be below 70, suggesting that a conclusion of "no significant deficit" is not fully supported by the data.

Confidence intervals also reveal that a standard score that went from 82 to 88 between evaluations may not represent true growth — if the confidence intervals for both scores overlap substantially, the change could be within the margin of measurement error rather than a genuine improvement.

Putting It Together

When you're reading an evaluation report, the hierarchy of reliability is: standard scores and percentile ranks first, then confidence intervals, then age and grade equivalents last (or not at all). If a report is using age or grade equivalents to justify an eligibility conclusion, ask for the same conclusion to be restated in standard scores.

The most important single question: does this score represent a meaningful deviation from what is expected for this child's age and cognitive ability? A score at the 25th percentile means something very different for a child with a cognitive ability score of 80 than for a child with a cognitive ability score of 130. Context — including the child's full cognitive profile — is essential for interpreting any single score accurately.

The United States Special Education Assessment Decoder includes a score interpretation guide that walks through how to read standard scores, percentile ranks, scaled scores, T-scores, and confidence intervals across all the major assessment tools used in school evaluations — so that when you're looking at a report, you understand exactly what you're seeing.

Get Your Free United States Evaluation Request Letter Template

Download the United States Evaluation Request Letter Template — a printable guide with checklists, scripts, and action plans you can start using today.

Learn More →