Comprehensive languageCELF-5Ages 5;0–21;11

Clinical Evaluation of Language Fundamentals, Fifth Edition

CELF-5 is the most widely used comprehensive language battery in school-age SLP practice, covering receptive and expressive language from kindergarten through age 21.

At a glance

Publisher
Pearson Clinical
Edition year
2013
Age range
5;0–21;11
Domain
Comprehensive language
Administration time
30–60 minutes for the core battery

Standard scores for this battery use a mean of 100 and a standard deviation of 15 unless noted otherwise in the publisher manual. Always cross-reference against the current edition of the manual before clinical use — publishers update norms and scoring tables on regular cycles.

What CELF-5 is

The Clinical Evaluation of Language Fundamentals, Fifth Edition is the dominant comprehensive language assessment in U.S. school SLP practice. It is built around a Core Language Score derived from four subtests selected by age, plus a wider menu of supplementary subtests covering receptive, expressive, language content, language structure, and language memory indices. CELF-5 introduced an Observational Rating Scale and a Pragmatics Activities Checklist alongside the formal subtests in an attempt to broaden the construct beyond the traditional decontextualised tasks. The Core Language Score has a mean of 100 and a standard deviation of 15, and most school districts use a cut-off of one to one-and-a-half standard deviations below the mean as part of their special-education eligibility decisions.

Subtests and structure

  • Sentence Comprehension
  • Linguistic Concepts
  • Word Structure
  • Word Classes
  • Following Directions
  • Formulated Sentences
  • Recalling Sentences
  • Understanding Spoken Paragraphs
  • Word Definitions
  • Sentence Assembly
  • Semantic Relationships
  • Reading Comprehension
  • Structured Writing

What CELF-5 measures

CELF-5 is designed to measure four overlapping constructs: receptive language (the comprehension of words, sentences, and paragraphs), expressive language (sentence formulation, repetition, and production), language content (semantic knowledge and word relationships), and language structure (morphosyntactic mastery in tasks like Word Structure and Recalling Sentences). The battery yields a Core Language Score plus four optional index scores and individual subtest scaled scores. Recalling Sentences in particular is widely treated as a sensitive single-task probe of grammatical maturity, because it taps both syntactic processing and verbal working memory at the same time.

Strengths and limitations

Strengths

  • Largest normative sample of any school-age language battery in current use (over 3,000 children)
  • Familiar to school teams, district psychologists, and IEP committees nationwide
  • Core Language Score has solid test-retest reliability (>0.90 in the manual)
  • Subtest menu is broad enough to support both screening and diagnostic decisions

Limitations

  • Spaulding, Plante, and Farinella (2006) showed sensitivity below 80% for DLD identification when using a –1 SD cut-off
  • Recalling Sentences relies heavily on verbal working memory and may underestimate grammatical knowledge in children with attention deficits
  • Decontextualised picture and sentence tasks may underestimate functional communication in dynamic settings
  • Bias has been documented for African American English speakers when standard scoring is applied without dialect-sensitive adjustments

How language sample analysis complements CELF-5

CELF-5 is a structured-task battery — it asks the child to repeat a sentence, name a picture, or follow a direction in a context the examiner controls. A 50-utterance language sample captures everything CELF-5 cannot: spontaneous sentence length and complexity in connected speech, lexical diversity in topics the child chooses, finite verb morphology in real conversation rather than elicited probes, and discourse-level features like topic maintenance and turn-taking. Children with developmental language disorder routinely score in the broad average range on the CELF-5 Core while producing sentences in spontaneous play that fall well below age expectation on MLU-M, depressed PGU, and obvious tense omissions. Pair the CELF-5 Core with a 50-utterance LSA and a non-word repetition task to recover the cases the standardised battery alone misses.

The case that fools CELF-5 most often is the bright preschooler who scores in the average range on every subtest and still cannot string two clauses together at the snack table. If the family is worried, run a language sample anyway.
Average scores, sample anyway

Free tools that pair with CELF-5

MLU Calculator

Paste a language sample and get Mean Length of Utterance in morphemes and words, total utterances, total morphemes, and the matching Brown's stage. Implements Brown (1973) morpheme counting rules and runs entirely in your browser.

Open tool

Language Sample Worksheet

Free printable and fillable language sample analysis worksheet for speech-language pathologists. Five columns (utterance #, transcription, morpheme count, grammatical Y/N, notes), configurable row count up to 100 utterances, browser print produces a clean PDF, and an inline running summary tracks total utterances, total morphemes, and rolling MLU as you fill it in.

Open tool

DSS Calculator

Free Developmental Sentence Scoring (DSS) calculator for speech-language pathologists. Paste 50 sentences, tap weighted points across the eight Lee (1974) grammatical categories — Indefinite Pronouns, Personal Pronouns, Main Verbs, Secondary Verbs, Negatives, Conjunctions, Interrogative Reversals, Wh-Questions — plus the all-correct sentence point, and the calculator returns the live DSS with a per-category breakdown. Mobile-friendly tap-through grid, client-side, no sign-up.

Open tool

IPSyn Calculator

Free Index of Productive Syntax (IPSyn) calculator for speech-language pathologists. Score the 56 Scarborough (1990) grammatical items across the Noun Phrase, Verb Phrase, Question/Negation, and Sentence Structure subscales from a 100-utterance language sample. Live IPSyn total, per-subscale subtotals, sample-size guard, mobile-friendly, client-side, no sign-up.

Open tool

References

  1. Wiig, E. H., Semel, E., & Secord, W. A. (2013). Clinical Evaluation of Language Fundamentals, Fifth Edition. Pearson Clinical.
  2. Spaulding, T. J., Plante, E., & Farinella, K. A. (2006). Eligibility criteria for language impairment: Is the low end of normal always appropriate? LSHSS, 37(1), 61–72.
  3. Betz, S. K., Eickhoff, J. R., & Sullivan, S. F. (2013). Factors influencing the selection of standardized tests for the diagnosis of specific language impairment. LSHSS, 44(2), 133–146.