Narrative Scoring Scheme Calculator

Score a paediatric narrative language sample against the seven-subscale Heilmann, Miller, Nockerts, & Dunaway (2010) Narrative Scoring Scheme rubric. Rate each subscale (introduction, character development, mental states, referencing, conflict resolution, cohesion, conclusion) from 0 (immature / absent) to 5 (proficient), and the calculator sums the scores, classifies the total out of 35 against the published 5-11-year-old story-retell expectation band (15-28), and returns up to three intervention targets derived from the lowest-scoring subscales. Built for school-based SLPs, clinic SLPs, early-intervention teams, graduate SLP students, and paediatric language researchers.

7 Subscales0-5 AnchorsHeilmann 20105-11 Year NormsSALT Narrative RubricClient-Side
Tool details, related tools, and citation

Score a narrative sample with the NSS rubric

Rate each of the seven Narrative Scoring Scheme (NSS; Heilmann et al. 2010) subscales from 0 (immature / absent) to 5 (proficient) based on the child’s story retell or personal narrative. The tool sums the subscale scores, classifies the total against the published 5-11-year-old expectation band (15-28 of 35), and returns up to three intervention targets derived from the lowest-scoring subscales.

NSS subscales (0 = immature, 5 = proficient)

Introduction

Does the narrator set the scene — introduce the setting, the main characters, and the starting conditions before the story action begins?

Score
3

Developing — setting and at least one main character introduced, but lacking detail or time reference.

Character development

Are the main characters developed beyond a name — actions, dialogue, traits, perspective, and change over the course of the story?

Score
2

Emerging — one main character has two or more actions or traits; others are flat.

Mental states

Does the narrator reference feelings, thoughts, motivations, beliefs, and internal states of the characters (theory of mind)?

Score
2

Emerging — two or three mental-state words across the story.

Referencing

Are pronouns and referring expressions unambiguous — does the listener always know who "he," "she," "they," or "it" refers to?

Score
3

Developing — most references are clear but a few ambiguous pronouns remain.

Conflict resolution

Does the story have a clear problem (initiating event), an attempt, and a resolution that follows from the attempt?

Score
3

Developing — problem, attempt, and resolution present but the causal chain is weak.

Cohesion

Is the story smoothly sequenced with appropriate temporal and causal connectors (then, so, because, after, while, finally)?

Score
2

Emerging — some variety in connectors (then, so); sequencing mostly clear.

Conclusion

Does the story have a clear ending — the conflict is closed, consequences are stated, and the narrative wraps up rather than just stopping?

Score
2

Emerging — ending present but does not address the initiating problem.

Total NSS score
17 / 3549 % of max — within expectation

NSS total 17 / 35 (49 % of max) — within the Heilmann et al. (2010) 5-11-year-old expectation band (15-28). Narrative macrostructure is broadly intact; focus intervention on the lowest-scoring subscales and continue to monitor with fresh narrative samples every quarter.

Suggested intervention targets
  1. 1. Character development Scored 2 / 5 — target for explicit narrative intervention. Build character traits, actions, and dialogue through structured story grammar templates and character-feeling visual supports.
  2. 2. Mental states Scored 2 / 5 — target for explicit narrative intervention. Seed mental-state vocabulary (wanted, thought, felt, believed, worried) with explicit instruction and generalisation to classroom texts.
  3. 3. Cohesion Scored 2 / 5 — target for explicit narrative intervention. Expand the temporal / causal connector set (then, so, because, while, after, finally) and prune "and then" over-reliance.
Clinical caveats
  • The NSS is a macrostructure rubric — it rates the story-level organisation of the narrative. It does NOT measure microstructure (MLU, NDW, TTR, grammatical accuracy, utterance complexity). Always pair the NSS with a microstructure measure (MLU + NDW + PCC + PGU + grammatical error count) for a complete language-sample profile.
  • The 5-11-year-old normative band (Heilmann et al. 2010) is based on story retells elicited with a wordless picture-book protocol ("Frog, Where Are You?" or "Frog Goes to Dinner"). Personal narratives, fiction-generation tasks, or expository samples may produce systematically different scores — use the band with caution for non-retell genres.
  • The NSS is a reliability-dependent rubric — published inter-rater reliability for trained scorers is in the 0.80-0.90 range. Novice scorers should score at least five practice samples with a trained rater before interpreting independent scores clinically.

Implementation of the Narrative Scoring Scheme (Heilmann, Miller, Nockerts, & Dunaway 2010) with interpretive bands derived from the 5-11-year-old story-retell normative sample. Planning aid for language-sample analysis — not a diagnostic instrument on its own. Pair with a microstructure measure (MLU, NDW, IPSyn, DSS, PCC, PGU) for a complete language-sample profile.

Get the full analysis

Unlock normative comparison and error analysis

This free tool covers the basic case. ConductSpeech adds normative comparison, error categorisation, and a parent-ready report.

Get the full analysis with ConductSpeech
  • Initial evaluation of a school-age child (5-11 years) referred for a language concern — score one story-retell and one personal-narrative sample to establish the narrative macrostructure baseline
  • Annual IEP review for a child on the caseload with a narrative or story-grammar goal — score a fresh narrative sample the week before the IEP meeting to report progress
  • Progress monitoring during active narrative intervention — re-score a fresh narrative sample every 8-12 weeks to document gains
  • Discharge / dismissal consideration — score two fresh narrative samples to confirm the child has moved from below-band to within-band on the NSS
  • Classroom screening for reading-comprehension risk — use the NSS on a whole-class story retell to identify children below the expectation band for Tier-2 reading support
  • Graduate SLP training — teach narrative macrostructure scoring with the NSS rubric and the published Frog story-book transcripts
  • Paediatric language research — score narrative samples for group-level analyses of DLD, autism, ADHD, or dual-language children with the published reliability coefficients
  • Second-rater calibration for an existing NSS scorer — score the same transcripts and compare to the published anchor descriptors

Don't use for

  • As a stand-alone diagnostic instrument for developmental language disorder — always pair with a microstructure measure and the full case history
  • For children under 5 years of age — the 5-11-year-old normative band does not apply, and the rubric is built around story-grammar elements that are not stable before age 4-5
  • For children over 11 years of age — the normative band ends at age 11 and adolescent narrative development follows different trajectories
  • For expository or procedural samples — the NSS is designed for narrative genre (story retell, personal narrative, fictional narrative) and the subscales do not map onto expository macrostructure
  • For dual-language learners in the second language before exposure has reached 18-24 months — normative band may over-identify L2 learners as below-band
  • By novice scorers who have not completed the calibration training protocol — inter-rater reliability is scorer-dependent and novice scores drift high
  • To re-score the same transcript after intervention — the child has been explicitly taught the story structure and the re-score will overstate the gain

Why Narrative Macrostructure Matters

Narrative language is the bridge between oral and written language. A child who cannot tell a coherent story at 5 or 6 years of age is at heightened risk for reading comprehension difficulty at 8 or 9, because the same story-grammar frame — setting, character, problem, attempt, resolution, consequence — drives both oral narrative production and written expository comprehension. The longitudinal evidence (Bishop & Edmundson 1987; Fey et al. 2004; Wetherell et al. 2007; Gillam & Pearson 2017) is consistent: narrative macrostructure at kindergarten predicts reading comprehension at third grade, even after controlling for decoding, vocabulary, and working memory.

The NSS was built to make narrative scoring feasible in school-based SLP practice. Before the NSS, macrostructure scoring required either a proprietary rubric (e.g., Stein & Glenn 1979 story grammar analysis, which was too time-intensive for weekly caseload use) or an informal "holistic" rating that had no published psychometrics. Heilmann, Miller, Nockerts, and Dunaway (2010) published the NSS alongside the SALT Software narrative database, giving school SLPs a rubric that takes 10-15 minutes per transcript, has published inter-rater reliability of 0.80-0.90 for trained scorers, and has published norms for 5-11-year-old story retells. The NSS is now the de-facto standard macrostructure rubric in school-based SLP practice in the United States and is explicitly referenced by the ASHA School-Based Service Delivery Practice Portal (2024).
Intervention evidence supports teaching narrative macrostructure directly. Petersen and Spencer (2016) summarised the evidence for explicit narrative instruction — using story-grammar icons, visual story maps, and structured retell scaffolds — and demonstrated strong effect sizes for narrative organisation gains in children with developmental language disorder (DLD), autism, and learning disability. The NSS provides the progress-monitoring instrument that matches the intervention — score a fresh narrative sample every 8-12 weeks and compare the total to baseline.

The Seven NSS Subscales in Detail

The seven NSS subscales are not arbitrary — they correspond directly to the core elements of the story-grammar frame taught in reading comprehension and expository writing curricula. Each subscale is rated on a 0-5 scale from immature (0) to proficient (5).

1. Introduction. Does the narrator set the scene — introduce the setting, the main characters, and the starting conditions — before the story action begins? A 0 is a story that starts mid-action with no setting or character identification. A 5 is a complete story-book opening with setting, main character, supporting characters, time, place, and the launching condition.
2. Character development. Are the main characters developed beyond a name — actions, dialogue, traits, perspective, and change over the course of the story? A 0 is "the boy" throughout with no development. A 5 is a three-dimensional protagonist with traits, actions, dialogue, and perspective, and a visible change arc.
3. Mental states. Does the narrator reference feelings, thoughts, motivations, beliefs, and internal states (theory of mind)? A 0 is a purely behavioural story with no mental-state vocabulary. A 5 is rich mental-state vocabulary (thought, believed, decided, worried, realised) with motivation-consequence chains.
4. Referencing. Are pronouns and referring expressions unambiguous — can the listener always track who "he," "she," "they," or "it" refers to? A 0 is referencing that breaks down and leaves the listener lost. A 5 is flawless reference chains across the entire narrative with appropriate re-introduction of characters after shifts.
5. Conflict resolution. Does the story have a clear problem (initiating event), an attempt, and a resolution that follows from the attempt? A 0 is no problem or resolution structure. A 5 is multiple problem-resolution episodes embedded, each with causal links and consequence.
6. Cohesion. Is the story smoothly sequenced with appropriate temporal and causal connectors (then, so, because, after, while, finally)? A 0 is a list of disconnected events. A 5 is rich use of subordinating connectors with smooth sequencing and embedded clauses.
7. Conclusion. Does the story have a clear ending — the conflict is closed, consequences are stated, and the narrative wraps up rather than just stopping? A 0 is a story that stops abruptly mid-action. A 5 is a rich ending with resolution, consequence, character reaction, and closing story-book phrasing.

Interpreting the Total NSS Score

The total NSS score is the sum of the seven subscales (range 0-35). The calculator classifies the total into one of three interpretive bands derived from the Heilmann et al. (2010) 5-11-year-old story-retell normative sample.

Below expectation (0-14 of 35). The child's narrative macrostructure is in the lower 25th percentile band for the 5-11 age range. This is an intervention target. Begin explicit narrative instruction using the story-grammar frame (Petersen & Spencer 2016), schedule 20-30 minutes of direct narrative work twice a week, and re-score a fresh narrative sample in 8-12 weeks for progress monitoring. Narrative macrostructure below band is a frequent but not specific marker of developmental language disorder (DLD) and is also seen in children with attention, working memory, and hearing issues — rule out hearing and attention before concluding that narrative organisation is the primary target.
Within expectation (15-28 of 35). The child's narrative macrostructure is broadly intact and within the middle 50 % of the normative sample. Focus intervention on the lowest-scoring subscales rather than the full story-grammar frame, and continue to monitor with fresh narrative samples every quarter. A child within band on the NSS can still have a DLD profile driven by microstructure impairment — check MLU, NDW, IPSyn, DSS, PCC, and PGU for the full language-sample picture.
Above expectation (29-35 of 35). The child's narrative macrostructure is a relative strength and is in the upper 25 % of the normative sample. If the child is otherwise on the caseload, focus intervention time on the lower-scoring language domains (expressive grammar, vocabulary, phonology, or fluency) rather than on narrative. A perfect 35 / 35 is rare — verify the narrative is a spontaneous retell and not a memorised or rehearsed performance, and re-elicit with an unfamiliar story to confirm.

Reliability and Scorer Training

The NSS is a reliability-dependent rubric. Heilmann et al. (2010) reported inter-rater reliability of 0.80-0.90 for trained scorers on "Frog, Where Are You?" retells from 5-11-year-olds, which is in the "good to excellent" range for clinical measurement. However, the reliability depends on the scorer being trained — novice scorers (including graduate SLP students on their first narrative rotation) will score systematically higher than trained scorers on the same transcript until they have calibrated against a gold-standard rater.

Training protocol for new scorers: (1) Read the Heilmann et al. (2010) paper in full. (2) Score five published example transcripts and compare against the published scores. (3) Score five of your own caseload samples blind, then score them again with a trained colleague as a second rater. (4) Resolve scoring discrepancies item-by-item to build the shared mental model of each anchor descriptor. (5) After 10 calibration samples, you can interpret independent scores clinically.
Ongoing calibration: Periodically re-calibrate with a second rater on a 10-20 % sub-sample of your caseload. School-based SLPs can pair with another SLP in the district, or use the SALT Software narrative transcripts as reference samples. Drift is common — scorers who have been rating alone for 6+ months often diverge from the published norms and benefit from a calibration session.
Reporting: When reporting NSS scores in a clinical report or IEP progress note, report the total, the percent-of-max, the band classification, and the subscale breakdown. Avoid reporting the subscale score without the total — a subscale score in isolation has no established norm.

Frequently Asked Questions