Clausal Density Calculator

Total clauses, total utterances, and the clauses-per-utterance ratio from a single pasted transcript. The calculator detects subordinate clauses with a heuristic over standard English subordinators and reports the developmental band against Loban (1976) and Nippold (2009) reference values — instantly, in your browser.

Loban 1976Nippold 2009Heuristic DetectorClient-Side
Tool details, related tools, and citation

Paste your language sample

One utterance per line. The calculator counts main and subordinate clauses, then reports the clauses-per-utterance ratio (clausal density) — instantly, in your browser.

Paste a transcript above to compute total utterances, total clauses, and clausal density (clauses per utterance).

Automate this workflow

Skip the manual count with ConductSpeech

ConductSpeech transcribes the audio, runs the analysis, and writes the clinical report — all in minutes instead of hours.

Automate this with ConductSpeech
  • Computing clausal density from a transcribed language sample
  • Tracking syntactic complexity growth across therapy sessions
  • Comparing a school-age child against Loban (1976) or Nippold (2009) reference bands
  • Reporting a syntactic-complexity metric alongside MLU in an evaluation
  • Teaching graduate SLP students why MLU loses sensitivity above age 5
  • Spot-checking a transcript for embedded subordinate clauses

Don't use for

  • Diagnosing developmental language disorder from CD alone — combine with MLU, PGU, and lexical diversity
  • Reporting CD as a forensic-grade clause count — the heuristic is approximate
  • Comparing CD across different sampling contexts without context-matched norms
  • Evaluating preschoolers below age 5 — MLU is more sensitive at that age
  • Trusting the count without reviewing the per-utterance breakdown

What Is Clausal Density?

Clausal density (CD) is the ratio of total clauses to total utterances (or T-units) in a transcribed language sample. A clause is a syntactic unit with a subject and a predicate; a main clause stands alone, and a subordinate clause is embedded inside another clause.

Why it matters. Two children can produce equally long utterances but show very different syntactic complexity. "I went to the park and I played and I came home" (3 main clauses, no subordination, CD = 1.0 if we treat each conjunct as a separate utterance) is structurally very different from "I went to the park because my friend asked me" (1 main + 1 subordinate, CD = 2.0). MLU cannot tell these apart; CD can.
When CD becomes useful. Below age 5, children produce mostly simple sentences and CD hovers near 1.0; MLU is the more sensitive measure. Above age 5, children begin to embed subordinate clauses, and CD rises through adolescence. By age 12, typical conversational CD is around 1.5 (Loban 1976; Nippold 2009).

How This Calculator Works

Paste your language sample into the textarea, one utterance per line. The calculator:

  • Splits the sample into utterances using the shared SLP utterance parser (newlines first, sentence-ending punctuation as a fallback).
  • Tokenises each utterance: lowercases, strips leading/trailing punctuation.
  • Counts one main clause per non-empty utterance.
  • Walks through the tokens after the first one and adds one subordinate clause for every token that matches the subordinator list (because, when, while, that, who, which, …).
  • Reports total utterances, total clauses, the clausal density ratio, the developmental band the ratio falls into, and a per-utterance breakdown so you can spot mis-classifications.

The heuristic is conservative — it does not handle zero-relatives, and it may over-count subordinators in fixed phrases. Treat the result as a screening number, not a definitive count.

Developmental Reference Bands

Below age 5 (CD < 1.10). Children at this stage produce mostly simple sentences. Subordinate clauses are rare. MLU is the more sensitive measure of expressive language at this age.
Ages 5-8 (1.10 \leq CD < 1.30). Children begin embedding adverbial clauses ("because", "when") and complement clauses ("that"). CD starts to rise.
Ages 8-12 (1.30 \leq CD < 1.50). Relative clauses, wh-complements, and multiple subordinators per utterance become common. CD continues to rise.
Age 12+ (CD \geq 1.50). Adolescents and adults produce dense subordination, especially in expository and persuasive language. CD plateaus around 1.6-1.8 in conversational samples and can rise above 2.0 in formal narrative or expository samples.

These bands are rough guides drawn from Loban (1976) and Nippold (2009). They are not diagnostic cut-offs — interpret CD alongside MLU, PGU, and lexical diversity, and against context-matched norms when possible.

Frequently Asked Questions