Reversal Learning

Overview

Reversal learning assesses cognitive flexibility by training an animal on a stimulus-response contingency and then reversing the reward contingencies. For example, if the animal has learned that lever A is reinforced and lever B is not (discrimination phase), the contingencies are switched so that lever B is now reinforced and lever A is extinguished. The number of errors and trials to reach criterion after reversal measures the ability to suppress a previously learned response and acquire a new one.

Reversal learning depends critically on orbitofrontal cortex (OFC) and serotonergic neurotransmission. OFC lesions impair reversal but spare initial discrimination, making this paradigm a selective assay for cognitive flexibility. Serial reversals can be used to assess learning-to-learn effects and are impaired in models of autism, schizophrenia, and addiction.

ConductMaze automates both the discrimination and reversal phases, applying criterion-based advancement rules to switch contingencies automatically. The software tracks perseverative errors (continued responding on the previously correct option), regressive errors, and learning curves across multiple serial reversals.

Trial Flow

start

Discrimination Phase

Train stimulus-response contingency to criterion

decision

Criterion Check

Has subject reached discrimination criterion?

process

Contingency Reversal

Reward contingencies switched between options

input

Trial Presentation

Subject chooses between options under new contingency

output

Outcome Delivery

Reinforcer for correct choice, no reward for incorrect

decision

Reversal Criterion

Has subject reached reversal criterion?

end

Phase Complete

Log errors, advance to next reversal or end

Parameters

ParameterTypeDefaultDescription
Criterioninteger8Consecutive correct trials to meet criterion (e.g., 8/10 correct)
Criterion Windowinteger10Sliding window for criterion calculation
Number of Reversalsinteger1Total reversal phases (1 = single reversal, >1 = serial reversal)
Max Trials per Phaseinteger200Maximum trials before forced phase advancement
ITI Durationseconds5Inter-trial interval between choice trials
Correction ProcedurebooleanfalseRepeat incorrect trials until correct (reduces side bias)
Stimulus TypeenumSpatialDiscrimination dimension (spatial, visual, auditory)

Metrics

MetricUnitDescription
Trials to CriterioncountTotal trials to reach criterion after reversal
Perseverative ErrorscountErrors on the previously correct option (first errors after reversal)
Regressive ErrorscountErrors occurring after initial shift away from old rule
Total ErrorscountAll incorrect choices during reversal phase
Choice LatencysecondsMean time from trial start to choice response
Win-Stay ProbabilityproportionProbability of repeating a rewarded choice
Lose-Shift ProbabilityproportionProbability of switching after an unrewarded choice

Sample Data

PhaseTrials_to_CriterionPerseverative_ErrorsRegressive_ErrorsTotal_ErrorsMean_Latency_s

Representative data for illustration purposes. Actual values will vary by species, strain, and experimental conditions.

Applications

  • 1
    Cognitive flexibilityselective OFC-dependent measure dissociated from initial learning
  • 2
    Psychiatric modelingreversal impairments in autism, schizophrenia, and OCD models
  • 3
    Addiction researchdrug-induced perseveration as a model of compulsive behavior
  • 4
    Serotonin pharmacology5-HT depletion selectively impairs reversal learning
  • 5
    Developmental neurosciencetracking maturation of flexible behavior in juveniles

Compatible Products

ME-OC-BASEME-OC-LEVERME-OC-PELLETME-OC-TTL

Ready to Automate Your Behavioral Protocols?

Contact us for a demo and pricing information.