I now have comprehensive data from all the major textbook sources. Let me compile the complete, detailed notes.
Auditory Evoked Potentials (AEPs) - Comprehensive Notes
Overview
An auditory evoked potential (AEP) is a waveform that reflects the electrophysiologic activity of a specific portion of the central auditory nervous system in response to sound stimulation. These small electrical responses are isolated from the background EEG by signal averaging - the auditory response occurs at a predictable latency post-stimulus onset, while background noise is random. Successive responses are averaged together, enhancing the AEP while canceling out noise.
AEPs are also called Electrical Response Audiometry (ERA). They are classified as:
- Exogenous (obligatory) potentials - depend primarily on stimulus characteristics; include ECochG, ABR, MLR, LLR, ASSR
- Endogenous (cognitive/event-related) potentials - depend on the psychological significance of the stimulus to the subject; include P300, N200, N400, MMN, P600
Classification by Latency
| Category | Latency Range | Components |
|---|
| Very Short (Electrocochleography) | 0-5 ms | SP, CM, AP/N1 |
| Short Latency | 0-10 ms | ABR Waves I-VII |
| Middle Latency | 10-50 ms | Po, Na, Pa, Nb, Pb |
| Long Latency (ALR) | 50-250 ms | P1, N1, P2, N2 |
| Cognitive / ERPs | 200-800+ ms | P300, MMN, N400, P600 |
General Recording Principles
- Electrode placement follows the 10-20 international system. For ABR: non-inverting at Fz, inverting at A1 or A2 (mastoid/earlobe), ground below Fz. For MLR and ALR: non-inverting at Cz, C3, or C4.
- Stimuli most commonly used: click (100 µsec rarefaction/condensation square wave, broadband with dominant energy at 2000-4000 Hz), and tone bursts (frequency-specific).
- Filtering: ABR uses 100-3000 Hz bandpass; lower cutoffs (~30 Hz) needed for 500 Hz tone bursts.
- Stimulus rate: ABR typically 17.7-30 clicks/sec (non-integer to avoid time-locking to periodic noise); fast-rate ABR uses 77.7/sec.
- Averaging trials: Minimum 2000-3000 per recording; responses replicated twice for reliability.
1. Electrocochleography (ECochG / ECoG)
Latency: 0-5 ms post-stimulus (earliest recordable AEP)
Neural Generators:
The ECochG records three distinct potentials generated within the cochlea and distal auditory nerve:
-
Cochlear Microphonic (CM): An alternating current (AC) voltage generated primarily by the outer hair cells and organ of Corti. It exactly replicates (mirrors) the acoustic input stimulus at low-to-moderate levels. Because it mimics the stimulus, it can be difficult to distinguish from stimulus artifact in clinical settings. Alternating polarity stimuli (rarefaction and condensation clicks presented in alternating fashion) are used to cancel the CM and isolate other components.
-
Summating Potential (SP): A direct current (DC) response that reflects the envelope of the input stimulus. Generated by outer hair cells (>50%), organ of Corti, and inner hair cells. It appears as a "shoulder" preceding the action potential on the ECochG waveform.
-
Compound Action Potential (AP / N1): The primary clinically used component. Generated by spiral ganglia and distal eighth cranial nerve afferent fibers. It represents the synchronous discharge of the auditory nerve fibers in response to click onset.
Electrode placement: Best recorded as a near-field response. Two options:
- Transtympanic (TT-ECochG): Needle electrode placed through the tympanic membrane onto the promontory of the temporal bone - yields the largest, most detailed recordings
- Tympanic membrane/ear canal electrode: Non-invasive extratympanic; smaller amplitude but clinically practical
Clinical applications:
- Diagnosis of Meniere's disease / endolymphatic hydrops: An elevated SP/AP ratio >0.4 (or >40%) suggests endolymphatic hydrops. The SP is disproportionately enlarged relative to AP due to distortion of the basilar membrane from endolymph pressure. With tympanic electrode recordings, expanded ECochG sensitivity is gaining recognition.
- Diagnosis of third-window conditions: Superior Semicircular Canal Dehiscence (SSCD) - new evidence supports ECochG sensitivity here.
- Auditory Neuropathy Spectrum Disorder (ANSD): CM is present and robust while ABR is absent or severely abnormal - the CM is the defining feature.
- Intraoperative monitoring: During acoustic neuroma/vestibular schwannoma resection, ECochG monitors cochlear function in real time. Preceded by preoperative baselines. Combined with ABR.
- Threshold estimation: Less commonly used than ABR for this purpose, but can be valuable.
SP/AP ratio: Normal <0.4; >0.4 suggests endolymphatic hydrops. The amplitude ratio is a more primary factor in ECochG interpretation than in ABR.
2. Auditory Brainstem Response (ABR)
Also known as: BERA (Brainstem Evoked Response Audiometry), BAER (Brainstem Auditory Evoked Response)
Latency: 0-10 ms post-stimulus
Neural Generators (Waves I-VII):
| Wave | Generator |
|---|
| I | Distal (peripheral) eighth nerve, near cochlea exit |
| II | Proximal eighth nerve, near brainstem |
| III | Caudal brainstem - cochlear nucleus, trapezoid body, superior olivary complex |
| IV | Superior olivary complex |
| V | Lateral lemniscus as it enters the inferior colliculus |
| VI | Inferior colliculus |
| VII | Inferior colliculus |
The most clinically reliable waves are I, III, and V. Wave V is the most robust and the last to disappear as stimulus intensity decreases toward threshold.
Key measurement parameters:
- Absolute latency: Time from stimulus onset to peak of each wave (ms). Normal adult values approximate: I ~1.5 ms, III ~3.8 ms, V ~5.6 ms.
- Interwave (interpeak) latency intervals: I-III (~2.3 ms), III-V (~2.1 ms), I-V (~4.0 ms). These reflect neural conduction time between generators and are highly sensitive to retrocochlear pathology.
- Interaural wave V latency difference (ILD): Should be <0.4 ms between ears; abnormal ≥0.4 ms - sensitive for eighth nerve tumor detection.
- Rate-induced latency shift for wave V: Abnormal if ≥0.8 ms - indicator of retrocochlear pathology.
- Amplitude ratio (V/I): Should be ≥1.0; highly variable, not a primary ABR measure.
- Morphology: Poor waveform morphology associated with high-frequency sensorineural hearing loss.
ABR findings by type of hearing loss:
| Condition | Pattern |
|---|
| Normal hearing | All parameters within normal limits |
| Conductive hearing loss | Delayed absolute latencies (all waves shift equally); interpeak intervals normal; normal morphology |
| Sensory (cochlear) loss | Wave I diminished or absent; delayed absolute latencies; poor morphology; interpeak intervals normal |
| Neural (retrocochlear) loss | Wave I normal latency; delay of all subsequent absolute latencies; prolonged interpeak intervals especially I-V |
Threshold ABR:
- Tone burst stimuli used for frequency-specific threshold estimation (500, 1000, 2000, 4000 Hz).
- ABR threshold correlates strongly with behavioral thresholds: click-ABR threshold shares ~89% variance with PTA at 2000-4000 Hz (mean difference ~1 dB).
- Tone burst ABR at 500 Hz and 1000 Hz also shares 86-89% variance with behavioral thresholds.
- Not affected by sleep or sedation - primary advantage over MLR and LLR for threshold testing in infants/difficult-to-test patients.
Diagnostic ABR:
- Highly sensitive to acoustic neuromas/vestibular schwannomas and brainstem disorders.
- Prolonged I-V interval or absent later waves with preserved Wave I suggests retrocochlear pathology.
- Stacked ABR: Modification that sums wave Vs from multiple frequency-band-filtered ABRs to enhance sensitivity for small eighth nerve tumors by assessing auditory function across the full frequency spectrum rather than just the high-frequency click region.
- Auditory Neuropathy Spectrum Disorder (ANSD): ABR is absent or severely abnormal despite present OAEs and CM. This dissociation is diagnostic.
Intraoperative monitoring: ABR monitored during posterior fossa surgery, acoustic neuroma resection, and other procedures near the auditory pathway.
Bone conduction ABR: Can be obtained using a bone vibrator on the mastoid to differentiate sensorineural from conductive components. Contralateral masking is required.
Chirp Stimulus ABR: A newer stimulus designed to compensate for cochlear travel time by using a frequency-sweeping tone (low frequency enters cochlea first, reaching apex simultaneously with high-frequency energy stimulating the base). This produces greater neural synchrony, larger amplitude responses, and improved threshold detection.
3. Auditory Steady-State Response (ASSR)
Also known as: 40-Hz response (at 40 Hz modulation rate), Amplitude Modulating Following Response (AMFR), Steady-State Evoked Potential (SSEP), Envelope Following Response (EFR), Frequency Following Response (FFR)
Latency / Recording approach: Unlike transient ABR, ASSR uses a continuous, amplitude-modulated stimulus rather than repeated transient clicks. The brain's neural generators respond at a rate equal to the modulation rate of the stimulus - the response is "phase-locked" to the modulation frequency.
Stimulus parameters:
- Carrier frequencies (CF): 500, 1000, 2000, 4000 Hz
- Modulation rates: typically ~80-100 Hz for infants (brainstem-dominant response, more resistant to sleep/sedation) or ~40 Hz for adults (more cortical contribution, better sensitivity at near-normal thresholds but strongly affected by sleep/sedation)
- Multiple simultaneous frequencies can be tested by assigning different modulation rates to different carrier frequencies - speeds up testing.
Neural generators:
- High modulation rate (~80 Hz): Response emanates primarily from auditory nerve and brainstem structures - similar to ABR generators. Relatively stable during sleep and sedation.
- Low modulation rate (~40 Hz): Response emanates from more central/cortical structures. Significantly affected by sleep and sedation.
Analysis: Unlike ABR (visually identified peaks), ASSR is analyzed using objective statistical algorithms (spectral analysis - the response is identified as a statistically significant peak at the modulation frequency in the EEG frequency domain). This removes clinician subjectivity.
Clinical advantages over tone-burst ABR:
- Threshold estimation at 4 frequencies simultaneously (faster)
- Automated, objective analysis
- May better estimate severe-to-profound hearing loss (ABR may underestimate severe losses as wave V can appear at very low intensities even in profound loss; ASSR floor is lower)
- May be less likely to underestimate hearing loss in steeply sloping audiograms
Disadvantages:
- More affected by arousal state than ABR (80 Hz rate is preferred for sleeping infants)
- Bone conduction ASSR is technically difficult and may require masking
- Risk of overestimating thresholds at near-normal levels
- Cannot reliably distinguish profound hearing loss from auditory neuropathy
- Normative data still expanding
Clinical applications:
- Primary tool for frequency-specific threshold estimation in infants under 6 months and difficult-to-test children
- Assessment of severe-to-profound hearing loss (for cochlear implant candidacy)
- Used alongside ABR for hearing threshold in pediatric populations
4. Middle Latency Response (MLR)
Latency: 10-50 ms post-stimulus
First reported by: Geisler and colleagues
Neural generators (thalamocortical system):
- Na (~12-21 ms): Possibly thalamus (medial geniculate body); also affected by midbrain lesions
- Pa (~21-40 ms): Primary auditory cortex (measured over temporal lobe), subcortical generator (measured at midline)
- Nb: Follows Pa
- Pb (~50 ms): Follows Nb; difficult to elicit reliably; not routinely measured
MLR generators span the medial geniculate body, thalamocortical connections, auditory cortex, and reticular formation. It is the most difficult AEP to record clinically.
Waveform characteristics:
- Larger amplitude and longer latency than ABR
- The consensus is that the responses are primarily neurogenic (not myogenic as was historically thought), though myogenic potentials within the Na-Pa latency range can distort results.
- Clinically measured indices: latency of Na, Pa; peak-to-peak Na-Pa amplitude. Pb is generally too variable for routine measurement.
State effects: MLR is significantly affected by sleep, sedation, and alertness. Acquiring during sleep can reduce or abolish the Pa complex, limiting its use for objective threshold testing in sedated patients - a key disadvantage compared to ABR.
Clinical applications:
- Neurological dysfunction of the CANS: Lesions near primary auditory cortex affect Pa morphology (especially measured over the affected hemisphere). Brainstem lesions affect Na. MLR abnormalities are actually more common with brainstem lesions than cortical lesions, likely because large brainstem lesions denervate thalamic projection nuclei.
- Central Auditory Processing Disorder (CAPD): Dysfunction of CANS contributing to listening difficulties, particularly in background noise. Commonly assessed with MLR alongside ABR.
- Auditory neuropathy evaluation
- Non-organic (functional) hearing loss
- Lesion localization at thalamocortical level and primary auditory cortex
- Frequency-specific auditory sensitivity up to cortical level
- Cochlear implant effectiveness (evaluating stimulation adequacy)
- Binaural hearing research - MLR believed useful for auditory language functioning data
Electrode placement: Non-inverting at Cz or C3/C4; inverting at mastoids; ground below Fz.
5. Auditory Late Response (ALR) / Late Latency Response (LLR)
Also called: Cortical Auditory Evoked Response (CAER), Cortical Electric Response Audiometry (CERA), N1-P2 complex
Latency: 50-250 ms post-stimulus
Neural generators (cortical):
- P1 (~50-80 ms): Primary or secondary auditory cortex
- N1 (~100 ms): Primary auditory cortex and adjacent association areas
- P2 (~150-200 ms): Primary or secondary auditory cortex
- N2 (~200 ms): Cortical (see also N200 section below)
Waveform: Large amplitude, well-defined cortical response. Larger and easier to detect near threshold than ABR, but requires the patient to be awake - a significant limitation.
State effects: Like MLR, ALR is significantly affected by sleep and arousal. It can only be reliably recorded in stage 2 sleep or REM; decreases with drowsiness and is absent in deeper sleep. Also decreases with aging.
Key component - P1 Cortical AEP Biomarker:
- The P1 response (latency ~100 ms in normal-hearing young children) serves as a biomarker for auditory cortex maturation. In children with hearing loss who receive hearing aids or cochlear implants, P1 latency is monitored to track whether auditory cortical development is progressing normally with amplification. Children with age-appropriate P1 latencies tend to have better speech/language outcomes.
- Used clinically to monitor efficacy of intervention in children with hearing aids or cochlear implants and those with multiple disabilities.
MMN is a component within this latency range - see separate section below.
Clinical applications:
- Threshold estimation in adults (complementing ABR; larger amplitude makes near-threshold responses more easily identified)
- Neurological dysfunction assessment of primary auditory and association cortex
- Cortical plasticity monitoring (P1 biomarker for pediatric auditory habilitation)
- CAPD assessment
- Used in combination with ABR and ASSR for comprehensive hearing evaluation
6. Mismatch Negativity (MMN)
Latency: 100-250 ms post-stimulus onset (peaks typically 150-200 ms)
Type: Endogenous event-related potential
Paradigm: The oddball paradigm - a series of repetitive "standard" stimuli is interspersed with occasional "deviant" stimuli that differ in frequency, intensity, duration, or other acoustic properties. MMN is the brain's automatic response to detecting the deviant. Crucially, it does not require attention - it can be recorded even when the patient is distracted, reading, or asleep (in stage 2/REM sleep). The MMN is extracted by subtracting the response to standard stimuli from the response to deviant stimuli (subtraction technique).
Neural generators (cortical):
- Primary generator: Supratemporal plane of primary auditory cortex (AI) / Heschl's gyrus
- Subcortical and primary cortical auditory regions also contribute
Mechanism: Reflects the brain's automatic "echoic memory" or prediction system - the auditory cortex maintains a short-term memory trace of the repetitive standard, and when a deviant is detected, a mismatch signal is generated.
Clinical applications:
- Speech perception assessment - tests discrimination of phoneme contrasts (e.g., /ba/ vs /da/)
- Central auditory processing disorders - assesses pre-attentive auditory discrimination
- Dyslexia: Dyslexics show reduced MMN to rapid speech changes
- Language impairment and SLI (Specific Language Impairment): Reduced MMN to speech contrasts is a marker
- Psychiatric biomarker: MMN is widely studied as a biomarker in schizophrenia (consistently reduced) and psychosis
- Cochlear implant outcomes - assesses benefit of implantation on discrimination
- Disorders of consciousness (coma, vegetative state) - presence of MMN indicates preserved auditory discrimination
- Infant hearing and language development research (does not require behavioral response)
Note: MMN is primarily a research tool but has expanding clinical applications.
7. P300 (P3)
Latency: ~300 ms post-stimulus (range: ~250-500 ms; prolongs with age)
Type: Endogenous event-related potential
Paradigm: Oddball paradigm - patient must attend to and detect a rare "target" (oddball) stimulus embedded in a series of frequent "standard" stimuli. Unlike MMN, active attention is required - the patient must count, press a button, or otherwise respond to the target.
Two subcomponents:
- P3a (~250-280 ms): Frontal distribution; reflects involuntary attention to novel stimuli; does not require task relevance
- P3b (~300 ms+): Classic P300; parietal-dominant; reflects cognitive decision-making, memory updating (context updating), and stimulus categorization. This is the standard clinical "P300."
Neural generators: Distributed network including auditory regions of hippocampus in medial temporal lobe, plus prefrontal cortex, thalamus, and posterior parietal cortex. No single generator - reflects widespread neural activity.
Clinical applications:
- Cognitive assessment and dementia: The most extensively used AEP to study age-related declines in central processing. P300 latency prolongs with advancing age, even in healthy adults.
- Alzheimer's disease: Current active research area - P300 latency is typically prolonged in Alzheimer's and other dementias.
- Auditory temporal processing and hemispheric asymmetry
- Assessing higher-level changes in cognition and memory
- Brain-computer interface (BCI) - P300 used in BCI systems for severely motor-disabled patients
- Psychiatric disorders (schizophrenia - reduced amplitude and prolonged latency)
- Can be recorded with speech stimuli for more ecologically valid assessment
Key point: Prolonged P300 latency = abnormal; however, latency prolongation is also observed in normal aging, so age-matched norms are required.
8. N200 (N2)
Latency: ~200 ms post-stimulus
Type: Endogenous event-related potential; late cortical response
Description: N200 refers to a family of negative-going potentials occurring at approximately 200 ms. It overlaps with the N2 component of the long-latency ALR and also relates to cognitive processing. It is classified within the long-latency response (P1-N1-P2-N2 complex) at its earliest and within ERPs at its longest latencies.
Subcomponents:
- N2a (MMN): Pre-attentive mismatch response (~150-200 ms); overlaps with MMN
- N2b (~200-275 ms): Attention-dependent; elicited when the subject detects a rare, attended target; associated with conscious detection of deviance; has a frontocentral distribution
- N2c: Target categorization; associated with discrimination of target from non-target
Generators: Frontal lobe and anterior cingulate cortex contributions; also temporal auditory areas.
Clinical relevance:
- Reflects discrimination and decision-making processes
- Studies in Central Auditory Processing Disorders
- Research in schizophrenia (abnormal N2 components)
- Part of the P300 complex - N2 typically precedes the P300 peak in oddball tasks
9. N400
Latency: ~400 ms post-stimulus (range: 300-500 ms)
Type: Endogenous event-related potential
Description: A negative-going deflection peaking at approximately 400 ms, with centro-parietal scalp distribution. First described by Kutas and Hillyard (1980).
Paradigm: Elicited when an upcoming word is semantically unexpected, inappropriate, or poorly integrated with the preceding context. Classic example: "I take my coffee with cream and dog" - the word "dog" elicits a large N400. Importantly, N400 also occurs for words that are semantically unexpected but not errors (e.g., "Do not touch the wet dog" when "paint" is expected).
Functional significance:
- Reflects lexical retrieval, semantic integration, and context processing
- Larger N400 = more difficulty integrating the word into semantic context
- Occurs regardless of whether stimuli are presented auditorily or visually
Auditory applications:
- Assessment of speech and language comprehension
- Cochlear implant outcomes - monitoring semantic processing after implantation
- Alzheimer's disease: Abnormal N400 responses to speech have been documented
- SLI and language disorders - reduced or absent N400 suggests impaired semantic processing
- Research in auditory scene analysis and speech perception in noise
10. P600
Latency: 500-1000 ms post-stimulus (peaks typically 500-800 ms)
Type: Endogenous event-related potential
Description: A large positive peak at 500-800 ms, with centro-parietal and sometimes frontal distribution.
Paradigm: Elicited in response to syntactic violations in sentences (e.g., subject-verb agreement errors: "The boy walk to school") and syntactically complex sentences.
Functional significance:
- Associated with syntactic reanalysis, sentence repair, and grammatical processing
- Some studies link it to more general semantic integration processes
- Reflects higher-order language processing beyond N400 (which is semantic)
Auditory applications:
- Cochlear implant outcomes - monitoring post-implant language processing
- Language acquisition and development research
- Assessment of sentence-level language processing in clinical populations
11. Cervical VEMP (c-VEMP)
Full name: Cervical Vestibular-Evoked Myogenic Potential
Latency of response components:
- P1 (P13): ~12 ms (SD ±2.5 ms)
- N1 (N23): ~19 ms (SD ±1.5 ms)
- P1-N1 peak-to-peak amplitude: 16-179 µV (click); 15-337 µV (tone burst)
Anatomical pathway:
Saccular macula → Afferent inferior vestibular nerve → Brainstem vestibular nuclei → Descending medial vestibulospinal tract → Motor neurons of the sternocleidomastoid (SCM) muscle
c-VEMP evaluates: The vestibulo-collic reflex and specifically tests the functional integrity of the saccule and inferior vestibular nerve.
Mechanism: High-intensity sound activates the saccule (the saccule is the most acoustically sensitive otolith organ). This generates a brief inhibitory response (relaxation) in the tonically contracting SCM. The response is therefore a reduction in EMG activity, not an excitation.
Stimulus: Repetitive clicks or tone bursts at 500 Hz or 1000 Hz, presented at high intensity (typically 90-100 dB nHL). Presented ipsilaterally.
Patient positioning: The SCM must be tonically activated for the response to be elicitable:
- Supine head elevation (bilateral response): Patient raises head against gravity while lying supine
- Head rotation (unilateral response): Patient turns head away from stimulus ear while seated upright
Asymmetry ratio: Normal 0-~40%. Calculated as: 100 × (A_larger - A_smaller) / (A_larger + A_smaller)
Factors that affect/abolish c-VEMP:
- Middle ear pathology (even a 5 dB air-bone gap can obliterate response - requires intact middle ear)
- Background SCM activation level
- Muscle relaxants (valium)
- Spinal cord injuries, muscular atrophy
- Age >60 years (may be absent in normal elderly)
- NOT significantly affected by sensorineural hearing loss
Clinical applications and disease patterns:
| Condition | c-VEMP Finding |
|---|
| Meniere's disease | Absent during attacks; reduced or enhanced amplitudes; current research shows abnormally large amplitude in affected ear with asymmetry |
| Superior Canal Dehiscence (SCD) | Enhanced amplitudes, lower-than-normal thresholds, present c-VEMP with air-bone gap |
| Vestibular neuritis | Absent or reduced amplitudes; may recover in some patients within 6 months-2 years |
| Acoustic neuroma/vestibular schwannoma | Absent or reduced |
| Bilateral vestibular loss | Reduced or absent |
| Otosclerosis | Absent (abnormal middle ear) |
| Multiple sclerosis, brainstem stroke, spinocerebellar degeneration | Absent or delayed latencies |
| Migraine | Absent or reduced amplitudes; delayed latencies; usually unilateral |
12. Ocular VEMP (o-VEMP)
Full name: Ocular Vestibular-Evoked Myogenic Potential
Latency of response components:
- N1 (N10): ~10 ms median latency
- P1 (P15-17): ~15-17 ms median latency
- Amplitude: ~5 µV ±3 (very small compared to c-VEMP)
Anatomical pathway: The end organ responsible is primarily the utricle (though debate continues, especially for air conduction stimulation). oVEMPs elicited by bone conduction or vibratory stimulus are generally accepted as reflecting utricular function. Pathway: Utricular macula → Superior vestibular nerve → Brainstem → Contralateral inferior oblique muscle (via the crossed vestibulo-ocular reflex)
o-VEMP evaluates: The functional integrity of the utricle and superior vestibular nerve. It is currently the only test that can specifically assess utricular function. Complements c-VEMP which tests the saccule.
Key difference from c-VEMP: o-VEMP is recorded from the contralateral inferior oblique muscle (under the eye, contralateral to the stimulated ear). The patient looks upward (approximately 30 degrees above midline) to bring the inferior oblique closer to the infraorbital electrode.
Clinical applications:
The power of o-VEMP lies in interpretation together with c-VEMP and bithermal calorics:
| Pattern | Interpretation |
|---|
| Normal calorics + Normal c-VEMP + Abnormal o-VEMP | Utricular impairment |
| Abnormal o-VEMP + abnormal c-VEMP | Suggests widespread otolith/vestibular dysfunction |
| SCD | Enhanced o-VEMP responses (low threshold, high amplitude) - complementary to c-VEMP findings |
| Vestibular schwannoma | Absent or reduced o-VEMP |
Note: o-VEMP is more recently developed than c-VEMP and is still gaining wider clinical use. Standardization of stimuli, responses, and equipment is ongoing. It represents an important extension of the vestibular test battery to include otolith organ function beyond the semicircular canal-focused ENG/caloric tests.
Additional AEPs Not Listed (Completing the Picture)
A. Auditory Late Response (ALR) / N1-P2 / CERA
(Covered in section 5 above as "LLR" - this is the same entity with the full detail of P1, N1, P2, N2 components at 50-250 ms.)
B. P50 (P1 at ~50 ms) / Sensory Gating
The P50 is a positive component at ~50 ms (essentially the P1 component of the ALR). It is used in research paradigms assessing sensory gating - the brain's ability to suppress or filter repeated identical stimuli. In a "paired-click" paradigm, the P50 to the second click should be significantly smaller than to the first click (normal gating). Impaired sensory gating (P50 suppression deficit) is a well-documented biomarker for schizophrenia and is also investigated in PTSD and bipolar disorder.
C. Cortical Auditory Evoked Potential (CAEP) P1 in Children
As noted above, the P1 CAEP (the dominant obligatory cortical response in infants and young children) at ~100 ms serves as a cortical maturation biomarker for children with hearing loss receiving intervention (hearing aids or cochlear implants). Children who show P1 latencies in the age-appropriate range for hearing children have better outcomes.
D. P3a (Novel P3)
The P3a (~250-280 ms) reflects the involuntary orienting response to novel or surprising stimuli. Frontally distributed. It does not require active task performance (unlike the classic P3b/P300). Used in research for attention-deficit disorders, frontal lobe dysfunction.
E. Auditory Change Complex (ACC)
A cortical AEP (also called the change-N1 or change-complex) evoked by a change in the acoustic properties (e.g., frequency, intensity) of an ongoing stimulus. The ACC includes N1-P2 morphology time-locked to the change. It has been proposed as a tool for hearing threshold estimation within ongoing speech sounds (more ecological validity). Research application - not yet widely clinical.
F. Sustained Cortical Potential
A slow negative cortical potential sustained during the presence of an ongoing stimulus. Related to attention and expectancy.
Summary Table: All AEPs at a Glance
| AEP | Latency | Generator | State-Dependent | Key Clinical Use |
|---|
| ECochG (CM, SP, AP) | 0-5 ms | Cochlea, distal CN VIII | No | Meniere's, ANSD, intraop monitoring |
| ABR | 0-10 ms | CN VIII to inferior colliculus | No | Threshold in infants, acoustic neuroma, ANSD |
| ASSR | Steady-state | Brainstem (80 Hz) / Cortex (40 Hz) | Partial (80 Hz less affected) | Frequency-specific threshold, severe HL |
| MLR | 10-50 ms | Thalamus, primary auditory cortex | Yes (sleep affects) | CANS lesions, CAPD |
| LLR/ALR (N1-P2) | 50-250 ms | Primary auditory cortex | Yes (requires wakefulness) | Threshold in adults, cortical plasticity |
| P1 CAEP (pediatric) | ~100 ms | Auditory cortex | Yes | CI/HA outcome monitoring, cortical maturation |
| MMN | 100-250 ms | Heschl's gyrus, auditory cortex | No (pre-attentive) | Speech discrimination, CAPD, schizophrenia |
| P300 (P3b) | ~300 ms | Hippocampus, parietal cortex | Yes (attention needed) | Cognitive assessment, dementia, aging |
| N200 (N2) | ~200 ms | Frontal/temporal cortex | Yes (attention needed) | Discrimination, CAPD, psychiatric |
| N400 | ~400 ms | Temporal/parietal cortex | Yes (language task) | Semantic processing, CI outcomes, SLI |
| P600 | 500-800 ms | Parietal/frontal cortex | Yes (language task) | Syntactic processing, CI outcomes |
| c-VEMP | P112ms, N119ms | Saccule → SCM via inferior vest. nerve | No (muscle tension needed) | Meniere's, SCD, vestibular neuritis |
| o-VEMP | N110ms, P115ms | Utricle → inferior oblique via superior vest. nerve | No (upward gaze needed) | Utricular function, SCD, schwannoma |
| P50 | ~50 ms | Auditory cortex | Yes | Sensory gating (schizophrenia biomarker) |
Sources:
- Scott-Brown's Otorhinolaryngology Head & Neck Surgery, Vol. 2 - Ch. 52 (Auditory Evoked Potentials)
- K.J. Lee's Essential Otolaryngology, 11th Ed. (ERA chapter)
- Shambaugh Surgery of the Ear (AEP chapter)
- Cummings Otolaryngology Head and Neck Surgery (Diagnostic Audiology chapter)
- Springer Egyptian Journal of Otolaryngology (N400, P600, MMN in CI outcomes)