Mismatch negativity is an event-related potential elicited when a sequence of unattended repetitive sounds is interrupted by a deviant stimulus (1). It is an electrophysiological correlate of sensory memory representation and can be observed even under distracted conditions. This preattentive response is generated at the level of the planum temporale and is likely to reflect N-methyl-d-aspartic acid (NMDA) channel current influx in cortical layers II and III as found in extracellular recordings in the monkey (2). In humans, a diminished mismatch negativity response following the administration of ketamine (an NMDA antagonist) has been described (3), consistent with NMDA/glutamate involvement in mismatch negativity generation.
The mismatch negativity response to duration deviants, usually recorded with EEG, is consistently reduced in patients with schizophrenia (4–7). In some studies of pitch deviants, this abnormality was not observed (8, 9) (for a review see reference 10). This impairment of sensory memory formation has also been described in unmedicated patients (4), first-degree relatives (11, 12), and in high-risk groups (13). Several studies have reported correlations between mismatch negativity amplitude and negative symptoms (4, 14), and it appears that the mismatch negativity response is particularly reduced over the left hemisphere in patients with schizophrenia (15–17). The mismatch negativity reduction seems not to be ameliorated by either typical (18) or atypical neuroleptics (19).
Mismatch negativity response in schizophrenia is of particular interest given the well-replicated findings of structural superior temporal gyrus abnormalities (20, 21) and the involvement of this brain area in core symptoms of the disease such as hallucinations (22, 23) and formal thought disorder (24). Furthermore, NMDA is suspected to be involved in the pathophysiology of schizophrenia (25). However, little is known about the mechanisms of decreased mismatch responses in schizophrenia, their etiological significance, or their precise locus of generation.
Functional magnetic resonance imaging (fMRI) allows precise localization of cerebral correlates of perceptual and cognitive operations. However, the noise generated by the scanner limits its application in the investigation of auditory processing. To avoid these shortcomings, we developed a method to record the hemodynamic analogue of mismatch responses by means of fMRI using the gradient switching noise as the sound source (26). The advantage of this technique is—besides its ease of use without external stimulation devices—that the sound source is part of the environment so that unattended processing prevails. This stimulation procedure does not yield the frontal activation (26) that suggests attentive processing as observed with other fMRI or positron emission tomography techniques. Moreover, the scanner noise can be recorded and applied within whole-head magnetoencephalography (MEG) to obtain comparable electrophysiological data.
The present study sought to further clarify the alteration of mismatch negativity response in patients with schizophrenia. Preattentive neuromagnetic and hemodynamic responses to amplitude and duration deviants within a series of standard noise bursts were recorded in a group of schizophrenia patients and comparison subjects. In a pilot study employing a similar paradigm in healthy subjects, we found lateralization to the right superior temporal plane only for the duration mismatch condition in MEG and fMRI (26). On the basis of these and findings with EEG (15–17), we hypothesized altered lateralization patterns on the level of the superior temporal plane in the patients for deviant duration stimuli.
Twelve inpatients with DSM-IV schizophrenia, all receiving stable doses of antipsychotic medication, were recruited from the Department of Psychiatry, University of Tübingen, Germany. Diagnosis was established by two independent psychiatrists through a clinical interview and review of the hospital charts. As seen in t1, a comparison group of healthy volunteers was matched with the patient group for sociodemographic and cognitive variables as well as nicotine use (per the Fagerstrom Index [27]). All subjects were right-handed (28). Only subjects with normal hearing per an audiometry test were included. Immediate memory recall and attention were assessed on the day of scanning by using the Digit Span (29) and the Continuous Performance Test (30). Subjects were clinically rated (by A.R.) with the Positive and Negative Syndrome Scale (31) on the day of scanning before collection of imaging data. Permission for the study was obtained from the local ethical committee. After complete description of the study to the subjects, written informed consent was obtained. Patients and comparison subjects received compensation (8 Euro/hour) for participating in the study.
+
Measurement of Magnetic Field Responses (MEG)
Auditory evoked magnetic fields were recorded by means of a 151-channel whole-head MEG (CTF System Inc., Vancouver, Canada) within an electromagnetically shielded room (sampling rate=250 Hz, anti-aliasing filtering=80 Hz). Recorded gradient switching noises served as auditory stimuli (duration=152 msec, amplitude about 80 dB). Two sessions comprising 450 sweeps each (sweep length=800 msec, prestimulus baseline=148 msec, interstimulus interval=810 msec) included 20% randomly interspersed deviants (about 90 of the 450 sweeps). The deviant stimuli were those with amplitude or duration reductions of 9 dB and 76 msec, respectively, which occurred with equal probability. During the entire session, subjects watched silent movies (Charlie Chaplin). They were naive with respect to the experimental paradigm.
Data analysis was fully scripted and automatized to prevent observer errors. Equivalent eye dipoles were used to control for artifacts (symmetric two-dipole model) by eliminating sweeps with eye-blinks characterized by moments >60 nAm. The remaining sweeps obtained from each subject were averaged. Source analysis relied on a simple-sphere head model comprising two tangential dipole currents minimizing residual variance. Dipole fitting was performed at the time point of maximum global field power within a time domain extending from 50 to 96 msec after stimulus onset. Calculation of the individual lead-field functions relied on dipole position and direction. These two linear combinations of the 151 channels maximize the signal-to-noise ratio of current estimation for the respective auditory cortex. The two components provide a separate measure of the time course of dipole strength in both hemispheres (32, 33). Mismatch responses were calculated as the difference waves between averaged responses to the deviants, on the one hand, and the last stimulus of the preceding series of standards, on the other. The last stimulus preceding the infrequent one was used because it is the "most frequent" one and produces the most reliable response. In order to minimize noise, data were weighted by using prestimulus variance as a reference. The rationale for this technique is given in detail elsewhere (34). Quantitative estimates of mismatch fields were obtained within a time window derived from the average mismatch time course. The center of gravity of the computed difference curves between 140 and 280 msec served as a measure of mismatch field latency.
MEG fields represent the sum of multiple signal and noise sources at separate locations. Consequently, intrasubject variability was considered an additive error term allowing for robust linear statistics. In each subject, the probability of the null hypothesis was derived by bootstrap procedures based on the corresponding regression model (35). To that means, the regression coefficients were obtained for 999 independent drawn samples to approximate to empirical distribution of the estimator (35). The probability of this distribution under the null hypothesis was expressed as a z score for each individual. The latter values were averaged across group to test whether the null hypothesis for all subjects could be rejected. Thus, no variance measure was required, avoiding sensitivity to outliers in small sample sizes. Moreover, relative to variance analytic methods, this nonparametric analogue of an analysis of variance procedure exhibits faster convergence and does not presuppose identical distribution families across subjects.
A lateralization score from the interval (–1, 1) was evaluated for each subject as (L–R)/(|L|+|R|), with L and R being the quantified mismatch responses over the left and right hemisphere, respectively.
The groups were compared with respect to activity summed over both hemispheres. This measure is comparable to frontally detected EEG mismatch negativity and is held to reflect the well-known finding of reduced mismatch reactions in schizophrenia (36). In addition, whole-head MEG allowed study of the lateralization score for each subject. In a bootstrapping procedure, the average of the patients was tested to differ from that of the comparison group. Data analysis was performed with Matlab 5 (Mathworks Inc., USA), using a significance level of p<0.05.
+
Measurement of Hemodynamic Responses (fMRI)
Magnetic resonance imaging was performed with a 1.5-T clinical MR scanner (Sonata, Siemens, Germany). First, a T1-weighted anatomical image (three-dimensional fast low-angle shots) was obtained for each subject. Subsequently, functional imaging was conducted. A single-shot, multi-echo echo planar imaging sequence applied a 64×64 matrix, achieving a readout time of 38 msec per echo (TE=26, 64, 120, and 140 msec, nonlinear sampling on sinusoidal gradient ramps, bandwidth=50 kHz, field of view=210×210 mm2, slice thickness=5 mm, slice gap=1 mm, voxel size=3.3×3.3×6 mm3). Acquisition of four echoes per slice (duration=152 msec) represented the frequent events, occurring at a stimulus onset asynchrony of 800 msec. The strength of the imaging gradients was lowered by a factor of 0.35 for amplitude deviants (–9 dB sound pressure level, 7.5% of the events). The sound of the encoding of two instead of four echoes yielded a duration mismatch (–76 msec, 7.5% of the events) (F1). Sound pressure level of the gradient switching noise amounted to 94.5 dB at the head (about –28 dB dampening due to standard noise-canceling headphones). Four imaging slices were acquired covering the supratemporal plane and Heschl’s gyrus in parallel orientation to the Sylvian fissure. The number of stimuli (number of slice acquisitions) amounted to 512, i.e., 128 volumes with four slices each (TR=3,200 msec). Slices encoded during deviant events (15%) were discarded from statistical analysis. A multi-echo sequence was applied 1) to obtain acoustic stimulation of a suitable duration and 2) to increase the sensitivity to the blood-oxygen-level-dependent (BOLD) effect (37). As with the MEG experiment, subjects watched a silent movie and were naive with respect to the acoustic stimulation.
Analysis of the multi-echo MR data required preprocessing before application of standard statistical parametric mapping (SPM 99 [38]) procedures. Differential weighting of the four successive echoes optimized signal-to-noise or contrast-to-noise ratio, respectively. In order to maximize signal-to-noise ratio, each of the acquired echoes (i=1,…4) was weighted before averaging as follows: (exp[–TEi/80 msec])2, where TEi=echo time of ith echo. These images will be referred to as signal-to-noise ratio images. For optimal contrast-to-noise ratio of the BOLD effect, the echoes were averaged using the weight factors (TEi × exp[–TEi/65 msec])2 (contrast-to-noise ratio images). Realignment and normalization of images were performed by using the parameter estimates obtained from the signal-to-noise ratio images. Motion correction was restricted within the acquisition plane (39). Statistical parametric maps were obtained by applying a general linear model on the signal intensities of the normalized and smoothed (12 mm full width at half maximum Gauss-kernel) contrast-to-noise ratio images. The convolution of the bi-gamma function and the indicator function of the respective rare events served as reference vector for the paradigm, i.e., the modeled hemodynamic response function (F1). Analysis was repeated for each slice separately applying the respective time delay. Then the calculated statistical maps were combined.
Regions of interest were defined on the anatomical template (Talairach space) in both the primary auditory cortex (±45, –16, 4 mm, Brodmann’s area 41) and secondary auditory cortex (±63, –21, 6 mm, Brodmann’s area 42). Percent signal changes were analyzed due to differing signal-to-noise ratios in the two groups (40). Lateralization scores were the differences between left and right hemispheric region of interest, weighted by the variance in the secondary auditory cortex. To maintain linear statistics, the difference measures were not normalized to the summed responses.
Statistics followed analysis of variance in a repeated-measure design. Statistical maps were corrected for multiple testing across the volume based on the Gaussian field theory (38) (p<0.05). Testing across the regions of interest was Bonferroni corrected (p<0.05).
Between-group differences were tested by using a t test (p<0.05, volume corrected).
+
Magnetic Field Responses
One patient failed to complete the functional imaging sessions and was excluded from the data analysis. The remaining 11 patients participated in both MEG sessions. Dipole fits did not differ in P50m amplitude or lateralization between groups. In seven out of the 11 patients and 11 out of the 12 comparison subjects, a significant mismatch field was detected at the single-subject level (cutoff at p<0.05). Across both groups, deviants of amplitude (–9 dB) and duration (76 msec shorter) elicited reliable mismatch responses over both hemispheres, achieving high significance levels in both groups (t2 and F2).
As depicted in F3, a significant side difference emerged only in the comparison group, and only for duration deviants, as a right-hemispheric enhancement (–3.3 nA; z=3.0, p<0.001).
The mismatch responses summed across both hemispheres proved to be smaller in the patients for both amplitude deviants (z=2.5, p<0.05) and duration deviants (z=3.1, p<0.05). The lateralization scores (L–R)/(|L|+|R|) did not differ between the groups for amplitude deviants (mean score 0.03 versus 0.04) (z=6.3, p>0.20). As the most important finding, response to duration deviants was significantly more lateralized for the comparison subjects than for the schizophrenia patients (0.19 versus –0.16) (z=2.2, p<0.05).
In both groups, the changes of amplitude and duration of the gradient switching noise elicited reliable hemodynamic responses in both hemispheres in the primary and secondary auditory cortex. A direct comparison of statistical parametric maps (volume corrected, extent ≥15 voxels) revealed a stronger response in the comparison group in the left auditory cortex for amplitude deviants (p<0.05) but not duration deviants (F4).
Lateralization scores were derived from region-of-interest analyses in the primary and secondary auditory cortex (F5). Amplitude changes elicited larger hemodynamic responses at the left side in both the schizophrenia patients (z=4.4, p<0.001) and comparison subjects (z=3.5, p<0.001). As a main result, the duration deviants achieved a right hemispheric advantage only in the comparison group (comparison subjects: z=18.7, p<0.001; schizophrenia patients: z=0.8, p=0.22).
Stronger lateralization patterns emerged in the patient group than in the comparison subjects for amplitude deviants (i.e., larger left-to-right scores in the patients than in the comparison subjects: z=2.5, p<0.01). No significant contrast was apparent for duration deviants, although the comparison subjects tended to have larger right-to-left scores than did the schizophrenia group (z=1.2, p=0.11).
This study employed a novel design in a combined fMRI and MEG setting to compare cerebral responses to deviant auditory stimuli in patients with schizophrenia and matched comparison subjects. To circumvent the interfering background sound of the MRI scanner, its gradient switching noises were used as stimuli. The noise was recorded and applied in a whole-head MEG device. In the comparison group, the deviant stimuli evoked reliably right-lateralized mismatch MEG responses. The patients exhibited decreased responses and failed to show a lateralization effect in line with our hypothesis. In the fMRI setting, duration deviants elicited right lateralized signal increases in the planum temporale in the comparison subjects. The patients, in contrast, exhibited symmetric signal changes. The results of this study confirmed our earlier findings in the comparison subjects (26), and the patient data confirms our hypothesis of dysfunctional lateralization in this group.
The groups in our study were carefully matched for possible confounding effects such as age, gender, and nicotine use. Previous event-related potential studies have reported effects of smoking on P50 and P300 (41, 42), although influences on N100 have not been reported. Neuroleptic medication does not seem to influence mismatch negativity response (18, 19). Because of the size of our study group, the influence of gender on lateralization could not be calculated.
Although our mismatch stimuli might be slightly different in interstimulus interval or duration from standard EEG paradigms, they reliably evoked the well-known neuromagnetic responses in different control populations (26). Corroborating earlier findings of EEG studies (36, 43), the patients showed reduced mismatch fields to the duration as well as amplitude deviants. We could extend these findings and demonstrate that the decrease is at least in part explained by the missing right-hemispheric enhancement in the patients in the planum temporale, but not Heschl’s gyrus. In the only other MEG study investigating mismatch field in schizophrenia (16), such an altered lateralization was not present. However, those authors did not find the normal lateralization pattern to the right in comparison subjects. They used a MEG device that could record responses over one hemisphere only. The sequential comparison of the responses might increase variance and be influenced by a crosstalk from the unobserved auditory cortex. We used a whole-head scanner, which is more reliable in detecting lateralization differences.
Previously, it has been demonstrated that the mismatch negativity response has multiple generators within the superior temporal gyrus (44). The precise localization of its main component cannot be detected with current MEG methods. Using fMRI with its high spatial resolution, our results demonstrated that the hemodynamic equivalent of the mismatch field in the comparison subjects is in part located in the secondary auditory cortex (planum temporale), with a stronger signal change in the right superior temporal plane. Furthermore, a lateralization deficit similar to the mismatch field paradigm in the patients for the duration mismatch was detected. This finding stands in contrast to another fMRI study of mismatch negativity response in schizophrenia, in which standard oddball stimuli were presented via headphones (45). The authors found a smaller number of activated voxels in the superior temporal gyrus in the patients than in the comparison group in the mismatch but not the control condition. They did not find lateralization differences between groups, and the strongest activation was present in the Heschl’s gyrus (primary auditory cortex) in both groups. However, a confounding effect in their study may have been scanner noise, which is known to activate primary auditory regions (46, 47) and to mask the effects of stimulus properties. Another difference is that they employed blocks with deviants compared with blocks without deviants whereas we assessed directly the hemodynamic response to the single deviant events. Thus, lateralized abnormality might be masked by the difference in the activations to the blocks without deviants or specific to the distinct types of deviance detection.
The role of frontal areas in the generation of mismatch fields is still under debate. A number of studies found differences in mismatch responses between patients with schizophrenia and comparison subjects in the frontal (48–50), in the frontal and temporal (51, 52), or exclusively temporal (53–55) regions. Neither in our comparison nor in the patient group did we find responses in frontal areas. Differences in the technique applied and stimulus properties might be responsible for the different results in these studies. It has been proposed that the involvement of frontal areas in the generation of mismatch fields reflects attentive components (56, 57). The comparison with event-related potential recordings suggests that distinct networks participate in preattentive processing and target detection (58). Our paradigm allowed stimulation without attentive processing and thus little frontal contributions would be expected. Moreover, putative attentional deficits in patients would not have an influence in our paradigm. Thus, we focused on the auditory cortices, where we expected the most reliable differences between the groups. Further, our fMRI paradigm allowed only for a limited number of slices to be acquired, and the ventral regions of the frontal lobe were therefore not covered.
The BOLD response reflects differential hemodynamic activation integrated across seconds. Thus, neither early versus late components of a mismatch response, nor processing of a deviant versus its succeeding frequent stimulus, can be entirely separated. Amplitude changes modified responses in the primary auditory cortex due to deviant detection as well as adaptation effects, which cause primary, as well as secondary, auditory cortical responses. The effects of duration deviants on primary areas were minimal. Peak sound pressure level remains the same for the duration deviants whereas amplitude deviants may change the adaptation level. Therefore, the first louder sound may elicit adaptation phenomena in primary auditory areas. We suggest that an integration of activity in the primary and secondary (59) auditory cortices of both hemispheres is necessary for a normal mismatch response. A temporal desynchronization of the process is likely to underlie the deviant mismatch field response in our patients (60). The differences in mismatch fields and BOLD response between the groups in our study might be due to alterations in synaptic connections, pyramidal cell distribution, or connectivity between cortical columns. This might cause reduced stimulus-induced phase locking in the auditory cortex in the patients (61) and thus altered mismatch fields. The fMRI and MEG experiments yielded distinct effects for the topography and lateralization patterns. As a possible explanation, later components might be integrated into the BOLD response, whereas the mismatch field is restricted to the given temporal domain. Nonetheless, no differences emerged in the temporally resolved MEG for the response latencies between the groups. The different lateralization patterns in the fMRI and MEG data may thus reflect differences in temporal integration and synchronization processes between patients and comparison subjects.
Mismatch negativity elicitation is attributed to the presence of short-term memory traces. Templates are formed in the auditory cortex to encode the frequent stimulus elements. These memory traces usually fade within 5–10 seconds (62). For mismatch negativity to occur, an integrative process, consisting of an analysis of the afferent sound (deviant) and its comparison to the memory trace (frequent stimulus) has to occur (63). Impaired language processing in childhood is one of the few known risk factors for the later development of schizophrenia (64). Furthermore, some of the main symptoms in schizophrenia, such as formal thought disorder, auditory hallucinations, and concretism, involve language functions. We previously demonstrated a reversed lateralization pattern of cerebral activation in the superior temporal gyrus in patients with schizophrenia during processing of linguistic context (65) and production of fluent speech (66). The production of thought-disordered speech correlated negatively with signal changes in the posterior superior temporal gyrus (Wernicke’s area [24]).
In the current study, we have shown that language-related dysfunction in schizophrenia may be present already at the early stages of auditory processing of simple stimuli, and not only at stages involving higher-order semantic processing. This dysfunction involves an impaired lateralization of cortical activation in the secondary auditory cortex (planum temporale). Synchronized activity as measured with MEG was more affected than the metabolic changes measured by fMRI. Potential volume differences in the superior temporal gyrus between patient and comparison groups in our study were corrected by the fMRI analysis method. In line with our findings are results from studies using structural MRI. They found a reduction in the volume of the left planum temporale (secondary auditory cortex) but not Heschl’s gyrus (primary auditory cortex) in patients (67–69) (for review see references 70, 71). We therefore could provide a potential link between altered planum temporale morphology in schizophrenia and its functional consequences, i.e., disrupted preattentive acoustic processing.
In conclusion, we found altered neural responses to basic sound processing at the level of the planum temporale in a group of patients with schizophrenia in line with morphological changes in this region. Although the underlying pathology remains elusive, it might be related to deficient NMDA-dependent neurotransmission (3) and impaired synaptic connectivity (72) and thus reduced stimulus-induced phase locking of cortically evoked activity. The superior temporal gyrus is a crucial area in the generation of language-related symptoms in schizophrenia such as formal thought disorder (24) and auditory hallucinations (23), so this region warrants attention in future research.
Received April 14, 2003; revision received Aug. 5, 2003; accepted Aug. 7, 2003. From the University of Tübingen Departments of Psychiatry, Neuroradiology, and Neurology and the MEG Center and Institute of Medical Psychology and Behavioral Neurobiology, University of Tübingen. Address reprint requests to Dr. Kircher, Department of Psychiatry, University of Tübingen, Osianderstr. 24, D-72076 Tübingen, Germany; tilo.kircher@med.uni-tuebingen.de (e-mail). Supported by a grant from the University of Tübingen (fortuene number 727-0-0) and the German Research Foundation (DFG SFB 550). The authors thank Maike Borutta for technical assistance.