Evoked Potentials Laboratory, Behavioral Biology, Gutwirth Bldg., Technion-Israel Institute of Technology, Haifa 32000, Israel. hillel@tx.technion.ac.il
The purpose of the studies summarized in this report was to determine the time course of auditory cortex involvement in speech and language processing in the context of auditory object formation. Forty-one subjects took part in the three studies summarized in this report. In all three studies, subjects performed a choice-reaction task that required their pressing an appropriate button in response to auditory stimuli (speech/non-speech, good/worse fused phonemes, first/second language words) presented through earphones. Event-related potentials (ERPs) were recorded during performance of the task from 21 scalp electrodes, in addition to peri-ocular electrodes for monitoring eye movements. Current densities within the gray matter of the brain were estimated using the LORETA (low resolution electromagnetic tomography) method. In general, except for some periods, processing phonetic and linguistic information was associated with elevated activity in the left auditory cortex. Peaks in auditory cortex activation corresponded in time to scalp recorded peaks in the latencies of P1 and up to as late as P3. The adjacent posterior temporal areas showed a similar temporal pattern of activation, but tended to be less lateralized to the left, or even biased toward right hemisphere predominance, depending on the stimulus, particularly in the later time frames. The results indicate that the auditory cortex is engaged in auditory processing from its early stages and as long as a few hundreds of msec, even after cessation of the stimulus, defining sounds as distinct auditory objects and differentiating speech from non-speech material, relying on acoustic cues. Hemispheric dominance fluctuates to include activity in the 'non-dominant' hemisphere depending on stimulus type and stage of processing.
Mesh-terms: Acoustic Stimulation; Adolescent; Adult; Auditory Cortex :: physiology; Cues; Electroencephalography; Electromagnetic Fields; Electrophysiology; Evoked Potentials :: physiology; Female; Humans; Language; Laterality :: physiology; Male; Reaction Time :: physiology; Research Support, Non-U.S. Gov't; Speech Perception :: physiology; Tomography;
Latest citations:
Functional cerebral asymmetries (FCAs) are a fundamental principle of brain organization in many species. However, little is known about why they have evolved. Since FCAs are such a widespread phenomenon they seem to constitute an evolutionary selective advantage. According to a prominent hypothesis, an asymmetric brain should be associated with advantages in parallel processing, i.e. doing two tasks simultaneously. The strong version of this hypothesis implies that lateralized, instantaneous and complementary tasks are performed more efficiently with a highly lateralized brain. Using a visual half-field procedure, we wanted to test this strong version of the parallel-processing hypothesis in humans. Thirty-two participants (17 women, 15 men) were investigated. First, we assessed the degree of lateralization in a face/non-face and a word/non-word discrimination task favouring the right and left hemisphere, respectively. Based on a median split, subjects were divided into a rather symmetric and a rather asymmetric group. Then, all participants completed both tasks simultaneously. The results revealed that the rather symmetrically organized participants outperformed asymmetric participants in accuracy and response times. Hence, the strong version of the parallel-processing hypothesis has to be revised.
Institute of Psychology, Department of Neuropsychology, University of Zurich, Switzerland.
Synesthesia is defined as the involuntary and automatic perception of a stimulus in 2 or more sensory modalities (i.e., cross-modal linkage). Colored-hearing synesthetes experience colors when hearing tones or spoken utterances. Based on event-related potentials we employed electric brain tomography with high temporal resolution in colored-hearing synesthetes and nonsynesthetic controls during auditory verbal stimulation. The auditory-evoked potentials to words and letters were different between synesthetes and controls at the N1 and P2 components, showing longer latencies and lower amplitudes in synesthetes. The intracerebral sources of these components were estimated with low-resolution brain electromagnetic tomography and revealed stronger activation in synesthetes in left posterior inferior temporal regions, within the color area in the fusiform gyrus (V4), and in orbitofrontal brain regions (ventromedial and lateral). The differences occurred as early as 122 ms after stimulus onset. Our findings replicate and extend earlier reports with functional magnetic resonance imaging and positron emission tomography in colored-hearing synesthesia and contribute new information on the time course in synesthesia demonstrating the fast and possibly automatic processing of this unusual and remarkable phenomenon.
Evoked Potentials Laboratory, Technion-Israel Institute of Technology, Haifa, Israel (N.H., H.P.); and Department of Communication Disorders, Tel Aviv University, Tel-Aviv, Israel (T.M., N.H.).
OBJECTIVE:: To determine whether voicing perception is influenced primarily by linguistic experience or if it is due to innate temporal sensitivity to voicing boundaries, by examining behavioral and electrophysiological correlates of speech Voice-Onset-Time (VOT) and nonspeech Formant-Onset-Time (FOT) categorical perception. DESIGN:: Behavioral measures and auditory event-related potentials (ERPs) were obtained from 14 normal-hearing Hebrew speakers, whose voicing distinction is different than English, during identification and discrimination of two sets of stimuli: a VOT continuum, created by editing natural productions of /ba/ and /pa/, and an analogous nonspeech continuum, composed of two synthesized formants, varying in their onset time-FOT. RESULTS:: VOT and FOT continua yielded similar behavioral identification curves. Differences between the two stimulus types were found in discrimination of within-category differences and in reaction time effects. During identification and discrimination tasks, ERPs were differently affected by the VOT or FOT value of the stimulus: VOT value had a significant effect on N1 latency and on N1 and P2 amplitudes whereas FOT value had a significant effect on P2 amplitude. Additionally, during identification tasks, whereas all speech signals evoked a P3, regardless of overt categorization, only the perceptually "rare" nonspeech stimulus (+15 msec FOT) evoked a P3. CONCLUSIONS:: Voicing boundaries corresponded to Hebrew VOT values of production, suggesting that voicing perception in Hebrew is mediated mainly by linguistic experience rather than by innate temporal sensitivity. ERP data differed to VOT versus FOT stimuli as early as N1, indicating that brain processing of the temporal aspects of speech and nonspeech signals differ from their early stages. Further studies to establish the neural response patterns to voicing in speakers of languages that use different voicing categories than English are warranted.
Other papers by authors:
Evoked Potentials Laboratory, Technion-Israel Institute of Technology, Haifa, Israel (N.H., H.P.); and Department of Communication Disorders, Tel Aviv University, Tel-Aviv, Israel (T.M., N.H.).
OBJECTIVE:: To determine whether voicing perception is influenced primarily by linguistic experience or if it is due to innate temporal sensitivity to voicing boundaries, by examining behavioral and electrophysiological correlates of speech Voice-Onset-Time (VOT) and nonspeech Formant-Onset-Time (FOT) categorical perception. DESIGN:: Behavioral measures and auditory event-related potentials (ERPs) were obtained from 14 normal-hearing Hebrew speakers, whose voicing distinction is different than English, during identification and discrimination of two sets of stimuli: a VOT continuum, created by editing natural productions of /ba/ and /pa/, and an analogous nonspeech continuum, composed of two synthesized formants, varying in their onset time-FOT. RESULTS:: VOT and FOT continua yielded similar behavioral identification curves. Differences between the two stimulus types were found in discrimination of within-category differences and in reaction time effects. During identification and discrimination tasks, ERPs were differently affected by the VOT or FOT value of the stimulus: VOT value had a significant effect on N1 latency and on N1 and P2 amplitudes whereas FOT value had a significant effect on P2 amplitude. Additionally, during identification tasks, whereas all speech signals evoked a P3, regardless of overt categorization, only the perceptually "rare" nonspeech stimulus (+15 msec FOT) evoked a P3. CONCLUSIONS:: Voicing boundaries corresponded to Hebrew VOT values of production, suggesting that voicing perception in Hebrew is mediated mainly by linguistic experience rather than by innate temporal sensitivity. ERP data differed to VOT versus FOT stimuli as early as N1, indicating that brain processing of the temporal aspects of speech and nonspeech signals differ from their early stages. Further studies to establish the neural response patterns to voicing in speakers of languages that use different voicing categories than English are warranted.
Evoked Potentials Laboratory, Technion-Israel Institute of Technology, Gutwirth Building, 3200 Haifa, Israel.
OBJECTIVE: To compare the 'F(fusion)-complex' with the Mismatch negativity (MMN), both components associated with automatic detection of changes in the acoustic stimulus flow. METHODS: Ten right-handed adult native Hebrew speakers discriminated vowel-consonant-vowel (V-C-V) sequences /ada/(deviant) and /aga/(standard) in an active auditory 'Oddball' task, and the brain potentials associated with performance of the task were recorded from 21 electrodes. Stimuli were generated by fusing the acoustic elements of the V-C-V sequences as follows: base was always presented in front of the subject, and formant transitions were presented to the front, left or right in a virtual reality room. An illusion of a lateralized echo (duplex sensation) accompanied base fusion with the lateralized formant locations. Source current density estimates were derived for the net response to the fusion of the speech elements (F-complex) and for the MMN, using low-resolution electromagnetic tomography (LORETA). Statistical non-parametric mapping was used to estimate the current density differences between the brain sources of the F-complex and the MMN. RESULTS: Occipito-parietal regions and prefrontal regions were associated with the F-complex in all formant locations, whereas the vicinity of the supratemporal plane was bilaterally associated with the MMN, but only in case of front-fusion (no duplex effect). CONCLUSIONS: MMN is sensitive to the novelty of the auditory object in relation to other stimuli in a sequence, whereas the F-complex is sensitive to the acoustic features of the auditory object and reflects a process of matching them with target categories. SIGNIFICANCE: The F-complex and MMN reflect different aspects of auditory processing in a stimulus-rich and changing environment: content analysis of the stimulus and novelty detection, respectively.
Evoked Potentials Laboratory, Behavioral Biology, Gutwirth Building, Technion-Israel Institute of Technology, Haifa 32000, Israel. hillel@tx.technion.ac.il
OBJECTIVE: The purpose of this study was to define early brain activity associated with fusion of speech elements to form an auditory object in the middle-latency range preceding the F-Complex. METHODS: Stimuli were binaural formant transition and base, that were presented separately or fused to form the vowel-consonant-vowel sequence /ada/. Eleven right-handed, adult, native Hebrew speakers listened to 2/s presentations, and the brain potentials from C(z) during the 250 msec following transition onset (in the responses to transition and to the fused word) or following the time it would have been presented (in the response to base alone) were recorded. The net-fusion response was extracted by subtracting the sum of potentials to the base and the formant transition from the potentials to the fused sound. RESULTS: Auditory middle-latency components, comprising of 9 peaks and troughs were recorded in response to the base, to the formant transition and to the fused /ada/. In general, the responses to the fused object were significantly smaller in peak amplitude and in total activity (area under the curve) resulting in the difference waveform of the net-fusion response that also included 9 peaks, but with opposite polarities. CONCLUSIONS: The early middle-latency components to fusion indicate that the fusion of speech elements to a word involves inhibition, occlusion or both. The results are in line with the uniqueness of speech perception and the early role of the auditory cortex in speech analysis.
Evoked Potentials Laboratory, Technion-Israel Institute of Technology, Technion City, Haifa, Israel.
We recorded event related brain potentials to assess stages of linguistic processing of first (L1) and second (L2) language and of pseudowords when subjects were engaged in a different task and did not attend to the words. Young adults (n = 15) were presented with pairs of auditory stimuli consisting of words and pseudowords in L1 and L2 with different voice onset times (VOT), which served as distracters in a short-term memory task. ERPs were recorded from 11 scalp electrodes. The ERP peak amplitudes and latencies were subjected to analysis of variance for the effects of language, meaning and scalp location as well as priming of the second word in the pair by the preceding word. Behavioral results showed that attention was drawn to the primary task and away from the words; yet significant, including semantic, processing was evident in the ERPs to the words, with significant effects of language, meaning and priming. Even with barely any awareness of the stimuli, the brain processes words including distinguishing between L1 and L2 and relating to the stimuli's context.
Evoked Potentials Laboratory, Technion--Israel Institute of Technology, Gutwirth Building, 32000 Haifa, Israel.
OBJECTIVE: The purpose of this study was to examine the processing of auditory movement sensation accompanying duplex perception in binaural hearing. METHODS: Stimuli were formant transitions (presented to the front, left or right of the subject) and base (presented to the front), that fused to result in vowel-consonant-vowel (V-C-V) sequences /aga/ and /ada/. An illusion of auditory movement (duplex sensation) accompanied the fusion of these V-C-V sequences when the spatial locations of the formant transitions and base were different. Ten right-handed, adult, native Hebrew speakers discriminated each fused stimulus, and the brain potentials associated with performance of the task were recorded from 21 electrodes. The processing of auditory movement was studied by a factorial design (ANOVA) and statistical non-parametric mapping (SnPM) of low resolution electromagnetic tomography (LORETA) images of the net-fusion response. Brain regions implicated in auditory movement processing were expected to be associated with the lateralized formant location, which gave rise to duplex perception. In addition, the time-course of significant activation in brain areas that differentiated between fusion conditions was determined. RESULTS: The posterior parietal, anterior cingulate and premotor cortices were found to be implicated in duplex processing. Auditory cortex involvement was also evident, and together with the latter two brain regions was affected by right-ear advantage. CONCLUSIONS: Duplex perception resulting from fusion of spatially separate sounds forming an auditory object results in activation of a network of brain regions reflecting enhanced allocation of attention and the effect of language processing.
Evoked Potential Laboratory, Faculty of Medicine, Technion--Israel Institute of Technology, Haifa 32000, Israel. sinai@tx.technion.ac.il
OBJECTIVE: Auditory event-related brain potentials (ERPs) were recorded during a lexical decision task in response to linguistic and non-linguistic stimuli, to assess the detailed time course of language processing in general, and hemispheric dominance in particular. METHODS: Young adults (n=17) were presented with pairs of auditory stimuli consisting of words, pseudowords and words played backwards in a lexical decision task. ERPs were recorded from 21 scalp electrodes. Current densities were calculated using low-resolution electromagnetic tomography (LORETA). Statistic non-parametric maps of activity were derived from the calculated current densities and the number of active brain voxels in the left and right hemispheres was compared throughout the processing of each stimulus. RESULTS: Our results show that hemispheric dominance is highly time dependent, alternating between the right and left hemispheres at different times, and that the right hemisphere's role in language processing follows a different time course for first and second language. The time course of hemispheric dominance for non-linguistic stimuli was highly variable. CONCLUSIONS: The time course of hemispheric dominance is dynamic, alternating between left and right homologous regions, with different time courses for different stimulus classes.
Evoked Potentials Laboratory, Gutwirth Building, Technion - Israel Institute of Technology, Haifa 32000, Israel.
OBJECTIVE: The purpose of this study was to define and analyze the brain activity associated with fusion of speech elements to form an auditory object and to study the effects of presenting the elements at different spatial locations (duplex stimulus). METHODS: Stimuli were formant transitions (presented to the front, left or right of the subject) and base (presented to the front), that fused to result in V-C-V sequences /aga/ and /ada/. Ten right-handed, adult, native Hebrew speakers discriminated each fused stimulus, and the brain potentials associated with performance of the task were recorded from 21 electrodes. The net-fusion response, the 'F(fusion)-complex', was extracted by subtracting the sum of potentials to the base and formant transitions from the potentials to the fused sound. Low resolution electromagnetic tomography analysis (LORETA) was performed to assess the timing and brain location of the fusion process. RESULTS: The 'F-complex', comprising of the difference N(1), P(2), N(2b)(FN(1), FP(2), FN(2b)) components could be identified for each of the stimuli and reflected a process indicating inhibition, occlusion or both, with right ear advantage in fusion. LORETA analyses indicate sequential processing of speech fusion in the temporal lobes, beginning with right prominence in FN(1) and FP(2) shifting to a more symmetrical pattern in FN(2). CONCLUSIONS: The electrophysiological correlates of speech fusion highlight the uniqueness of speech perception and the brain areas involved in its analysis.
Evoked Potentials Laboratory, Israel Institute of Technology, Haifa, Israel. sinai@techunix.technion.ac.il
The study purpose was to indicate when language-specific processing first occurs and how first and second language priming processes interact. Event-related potentials were recorded from 14 normal native Hebrew speakers, in a variation of lexical decision task, to pairs of stimuli (S1, S2) in Hebrew, English, Hebrew pseudowords, and English pseudowords. Although no behavioral priming was observed, priming by pseudowords in either language affected both N400 and the late positive component of event related potential. N1 and P2 latencies were longer to S2 in semantically related pairs, indicating that language-specific processing may take place as early as auditory cortex. Different processing of first and second language was evident only in response to pseudowords.
Department of Physics of Complex Systems, The Weizmann Institute of Science, Rehovot, Israel.
OBJECTIVE: The combination of transcranial magnetic stimulation (TMS) and electroencephalography (EEG) has been hampered by the large artifact that the TMS generates in the EEG. Using TMS with EEG necessitates a sophisticated artifact-resistant EEG system that can acquire reliable signals in the crucial several tens of milliseconds immediately following the TMS pulse. Here, we demonstrate the use of a novel artifact removal algorithm together with a 24-bit EEG system to achieve similar recordings as those obtained with the dedicated TMS-compatible EEG system. METHODS: This setup was used to compare TMS-evoked responses between a group of healthy controls and a group of patients with schizophrenia, a condition in which effective neural connectivity is thought to be compromised. RESULTS: We observe differences in TMS-evoked responses between the two groups, similar to those recently reported in a study that used a dedicated TMS-compatible EEG system. CONCLUSIONS: The standard 24-bit EEG system combined with an artifact removal algorithm produces results similar to the dedicated TMS-compatible system. SIGNIFICANCE: This paves the way for more researchers and clinicians to use TMS-evoked responses for research and diagnosis of a wide spectrum of disorders.
Department of Neurology, Carmel Medical Center, Technion-Israel Institute of Technology, Haifa, Israel. guyhaiman@gmail.com
The purpose of this study was to characterize the brain activity and associated cortical structures involved in pseudobulbar affect (PBA), a condition characterized by uncontrollable episodes of laughing and/or crying in patients with multiple sclerosis before and after treatment with dextromethorphan/quinidine (DM/Q). Behavioral responses and event-related potentials (ERPs) in response to subjectively significant and neutral verbal stimuli were recorded from 2 groups: 6 multiple sclerosis patients with PBA before (PBA-preTx) and after (PBA-DM/Q) treatment with DM/Q and 6 healthy control (HC) subjects. Statistical nonparametric mapping comparisons of ERP source current density distributions between groups were conducted for subjectively significant and neutral stimuli separately before and after treatment with DM/Q. Treatment with DM/Q had a normalizing effect on the behavioral responses of PBA patients. Event-related potential waveform comparisons of PBA-preTx and PBA-DM/Q with HC, for both neutral and subjectively significant stimuli, revealed effects on early ERP components. Comparisons between PBA-preTx and HC, in response to subjectively significant stimuli, revealed both early and late effects. Source analysis comparisons between PBA-preTx and PBA-DM/Q indicated distinct activations in areas involved in emotional processing and high-level and associative visual processing in response to neutral stimuli and in areas involved in emotional, somatosensory, primary, and premotor processing in response to subjectively significant stimuli. In most cases, stimuli evoked higher current density in PBA-DM/Q compared with the other groups. In conclusion, differences in brain activity were observed before and after medication. Also, DM/Q administration resulted in normalization of behavioral and electrophysiological measures.
Latest similar papers:
Electrocorticographic frequency alteration mapping for extraoperative localization of speech cortex.
Melinda Wu,
Kimberly Wisneski,
Gerwin Schalk,
Mohit Sharma,
Jarod Roland,
Jonathan Breshears,
Charles Gaona,
Eric C Leuthardt
Washington University School of Medicine, Washington University in St. Louis, St. Louis, Missouri 63110, USA.
OBJECTIVE: Electrocortical stimulation (ECS) has long been established for delineating eloquent cortex in extraoperative mapping. However, ECS is still coarse and inefficient in delineating regions of functional cortex and can be hampered by afterdischarges. Given these constraints, an adjunct approach to defining motor cortex is the use of electrocorticographic (ECoG) signal changes associated with active regions of cortex. The broad range of frequency oscillations are categorized into 2 main groups with respect to sensorimotor cortex: low-frequency bands (LFBs) and high-frequency bands (HFBs). The LFBs tend to show a power reduction, whereas the HFBs show power increases with cortical activation. These power changes associated with activated cortex could potentially provide a powerful tool in delineating areas of speech cortex. We explore ECoG signal alterations as they occur with activated region of speech cortex and its potential in clinical brain mapping applications. METHODS: We evaluated 7 patients who underwent invasive monitoring for seizure localization. Each had extraoperative ECS mapping to identify speech cortex. Additionally, all subjects performed overt speech tasks with an auditory or a visual cue to identify associated frequency power changes in regard to location and degree of concordance with ECS results. RESULTS: Electrocorticographic frequency alteration mapping (EFAM) had an 83.9% sensitivity and a 40.4% specificity in identifying any language site when considering both frequency bands and both stimulus cues. Electrocorticographic frequency alteration mapping was more sensitive in identifying the Wernicke area (100%) than the Broca area (72.2%). The HFB is uniquely suited to identifying the Wernicke area, whereas a combination of the HFB and LFB is important for Broca localization. CONCLUSION: The concordance between stimulation and spectral power changes demonstrates the possible utility of EFAM as an adjunct method to improve the efficiency and resolution of identifying speech cortex.
Department of Electrical Engineering, University of Southern California, Los Angeles, ca 90089 USA.
Auditory attention is a complex mechanism that involves the processing of low-level acoustic cues together with higher level cognitive cues. In this paper, a novel method is proposed that combines biologically inspired auditory attention cues with higher level lexical and syntactic information to model task-dependent influences on a given spoken language processing task. A set of low-level multiscale features (intensity, frequency contrast, temporal contrast, orientation, and pitch) is extracted in parallel from the auditory spectrum of the sound based on the processing stages in the central auditory system to create feature maps that are converted to auditory gist features that capture the essence of a sound scene. The auditory attention model biases the gist features in a task-dependent way to maximize target detection in a given scene. Furthermore, the top-down task-dependent influence of lexical and syntactic information is incorporated into the model using a probabilistic approach. The lexical information is incorporated by using a probabilistic language model, and the syntactic knowledge is modeled using part-of-speech (POS) tags. The combined model is tested on automatically detecting prominent syllables in speech using the BU Radio News Corpus. The model achieves 88.33% prominence detection accuracy at the syllable level and 85.71% accuracy at the word level. These results compare well with reported human performance on this task.
Carolin Sheridan,
Rossitza Draganova,
Maureen Ware,
Pamela Murphy,
Rathinaswamy Govindan,
Eric R Siegel,
Hari Eswaran,
Hubert Preissl
Department of Obstetrics and Gynecology, University of Arkansas for Medical Sciences, 4301 W. Markham # 518, Little Rock, AR 72205, USA; MEG Center Tübingen, University of Tübingen, Otfried-Müller-Str. 47, Tübingen 72076, Germany.
Background: The processing of rapidly presented stimuli has been shown to be a precursor for the perception of speech in infants, long before they learn to speak. However, the onset and early development of rapid temporal processing (RTP) skills is not yet well understood. The main goal of this study was to assess the development of RTP skills during the prenatal and early postnatal stages of life. Methodology: Tone pairs were presented in two difficulties (long and short) and event-related magnetic fields were recorded using MEG. Pregnant women (22)(gestational ages between 29 and 38weeks') participated in the fetal study and 15 returned for a neonatal follow-up study between 2 and 38days after delivery or 38 and 44weeks gestational age (GA). Results: In the postnatal follow-up study, a trend towards two peaks with increasing chronological and gestational age was observed in the longer tone pair. However, no such trend was evident in neonatal responses to the short tone pairs or in fetal recordings. Conclusions: Neonates showed a gradual trend to successful processing of the longer tone pair with increasing age. By 22days of chronological age, the infants processed this tone pair successfully, as indicated by two-peak waveforms. Therefore, the first 3weeks of life could be critical for the development of RTP. Significance: This study is a first approach towards the assessment of early RTP development. The results provide promising indications for future studies, which might lead to an early detection of deficits in speech perception and therefore prevent further language impairments.
Silicon Speech, Santa Venetia, CA 94903.
J S Anderson,
N Lange,
A Froehlich,
M B Dubray,
T J Druzgal,
M P Froimowitz,
A L Alexander,
E D Bigler,
J E Lainhart
Departments of Neuroradiology, Psychology, and Psychiatry, Program in Neuroscience, and Brain Institute, University of Utah, Salt Lake City, Utah; Department of Psychiatry, Harvard Medical School, Boston, Massachusetts; Department of Biostatistics, Harvard School of Public Health, Boston, Massachusetts; Neurostatistics Laboratory, McLean Hospital, Belmont, Massachusetts; Departments of Medical Physics and Psychiatry and Waisman Laboratory for Brain Imaging and Behavior, Waisman Center, University of Wisconsin-Madison, Madison, Wisconsin; and Department of Psychology and Neuroscience Center, Brigham Young University, Provo, Utah.
BACKGROUND AND PURPOSE: Individuals with autism spectrum disorders often exhibit atypical language patterns, including delay of speech onset, literal speech interpretation, and poor recognition of social and emotional cues in speech. We acquired functional MR images during an auditory language task to evaluate systematic differences in language-network activation between control and high-functioning autistic populations. MATERIALS AND METHODS: Forty-one right-handed male subjects (26 high-functioning autistic subjects, 15 controls) were studied by using an auditory phrase-recognition task, and areas of differential activation between groups were identified. Hand preference, verbal intelligence quotient (IQ), age, and language-function testing were included as covariables in the analysis. RESULTS: Control and autistic subjects showed similar language-activation networks, with 2 notable differences. Control subjects showed significantly increased activation in the left posterior insula compared with autistic subjects (P <.05, false discovery rate), and autistic subjects showed increased bilaterality of receptive language compared with control subjects. Higher receptive-language scores on standardized testing were associated with greater activation of the posterior aspect of the left Wernicke area. A higher verbal IQ was associated with greater activation of the bilateral Broca area and involvement of the prefrontal cortex and lateral premotor cortex. CONCLUSIONS: Control subjects showed greater activation of the posterior insula during receptive language, which may correlate with impaired emotive processing of language in autism. Subjects with autism showed greater bilateral activation of receptive-language areas, which was out of proportion to the differences in hand preference in autism and control populations.
Max Planck Institute for Human Cognitive and Brain Sciences, Leipzig, Germany.
Studies of the neural basis of spoken language comprehension typically focus on aspects of auditory processing by varying signal intelligibility, or on higher-level aspects of language processing such as syntax. Most studies in either of these threads of language research report brain activation including peaks in the superior temporal gyrus (STG) and/or the superior temporal sulcus (STS), but it is not clear why these areas are recruited in functionally different studies. The current fMRI study aims to disentangle the functional neuroanatomy of intelligibility and syntax in an orthogonal design. The data substantiate functional dissociations between STS and STG in the left and right hemispheres: first, manipulations of speech intelligibility yield bilateral mid-anterior STS peak activation, whereas syntactic phrase structure violations elicit strongly left-lateralized mid STG and posterior STS activation. Second, ROI analyses indicate all interactions of speech intelligibility and syntactic correctness to be located in the left frontal and temporal cortex, while the observed right-hemispheric activations reflect less specific responses to intelligibility and syntax. Our data demonstrate that the mid-to-anterior STS activation is associated with increasing speech intelligibility, while the mid-to-posterior STG/STS is more sensitive to syntactic information within the speech. Hum Brain Mapp, 2009.(c) 2009 Wiley-Liss, Inc.
Phyllis Prescott, UNC-Greensboro.
We investigated operant sucking response learning in human newborns. Auditory reinforcers always occurred monaurally to see whether their potency differed between ears. Experiment 1- we controlled the reinforcers, either intrauterine heartbeat sounds or unfamiliar speech, while infants chose which ear received it. Experiment 2- we controlled the reinforcers and the receiving ear. Unfamiliar speech reinforced learning only if infants could use their right ear and heartbeats reinforced learning only if infants could use their left ear. Experiment 3- we controlled the ear while infants chose between their mothers' vs. a stranger's voice and between their mothers' vs. a foreign language. The more familiar speech reinforced learning only if infants could use their left ear. We proposed reinforcer potency differed between ears because the newborn's auditory system, just like adult's, optimizes their perceptual clarity by left-lateralized processing of their rapid temporal variations and right-lateralized processing of their longer-lasting spectral characteristics.
Auditory Neuroscience Laboratory, Department of Communication Sciences, Northwestern University, Evanston, Illinois 60208, USA. daa@stanford.edu
Children with reading impairments have long been associated with impaired perception for rapidly presented acoustic stimuli and recently have shown deficits for slower features. It is not known whether impairments for low-frequency acoustic features negatively impact processing of speech in reading-impaired individuals. Here we provide neurophysiological evidence that poor readers have impaired representation of the speech envelope, the acoustical cue that provides syllable pattern information in speech. We measured cortical-evoked potentials in response to sentence stimuli and found that good readers indicated consistent right-hemisphere dominance in auditory cortex for all measures of speech envelope representation, including the precision, timing, and magnitude of cortical responses. Poor readers showed abnormal patterns of cerebral asymmetry for all measures of speech envelope representation. Moreover, cortical measures of speech envelope representation predicted up to 41% of the variability in standardized reading scores and 50% in measures of phonological processing across a wide range of abilities. Our findings strongly support a relationship between acoustic-level processing and higher-level language abilities, and are the first to link reading ability with cortical processing of low-frequency acoustic features in the speech signal. Our results also support the hypothesis that asymmetric routing between cerebral hemispheres represents an important mechanism for temporal encoding in the human auditory system, and the need for an expansion of the temporal processing hypothesis for reading disabilities to encompass impairments for a wider range of speech features than previously acknowledged.
Maastricht University, Faculty of Psychology & Neuroscience, Dept of Cognitive Neuroscience, P.O. Box 616, 6200 MD Maastricht, The Netherlands; Dept. of Psychiatry, Columbia University College of Physicians and Surgeons & New York State Psychiatric Institute, 1051 Riverside Drive, NY 10032, New York, USA.
In addition to visual information from the face of the speaker, a less natural, but nowadays extremely important visual component of speech is its representation in script. In this review, neuro-imaging studies are examined which were aimed to understand how speech and script are associated in the adult "literate" brain. The reviewed studies focused on the role of different stimulus and task factors and effective connectivity between different brain regions. The studies will be summarized in a neural mechanism for the integration of speech and script that can serve as a basis for future studies addressing (the failure of) literacy acquisition. In this proposed mechanism, speech sound processing in auditory cortex is modulated by co-presented visual letters, depending on the congruency of the letter-sound pairs. Other factors of influence are temporal correspondence, input quality and task instruction. We present results showing that the modulation of auditory cortex is most likely mediated by feedback from heteromodal areas in the superior temporal cortex, but direct influences from visual cortex are not excluded. The influence of script on speech sound processing occurs automatically and shows extended development during reading acquisition. This review concludes with suggestions to answer currently still open questions to get closer to understanding the neural basis of normal and impaired literacy.
Erik Edwards,
Maryam Soltani,
Won Kim,
Sarang S Dalal,
Mitchel S Berger,
Srikantan Nagarajan,
Robert T Knight
University of Washington, Seattle.
We recorded the electrocorticogram (ECoG) directly from the exposed cortical surface of awake neurosurgical patients during the presentation of auditory syllable stimuli. All patients were unanesthetized as part of a language mapping procedure for subsequent left-hemisphere tumor resection. Time-frequency analyses showed significant high-gamma (gammahigh: 70-160 Hz) responses from the left superior temporal gyrus (STG), but no reliable response from the left inferior frontal gyrus (IFG). Alpha suppression (alpha: 7-14 Hz) and event-related potential (ERP) responses exhibited a more widespread topography. Across electrodes, the alpha suppression from 200-450 ms correlated with the preceding (50-200 ms) gammahigh increase. The results are discussed in terms of the different physiological origins of these electrocortical signals.
