Initially, the theory was associationist: infants mimic the speech they hear and that this leads to behavioristic associations between articulation and its sensory consequences. Later, this overt mimicry would be short-circuited and become speech perception. This aspect of the theory was dropped, however, with the discovery that prelinguistic infants could already detect most of the phonetic contrasts used to separate different speech sounds.
Initially, speech perception was assumed to link to speech objects that were both
This was later revised to include the phonetic gestures rather than motor commands, and then the gestures intended by the speaker at a prevocal, linguistic level, rather than actual movements.
The "speech is special" claim has been dropped, as it was found that speech perception could occur for nonspeech sounds (for example, slamming doors for duplex perception).
If speech is identified in terms of how it is physically made, then nonauditory information should be incorporated into speech percepts even if it is still subjectively heard as "sounds". This is, in fact, the case.
If people can hear the gestures in speech, then the imitation of speech should be very fast, as in when words are repeated that are heard in headphones as in speech shadowing. People can repeat heard syllables more quickly than they would be able to produce them normally.
Evidence exists that perception and production are generally coupled in the motor system. This is supported by the existence of mirror neurons that are activated both by seeing (or hearing) an action and when that action is carried out. Another source of evidence is that for common coding theory between the representations used for perception and action.
The motor theory of speech perception is not widely held in the field of speech perception, though it is more popular in other fields, such as theoretical linguistics. As three of its advocates have noted, "it has few proponents within the field of speech perception, and many authors cite it primarily to offer critical commentary".p. 361 Several critiques of it exist.
Speech perception is affected by nonproduction sources of information, such as context. Individual words are hard to understand in isolation but easy when heard in sentence context. It therefore seems that speech perception uses multiple sources that are integrated together in an optimal way.
The motor theory of speech perception would predict that speech motor abilities in infants predict their speech perception abilities, but in actuality it is the other way around. It would also predict that defects in speech production would impair speech perception, but they do not. However, this only affects the first and already superseded behaviorist version of the theory, where infants were supposed to learn all production-perception patterns by imitation early in childhood. This is no longer the mainstream view of motor-speech theorists.
Several sources of evidence for a specialized speech module have failed to be supported.
As a result, this part of the theory has been dropped by some researchers.
The evidence provided for the motor theory of speech perception is limited to tasks such as syllable discrimination that use speech units not full spoken words or spoken sentences. As a result, "speech perception is sometimes interpreted as referring to the perception of speech at the sublexical level. However, the ultimate goal of these studies is presumably to understand the neural processes supporting the ability to process speech sounds under ecologically valid conditions, that is, situations in which successful speech sound processing ultimately leads to contact with the mental lexicon and auditory comprehension." This however creates the problem of " a tenuous connection to their implicit target of investigation, speech recognition".
Liberman, A. M.; Cooper, F. S.; Shankweiler, D. P.; Studdert-Kennedy, M. (1967). "Perception of the speech code". Psychological Review. 74 (6): 431–461. doi:10.1037/h0020279. PMID 4170865. /wiki/Doi_(identifier)
Liberman, A. M.; Mattingly, I. G. (1985). "The motor theory of speech perception revised". Cognition. 21 (1): 1–36. CiteSeerX 10.1.1.330.220. doi:10.1016/0010-0277(85)90021-6. PMID 4075760. S2CID 112932. /wiki/CiteSeerX_(identifier)
Liberman, A. M.; Mattingly, I. G. (1989). "A specialization for speech perception". Science. 243 (4890): 489–494. Bibcode:1989Sci...243..489L. doi:10.1126/science.2643163. PMID 2643163. /wiki/Bibcode_(identifier)
Liberman, A. M.; Whalen, D. H. (2000). "On the relation of speech to language". Trends in Cognitive Sciences. 4 (5): 187–196. doi:10.1016/S1364-6613(00)01471-6. PMID 10782105. S2CID 12252728. /wiki/Doi_(identifier)
Galantucci, B.; Fowler, C. A.; Turvey, M. T. (2006). "The motor theory of speech perception reviewed". Psychonomic Bulletin & Review. 13 (3): 361–377. doi:10.3758/bf03193857. PMC 2746041. PMID 17048719. https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2746041
Galantucci, B.; Fowler, C. A.; Turvey, M. T. (2006). "The motor theory of speech perception reviewed". Psychonomic Bulletin & Review. 13 (3): 361–377. doi:10.3758/bf03193857. PMC 2746041. PMID 17048719. https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2746041
Galantucci, B.; Fowler, C. A.; Turvey, M. T. (2006). "The motor theory of speech perception reviewed". Psychonomic Bulletin & Review. 13 (3): 361–377. doi:10.3758/bf03193857. PMC 2746041. PMID 17048719. https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2746041
Liberman, A. M. (1996). Speech: A special code. Cambridge, MA: MIT Press. ISBN 978-0-262-12192-7 /wiki/ISBN_(identifier)
Liberman, A. M.; Delattre, P.; Cooper, F. S. (1952). "The role of selected stimulus-variables in the perception of the unvoiced stop consonants". The American Journal of Psychology. 65 (4): 497–516. doi:10.2307/1418032. JSTOR 1418032. PMID 12996688. /wiki/Doi_(identifier)
Liberman, A. M.; Delattre, P. C.; Cooper, F. S.; Gerstman, L. J. (1954). "The role of consonant-vowel transitions in the perception of the stop and nasal consonants". Psychological Monographs: General and Applied. 68 (8): 1–13. doi:10.1037/h0093673. PDF Archived 2016-03-03 at the Wayback Machine /wiki/Doi_(identifier)
Fowler, C. A.; Saltzman, E. (1993). "Coordination and coarticulation in speech production". Language and Speech. 36 ( Pt 2-3) (2–3): 171–195. doi:10.1177/002383099303600304. PMID 8277807. S2CID 7199908. PDF /wiki/Doi_(identifier)
Liberman, A. M.; Delattre, P. C.; Cooper, F. S.; Gerstman, L. J. (1954). "The role of consonant-vowel transitions in the perception of the stop and nasal consonants". Psychological Monographs: General and Applied. 68 (8): 1–13. doi:10.1037/h0093673. PDF Archived 2016-03-03 at the Wayback Machine /wiki/Doi_(identifier)
Liberman, A. M.; Cooper, F. S.; Shankweiler, D. P.; Studdert-Kennedy, M. (1967). "Perception of the speech code". Psychological Review. 74 (6): 431–461. doi:10.1037/h0020279. PMID 4170865. /wiki/Doi_(identifier)
Liberman, A. M.; Cooper, F. S.; Shankweiler, D. P.; Studdert-Kennedy, M. (1967). "Perception of the speech code". Psychological Review. 74 (6): 431–461. doi:10.1037/h0020279. PMID 4170865. /wiki/Doi_(identifier)
Liberman, A. M.; Isenberg, D.; Rakerd, B. (1981). "Duplex perception of cues for stop consonants: Evidence for a phonetic mode". Perception & Psychophysics. 30 (2): 133–143. doi:10.3758/bf03204471. PMID 7301513. https://doi.org/10.3758%2Fbf03204471
Liberman, A. M.; Delattre, P. C.; Cooper, F. S.; Gerstman, L. J. (1954). "The role of consonant-vowel transitions in the perception of the stop and nasal consonants". Psychological Monographs: General and Applied. 68 (8): 1–13. doi:10.1037/h0093673. PDF Archived 2016-03-03 at the Wayback Machine /wiki/Doi_(identifier)
Liberman, A. M. (1970). "The grammars of speech and language" (PDF). Cognitive Psychology. 1 (4): 301–323. doi:10.1016/0010-0285(70)90018-6. Archived from the original (PDF) on 2015-12-31. Retrieved 2009-06-02. https://web.archive.org/web/20151231094548/http://www.haskins.yale.edu/Reprints/HL0099.pdf
Liberman, A. M.; Cooper, F. S.; Shankweiler, D. P.; Studdert-Kennedy, M. (1967). "Perception of the speech code". Psychological Review. 74 (6): 431–461. doi:10.1037/h0020279. PMID 4170865. /wiki/Doi_(identifier)
Liberman, A. M.; Mattingly, I. G. (1985). "The motor theory of speech perception revised" (PDF). Cognition. 21 (1): 1–36. CiteSeerX 10.1.1.330.220. doi:10.1016/0010-0277(85)90021-6. PMID 4075760. S2CID 112932. Archived from the original (PDF) on 2021-04-15. Retrieved 2009-06-02. https://web.archive.org/web/20210415025120/http://www.haskins.yale.edu/Reprints/HL0519.pdf
Galantucci, B.; Fowler, C. A.; Turvey, M. T. (2006). "The motor theory of speech perception reviewed". Psychonomic Bulletin & Review. 13 (3): 361–377. doi:10.3758/bf03193857. PMC 2746041. PMID 17048719. https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2746041
Fowler, C. A.; Rosenblum, L. D. (1990). "Duplex perception: A comparison of monosyllables and slamming doors". Journal of Experimental Psychology. Human Perception and Performance. 16 (4): 742–754. doi:10.1037/0096-1523.16.4.742. PMID 2148589. /wiki/Doi_(identifier)
Galantucci, B.; Fowler, C. A.; Turvey, M. T. (2006). "The motor theory of speech perception reviewed". Psychonomic Bulletin & Review. 13 (3): 361–377. doi:10.3758/bf03193857. PMC 2746041. PMID 17048719. https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2746041
Massaro, D. W.; Chen, T. H. (2008). "The motor theory of speech perception revisited". Psychonomic Bulletin & Review. 15 (2): 453–457, discussion 457–62. doi:10.3758/pbr.15.2.453. PMID 18488668. S2CID 9266946. /wiki/Doi_(identifier)
MacLeod, A.; Summerfield, Q. (1987). "Quantifying the contribution of vision to speech perception in noise". British Journal of Audiology. 21 (2): 131–141. doi:10.3109/03005368709077786. PMID 3594015. /wiki/Doi_(identifier)
Fowler, C. A.; Dekle, D. J. (1991). "Listening with eye and hand: Cross-modal contributions to speech perception". Journal of Experimental Psychology. Human Perception and Performance. 17 (3): 816–828. doi:10.1037/0096-1523.17.3.816. PMID 1834793. /wiki/Doi_(identifier)
Nygaard LC, Pisoni DB (1995). "Speech Perception: New Directions in Research and Theory". In J.L. Miller, P.D. Eimas (eds.). Handbook of Perception and Cognition: Speech, Language, and Communication. San Diego: Academic Press. ISBN 978-0-12-497770-9. 978-0-12-497770-9
Liberman, A. M.; Harris, K. S.; Hoffman, H. S.; Griffith, B. C. (1957). "The discrimination of speech sounds within and across phoneme boundaries". Journal of Experimental Psychology. 54 (5): 358–368. doi:10.1037/h0044417. PMID 13481283. S2CID 10117886. /wiki/Doi_(identifier)
Marslen-Wilson, W. (1973). "Linguistic structure and speech shadowing at very short latencies". Nature. 244 (5417): 522–523. Bibcode:1973Natur.244..522M. doi:10.1038/244522a0. PMID 4621131. S2CID 4220775. /wiki/Bibcode_(identifier)
Porter Jr, R. J.; Lubker, J. F. (1980). "Rapid reproduction of vowel-vowel sequences: Evidence for a fast and direct acoustic-motoric linkage in speech". Journal of Speech and Hearing Research. 23 (3): 593–602. doi:10.1044/jshr.2303.593. PMID 7421161. /wiki/Doi_(identifier)
Fadiga, L.; Craighero, L.; Buccino, G.; Rizzolatti, G. (2002). "Speech listening specifically modulates the excitability of tongue muscles: A TMS study". The European Journal of Neuroscience. 15 (2): 399–402. CiteSeerX 10.1.1.169.4261. doi:10.1046/j.0953-816x.2001.01874.x. PMID 11849307. S2CID 16504172. /wiki/CiteSeerX_(identifier)
Watkins, K. E.; Strafella, A. P.; Paus, T. (2003). "Seeing and hearing speech excites the motor system involved in speech production". Neuropsychologia. 41 (8): 989–994. doi:10.1016/s0028-3932(02)00316-0. PMID 12667534. S2CID 518384. /wiki/Doi_(identifier)
Wilson, S. M.; Saygin, A. E. P.; Sereno, M. I.; Iacoboni, M. (2004). "Listening to speech activates motor areas involved in speech production". Nature Neuroscience. 7 (7): 701–702. doi:10.1038/nn1263. PMID 15184903. S2CID 8080063. /wiki/Doi_(identifier)
Skipper, J. I.; Van Wassenhove, V.; Nusbaum, H. C.; Small, S. L. (2006). "Hearing Lips and Seeing Voices: How Cortical Areas Supporting Speech Production Mediate Audiovisual Speech Perception". Cerebral Cortex. 17 (10): 2387–2399. doi:10.1093/cercor/bhl147. PMC 2896890. PMID 17218482. https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2896890
Meister, I. G.; Wilson, S. M.; Deblieck, C.; Wu, A. D.; Iacoboni, M. (2007). "The Essential Role of Premotor Cortex in Speech Perception". Current Biology. 17 (19): 1692–1696. Bibcode:2007CBio...17.1692M. doi:10.1016/j.cub.2007.08.064. PMC 5536895. PMID 17900904. https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5536895
Pulvermuller, F.; Huss, M.; Kherif, F.; Moscoso del Prado Martin F; Hauk, O.; Shtyrov, Y. (2006). "Motor cortex maps articulatory features of speech sounds". Proceedings of the National Academy of Sciences. 103 (20): 7865–7870. Bibcode:2006PNAS..103.7865P. doi:10.1073/pnas.0509989103. PMC 1472536. PMID 16682637. https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1472536
d'Ausilio, A.; Pulvermüller, F.; Salmas, P.; Bufalari, I.; Begliomini, C.; Fadiga, L. (2009). "The Motor Somatotopy of Speech Perception". Current Biology. 19 (5): 381–385. Bibcode:2009CBio...19..381D. doi:10.1016/j.cub.2009.01.017. hdl:11392/534437. PMID 19217297. https://doi.org/10.1016%2Fj.cub.2009.01.017
Assaneo, M. Florencia; Poeppel, David (2018). "The coupling between auditory and motor cortices is rate-restricted: Evidence for an intrinsic speech-motor rhythm". Science Advances. 4 (2): eaao3842. Bibcode:2018SciA....4.3842A. doi:10.1126/sciadv.aao3842. PMC 5810610. PMID 29441362. https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5810610
Rizzolatti, G.; Craighero, L. (2004). "The Mirror-Neuron System". Annual Review of Neuroscience. 27: 169–192. doi:10.1146/annurev.neuro.27.070203.144230. PMID 15217330. S2CID 1729870. PDF Archived 2007-06-30 at the Wayback Machine /wiki/Doi_(identifier)
Hommel, B.; Müsseler, J.; Aschersleben, G.; Prinz, W. (2001). "The Theory of Event Coding (TEC): A framework for perception and action planning". The Behavioral and Brain Sciences. 24 (5): 849–878, discussion 878–937. doi:10.1017/s0140525x01000103. PMID 12239891. /wiki/Doi_(identifier)
Galantucci, B.; Fowler, C. A.; Turvey, M. T. (2006). "The motor theory of speech perception reviewed". Psychonomic Bulletin & Review. 13 (3): 361–377. doi:10.3758/bf03193857. PMC 2746041. PMID 17048719. https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2746041
Massaro, D. W. (1997). Perceiving talking faces: From speech perception to a behavioral principle. Cambridge, MA: MIT Press. ISBN 978-0-262-13337-1. 978-0-262-13337-1
Lane, H (1965). "The Motor Theory of Speech Perception: A Critical Review". Psychological Review. 72 (4): 275–309. doi:10.1037/h0021986. PMID 14348425. /wiki/Doi_(identifier)
Massaro, D. W. (1997). Perceiving talking faces: From speech perception to a behavioral principle. Cambridge, MA: MIT Press. ISBN 978-0-262-13337-1. 978-0-262-13337-1
Tsao, F. M.; Liu, H. M.; Kuhl, P. K. (2004). "Speech perception in infancy predicts language development in the second year of life: A longitudinal study". Child Development. 75 (4): 1067–84. doi:10.1111/j.1467-8624.2004.00726.x. PMID 15260865. S2CID 10954073. http://ntur.lib.ntu.edu.tw//handle/246246/173252
MacNeilage, P. F.; Rootes, T. P.; Chase, R. A. (1967). "Speech production and perception in a patient with severe impairment of somesthetic perception and motor control". Journal of Speech and Hearing Research. 10 (3): 449–67. doi:10.1044/jshr.1003.449. PMID 6081929. /wiki/Doi_(identifier)
Fowler, C. A.; Rosenblum, L. D. (1990). "Duplex perception: A comparison of monosyllables and slamming doors". Journal of Experimental Psychology. Human Perception and Performance. 16 (4): 742–754. doi:10.1037/0096-1523.16.4.742. PMID 2148589. /wiki/Doi_(identifier)
Galantucci, B.; Fowler, C. A.; Turvey, M. T. (2006). "The motor theory of speech perception reviewed". Psychonomic Bulletin & Review. 13 (3): 361–377. doi:10.3758/bf03193857. PMC 2746041. PMID 17048719. https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2746041
Hickok, G.; Poeppel, D. (2007). "The cortical organization of speech processing". Nature Reviews Neuroscience. 8 (5): 393–402. doi:10.1038/nrn2113. PMID 17431404. S2CID 6199399. See page 394 /wiki/Doi_(identifier)
Hickok, G.; Poeppel, D. (2007). "The cortical organization of speech processing". Nature Reviews Neuroscience. 8 (5): 393–402. doi:10.1038/nrn2113. PMID 17431404. S2CID 6199399. See page 394 /wiki/Doi_(identifier)
Williams, H.; Nottebohm, F. (1985). "Auditory responses in avian vocal motor neurons: A motor theory for song perception in birds". Science. 229 (4710): 279–282. Bibcode:1985Sci...229..279W. doi:10.1126/science.4012321. PMID 4012321. /wiki/Bibcode_(identifier)