Hostname: page-component-69cd664f8f-lpdgq Total loading time: 0 Render date: 2025-03-12T23:29:28.076Z Has data issue: false hasContentIssue false

Machine translation and language teaching and learning

Published online by Cambridge University Press:  04 March 2025

Jason Jolley*
Affiliation:
Missouri State University, Springfield, MO, USA
Luciane Maimone
Affiliation:
Missouri State University, Springfield, MO, USA
*
Corresponding author: Jason Jolley; Email: jasonjolley@missouristate.edu
Rights & Permissions [Opens in a new window]

Extract

Decades before educators were forced to confront the disruption posed by widely accessible generative artificial intelligence (AI) tools such as ChatGPT, language learners, instructors, and researchers began dealing with its game-changing predecessor: machine translation (MT). Researchers began assessing MT systems and proposing language teaching applications for them as soon as universities and schools gained access to them in the mid-1980s (*Anderson, 1995*; Ball, 1989*; Corness, 1985; French 1991; Lewis, 1997; Richmond, 1994*). These inquiries accelerated in the early 2000s, when internet-enabled computer labs and increasingly smarter devices put free online MT services such as Babel Fish and Google Translate (GT) at students' fingertips, triggering concerns over output quality, academic dishonesty, and the short-circuiting of actual learning. In recent years, there has been a veritable explosion of research on MT's role in and impact on language teaching and learning, with many dozens of peer-reviewed articles published in the past five years alone, as documented in a handful of comprehensive literatures reviews (Gokgoz-Kurt, 2023; Jiang et al., 2024; Jolley & Maimone, 2022; Klimova et al., 2023; Lee, 2023). The present article provides a timeline of this rapidly expanding research domain.

Type
Research Timeline
Copyright
Copyright © The Author(s), 2025. Published by Cambridge University Press

1. Introduction

Decades before educators were forced to confront the disruption posed by widely accessible generative artificial intelligence (AI) tools such as ChatGPT, language learners, instructors, and researchers began dealing with its game-changing predecessor: machine translation (MT). Researchers began assessing MT systems and proposing language teaching applications for them as soon as universities and schools gained access to them in the mid-1980s (Footnote *Anderson, 1995*; Ball, 1989*; Corness, Reference Corness1985; French Reference French, Brierley and Kemble1991; Lewis, Reference Lewis1997; Richmond, 1994*). These inquiries accelerated in the early 2000s, when internet-enabled computer labs and increasingly smarter devices put free online MT services such as Babel Fish and Google Translate (GT) at students' fingertips, triggering concerns over output quality, academic dishonesty, and the short-circuiting of actual learning. In recent years, there has been a veritable explosion of research on MT's role in and impact on language teaching and learning, with many dozens of peer-reviewed articles published in the past five years alone, as documented in a handful of comprehensive literatures reviews (Gokgoz-Kurt, Reference Gokgoz-Kurt, Qin and Stapleton2023; Jiang et al., Reference Jiang, Yu and Zhao2024; Jolley & Maimone, Reference Jolley and Maimone2022; Klimova et al., Reference Klimova, Pikhart, Benites, Lehr and Sanchez-Stockhammer2023; Lee, Reference Lee2023). The present article provides a timeline of this rapidly expanding research domain.

Hutchins and Somers (Reference Hutchins and Somers1992) define MT as “computerised systems responsible for the production of translations from one natural language into another, with or without human assistance” (p. 3). A detailed account of the history and evolution of MT technology, capabilities, and applications is beyond the scope of this timeline and available elsewhere (Garg & Agarwal, Reference Garg and Agarwal2019; Hutchins, Reference Hutchins2010; Németh, Reference Németh2019; Pestov, Reference Pestov2018). However, given the impact of MT's evolving functionality on research in the domain in question, familiarization with its major developmental milestones is important. Hutchins (Reference Hutchins2010) dates MT's origins to the late 1940s, coinciding with the invention of computers. During the 1950s and 1960s, the Cold War tech race and advances in computational linguistics fueled developments. By the 1980s, early commercial systems, such as SYSTRAN, Logos, and METAL, previously available only on mainframes in government or military facilities, could be installed on home computers and desktop workstations in schools and universities. SYSTRAN's Babel Fish, generally considered the first free online MT tool, was launched as a website in 1997, nine years before GT debuted in 2006. In the years since, advances in smartphone technology and network coverage have connected billions of people to MT and generative AI apps that translate just as well.

MT capabilities and quality have increased in step with these advances in accessibility. The earliest MT systems relied on technology known as rule-based MT, whereby automated transfer operations are executed using one-to-one lexical substitutions and preprogrammed morphosyntactic rules. By the 1990s, this rudimentary approach had largely been replaced by more sophisticated databank- and corpus-based approaches that target phrase-level equivalents, such as example-based MT, which identifies analogous phrases from aligned parallel text databanks, and statistical MT, which analyzes patterns in bilingual corpora and calculates probabilities to determine optimal word combinations and sequences. The most recent major advance in MT technology happened in 2016, when GT switched its underlying architecture to a neural machine translation (NMT) model. A powerful form of machine learning, NMT uses artificial neural networks to analyze large datasets, training itself to predict the most likely sequences of words in sentences. NMT has been shown to be substantially more accurate than previous systems. For example, Wu et al. (Reference Wu, Schuster, Chen, Le, Norouzi, Macherey, Krikun, Cao, Gao, Macherey, Klingner, Shah, Johnson, Liu, Kaiser, Gouws, Kato, Kudo, Kazawa and Dean2016) found that NMT-powered GT (GNMT) commits 60% fewer errors than the previous phrase-based model and approaches the accuracy levels of experienced human translators. Recent assessments of the translation capabilities of large language models (LLMs) have demonstrated that they are also highly accurate (Moslem et al., Reference Moslem, Haque, Kelleher and Way2023), with some suggesting that LLMs are the new paradigm for MT development (Xu et al., Reference Xu, Kim, Sharaf and Awadalla2024).

As the most recent stages outlined in this MT history overview unfolded, four clearly discernible strands or subdomains of the research into MT and language education have emerged. Research in this field rests on foundational articles that describe the history and capabilities of the first MT systems available in instructional settings, often suggesting potential applications in translator training and language programs (Anderson, 1995*; Ball, 1989*; Corness, Reference Corness1985; French, Reference French, Brierley and Kemble1991; Hutchins & Somers, Reference Hutchins and Somers1992; Lewis, Reference Lewis1997; Richmond, 1994*; Somers, Reference Somers, Forcada and Pérez-Ortiz2001). A second clearly identifiable strand involves a cluster of survey-based studies designed to gauge how often, in what ways, and for what reasons students use MT in their language learning activities, as well as the perceptions, beliefs, and attitudes learners and instructors hold regarding various aspects of MT, including specific kinds of uses (Case, 2015*; Clifford et al., 2013*; Farzi, 2016*; Hellmich & Vinall, 2023*; Jolley & Maimone, 2015*; Knowles, 2016*; Larson-Guenette, 2013*; Niño, 2009*; *O'Neill, 2019a*; and White & Heidrich, 2013*, among others). This research relates to a third strand, which centers around questions of academic dishonesty. Many publications in this strand argue that unauthorized MT use is, in fact, cheating and should be discouraged through detection and prevention strategies (Correa, Reference Correa2011, 2014*; Harris, Reference Harris2010; Luton, Reference Luton2003; McCarthy, Reference McCarthy2004; Steding, 2009*). As a corollary to this line of thought, a handful of empirical studies have sought to determine whether instructors are able to reliably detect MT use and to identify the telltale signs of MT as compared with unassisted translation or direct second language (L2) writing (Innes, 2019*; Maimone & Jolley, 2023*; O'Neill, 2012*; Somers et al., 2006*; Stapleton & Leung, 2019*). Finally, the bulk of publications in this research field may be categorized as pertaining to a strand focused on the uses of and implications for MT in formal language learning contexts. These articles discuss ways in which MT may be or has been used to support language learning, as well as its impact on performance (particularly on L2 writing output), and often discuss pedagogical implications (Ducar & Schocket, 2018*; Garcia & Pena, 2011*; Fredholm 2015*, 2019*; Lee, 2020*; Niño, 2004*, 2008*, O'Neill, 2012*, 2019b*; Vold, 2018*; and Williams, 2006*, among others), including the importance of MT literacy (Bowker, Reference Bowker2020; Loock et al., 2022*; Pellet & Meyers, 2022*). Individual learner differences, such as proficiency level (Chung, 2020*; Chung & Ahn, 2021*; Lee, 2022*; Mujtaba et al., 2022*; Shin & Chon, 2023*), learner strategies (Lee, 2020*; Ryu et al., 2022*; White & Heidrich, 2013*), and motivation (Tsai & Liao, Reference Tsai and Liao2021), as well as the role of MT and translation more broadly in supporting translanguaging approaches (Beiler & Dewilde, Reference Beiler and Dewilde2020; Hell et al., 2022*; Heugh et al., Reference Heugh, French, Arya, Pham, Tudini, Billinghurst, Tippett, Chang, Nichols and Viljoen2022; Jiang et al., Reference Jiang, Yu and Zhao2024; Kelly & Hou, 2021*; Rowe, Reference Rowe2022; Zhou et al., Reference Zhou, Zhao and Groves2022), are also frequent themes in this strand. Overall, the studies in this timeline suggest many practical applications for MT in language teaching, even if findings have been somewhat contradictory (Lee, 2022*), and more studies investigating whether MT use actually supports language development or durable proficiency gains are needed (Jolley & Maimone, Reference Jolley and Maimone2022).

The present timeline includes 63 publications. One challenge in providing a representative sampling of notable articles from each of the subdomains summarized above is that many empirical studies in this field are relatively recent. As a whole, this domain has seen a gradual progression from articles grounded in personal perspectives or anecdotal observations to small-scale exploratory experiments to more rigorous empirical studies. In selecting articles for inclusion, we prioritized empirical studies published after the advent of GT. However, we also opted to include a handful of earlier and non-empirical articles that have been especially influential or that propose systematically designed pedagogical uses for MT. We elected to consider influential doctoral dissertations but to exclude master's theses. To maintain a focus on research with an explicit emphasis on the intersection of MT and language teaching and learning, we disregarded studies more narrowly focused on translator training. We also excluded non-English sources. The publications included in this timeline are categorized according to the following themes:

  • A. Use of and perceptions about MT and language learning

    1. 1. Learner use: frequency, types, and reasons

    2. 2. Learner perceptions of MT: accuracy, usefulness, appropriateness, etc.

    3. 3. Instructor use: frequency, types, and reasons

    4. 4. Instructor perceptions of MT: accuracy, usefulness, appropriateness, etc.

  • B. MT and academic dishonesty

    1. 1. Belief that unauthorized use of MT is a form of cheating

    2. 2. Detection: emphasis on detection or instructor ability to detect MT use

    3. 3. Signs: textual features characteristic of MT use

    4. 4. Response to unauthorized use of MT (resist vs. accept, policies, etc.)

  • C. Instructional applications of MT to promote leaning

    1. 1. Proposal of or report on MT-enhanced learning activity

    2. 2. Use of pre-editing or post-editing activities to raise linguistic awareness

    3. 3. Proposal of or report on MT-enhanced pedagogical strategies or model

    4. 4. Focus on MT training or MT literacy

    5. 5. Other pedagogical implications or recommended bast practices

  • D. Impact of MT on L2 writing (products, process, overall quality, textual features, etc.)

  • E. Contributions of MT use to language development or proficiency gains

Jason Jolley is a professor of Spanish at Missouri State University. He holds a Ph.D. in Spanish from The Pennsylvania State University. His teaching and research interests include Latin American literature, translation and machine translation in language learning, motivation, and self-directed language learning.

Luciane Maimone is an associate professor of Hispanic Linguistics at Missouri State University. She holds a Ph.D. in Applied Linguistics from Georgetown University. Her research focuses on psycholinguistic aspects of non-native language acquisition, language pedagogy and assessment, and the educational affordances of online environments, including machine translation and telecollaboration.

Footnotes

* Indicates that the full reference is available in the subsequent timeline.

Note. Authors' names are shown in small capitals when the study referred to appears in this timeline.

References

Beiler, I., & Dewilde, J. (2020). Translation as translingual writing practice in English as an additional language. Modern Language Journal, 104(3), 533549. doi:10.1111/modl.12660CrossRefGoogle Scholar
Bowker, L. (2020). Machine translation literacy instruction for international business students and business English instructors. Journal of Business & Finance Librarianship, 25(1-2), 2543. doi:10.1080/08963568.2020.1794739CrossRefGoogle Scholar
Corness, P. (1985). The ALPS computer-assisted translation system in an academic environment. Translating and the Computer, 7, 118127.Google Scholar
Correa, M. (2011). Academic dishonesty in the second language classroom: Instructors’ perspectives. Modern Journal of Language Teaching Methods, 1, 6579.Google Scholar
French, R. J. (1991). Machine translation. In Brierley, W. & Kemble, I. R. (Eds.), Computers as a tool in language learning (pp. 5570). Ellis Horwood Limited.Google Scholar
Garg, A., & Agarwal, M. (2019). Machine translation: A literature review. doi:10.48550/arXiv.1901.01122CrossRefGoogle Scholar
Gokgoz-Kurt, B. (2023). Using machine translation in EFL writing: A scoping review. In Qin, J. & Stapleton, P. (Eds.), Advances in composing, translation, writing pedagogy and data-driven learning (pp. 4662). Routledge. doi:10.4324/9781003279358-4Google Scholar
Harris, H. (2010). Machine translations revisited: Issues and treatment protocol. Language Teacher, 34(4), 2529.CrossRefGoogle Scholar
Heugh, K., French, M., Arya, V., Pham, M., Tudini, V., Billinghurst, N., Tippett, N., Chang, L.-C., Nichols, J., & Viljoen, J.-M. (2022). Multilingualism, translanguaging and Transknowledging: Translation technology in EMI higher education. AILA Review, 35(1), 89127. doi:10.1075/aila.22011.heuCrossRefGoogle Scholar
Hutchins, J. (2010). Machine translation: A concise history. Journal of Translation Studies, 13(1-2), 2970.Google Scholar
Hutchins, W., & Somers, H. (1992). An introduction to machine translation. Academic Press.Google Scholar
Jiang, L., Yu, R., & Zhao, Y. (2024). Theoretical perspectives and factors influencing machine translation use in L2 writing: A scoping review. Journal of Second Language Writing, 64, 101099. doi:10.1016/j.jslw.2024.101099CrossRefGoogle Scholar
Jolley, J. R., & Maimone, L. (2022). Thirty years of machine translation in language teaching and learning: A review of the literature. L2 Journal, 14(1), 2644. doi:10.5070/L214151760CrossRefGoogle Scholar
Klimova, B., Pikhart, M., Benites, A. D., Lehr, C., & Sanchez-Stockhammer, C. (2023). Neural machine translation in foreign language teaching and learning: A systematic review. Education and Information Technologies, 28(1), 663682. doi:10.1007/s10639-022-11194-2CrossRefGoogle Scholar
Lee, S.-M. (2023). The effectiveness of machine translation in foreign language education: A systematic review and meta-analysis. Computer Assisted Language Learning, 36(1-2), 103125. doi:10.1080/09588221.2021.1901745CrossRefGoogle Scholar
Lewis, D. (1997). Machine translation in a modern language curriculum. Computer Assisted Language Learning, 10(3), 255271. doi:10.1080/0958822970100305CrossRefGoogle Scholar
Luton, L. (2003). If the computer did my homework, how come I didn't get an “A”? The French Review, 76(4), 766770.Google Scholar
McCarthy, B. (2004). Does online machine translation spell the end of take-home translation assignments? Computer-Assisted Language Learning Electronic Journal, 6(1), 2639.Google Scholar
Moslem, Y., Haque, R., Kelleher, J. D., & Way, A. (2023). Adaptive machine translation with large language models. doi:10.48550/arXiv.2301.13294CrossRefGoogle Scholar
Németh, G. (2019, October 17). Machine translation: A short overview. Towards Data Science. https://towardsdatascience.com/machine-translation-a-short-overview-91343ff39c9fGoogle Scholar
Pestov, I. (2018, March 12). A history of machine translation from the Cold War to deep learning. freeCodeCamp. https://www.freecodecamp.org/news/a-history-of-machine-translation-from-the-cold-war-to-deep-learning-f1d335ce8b5/Google Scholar
Rowe, L. (2022). Google translate and biliterate composing: Second-graders’ use of digital translation tools to support bilingual writing. TESOL Quarterly, 56(3), 883906. doi:10.1002/tesq.3143CrossRefGoogle Scholar
Somers, H. (2001). Three perspectives on MT in the classroom. In Forcada, M. L. & Pérez-Ortiz, J. A. (Eds.), Workshop on teaching machine translation (pp. 2529). Santiago de Compostela.Google Scholar
Tsai, P.-S., & Liao, H.-C. (2021). Students’ progressive behavioral learning patterns in using machine translation systems: A structural equation modeling analysis. System, 101, 102594. doi:10.1016/j.system.2021.102594CrossRefGoogle Scholar
Wu, Y., Schuster, M., Chen, Z., Le, Q. V., Norouzi, M., Macherey, W., Krikun, M., Cao, Y., Gao, Q., Macherey, K., Klingner, J., Shah, A., Johnson, M., Liu, X., Kaiser, L., Gouws, S., Kato, Y., Kudo, T., Kazawa, H., … Dean, J. (2016). Google's neural machine translation system: Bridging the gap between human and machine translation. doi:10.48550/arXiv.1609.08144CrossRefGoogle Scholar
Xu, H., Kim, Y. J., Sharaf, A., & Awadalla, H. A. (2024). A paradigm shift in machine translation: Boosting translation performance of large language models. doi:10.48550/arXiv.2309.11674CrossRefGoogle Scholar
Zhou, S., Zhao, S., & Groves, M. (2022). Towards a digital bilingualism? Students’ use of machine translation in international higher education. Journal of English for Academic Purposes, 60, 101193. doi:10.1016/j.jeap.2022.101193CrossRefGoogle Scholar