Revitalizing Heritage Language through Natural Language Processing: Innovations and Challenges

Hao Li
Mao Ran


This paper explores the application of Natural Language Processing (NLP) to heritage language education, addressing both its potential benefits and the challenges encountered. As the technology has advanced, NLP has found significant use in domains including linguistics, healthcare, and education. The research examines how NLP tools and techniques can enhance the learning and teaching of heritage languages, an area often overlooked in mainstream language education. It assesses the practical value and effectiveness of NLP in facilitating language acquisition, preserving cultural identity, and addressing the unique challenges faced by heritage language learners. The study also highlights the limitations and ethical considerations of implementing NLP in this domain. Through a comprehensive analysis of existing literature and case studies, the paper aims to provide insight into the role of NLP in revitalizing heritage languages, thereby contributing to the broader field of linguistics and language education.

How to Cite
Li, H., & Ran, M. (2024). Revitalizing Heritage Language through Natural Language Processing: Innovations and Challenges. Rajapark International Journal, 1(1), 82–91. Retrieved from
Academic Article

