Lexical-semantic clustering in the diagnostics of culture integration in discourse
Semantic clustering as a research method is commonly utilized to explore lexical systems transformations, meanwhile it can provide access to far more complicated processes of cognitive transformations in discourse or even culture. This work presents a cognitive-semantic approach to the study of cultural integration, based on semantic clustering. Two cognitive processes of cultural integration are identified: direct and indirect knowledge transfer. These are manifested in the direct and indirect translation of areas of knowledge from the donor culture to the recipient culture. The analysis procedure and results are demonstrated using a corpus of 864,005 examples of words borrowed from Russian into Kyrgyz from web discourse.
Objectives. The research objectives are to reveal the knowledge domains transferred directly from Russian to Kyrgyz culture through integrated loanwords in web discourse and to specify the roles of qualia in the indirect transfer of knowledge through loanword collocates.
Methods. While corpus analysis allows to identify the knowledge domains which are directly transferred from donor into recipient culture, semantic clustering helps specify the indirect culture integration since it explores the contexts mediated by the use of these domains.
Results. Based on the established lists of loanwords, the compiled database of their contexts and lexical-semantic clustering algorithm performed to identify their similarity in the context use of loanwords (accessed at https://osf.io/78u4m/), the study determined both the thematic attribution and qualia roles of the knowledge domains in direct and indirect knowledge transfer as a result of culture integration which is found in the context of Russian (as donor) and Kyrgyz (as recipient) cultures. Featuring 4,126 target words with their vector similarity weight identified, further analysis revealed their cognitive roles attributed to four qualia – formal, constitutive, telic, agentive, which were indirectly mediated by culture integration.
Conclusion. The study shows that, while direct knowledge transfer resulted in the penetration of novel thematic domains and the modification of existing ones, indirect knowledge transfer mostly resulted in the transformation of the formal role of encoding taxonomic information about the referent of the loanword. This suggests that the recipient culture adapted the knowledge domains within a wider domain matrix rather than adapting or transforming their components for more specific purposes. The results obtained provide evidence for novel applications of lexical-semantic clustering, as well as novel methods for exploring cultural integration and disintegration processes.
Figures





Kiose, M. I., Khulkhachieva, Zh. S., Izyumskaya-Kapitonova, V. V., Barmin, A. V. (2025). Lexical-semantic clustering in the diagnostics of culture integration in discourse, Research Result. Theoretical and Applied Linguistics, 11 (4), 63–84.


















While nobody left any comments to this publication.
You can be first.
Cimiano, P., Wenderoth, J. (2005). Automatically Learning Qualia Structures from the Web. In T. Baldwin, A. Korhonen, A. Vil-lavicencio (Eds.), Proceedings of the ACL workshop on Deep Lexical Acquisition. Ann Arbor, Michigan, 28–37. (In English)
Copestake, A., Briscoe, T. (1995). Semi-Productive Polysemy and Sense Extension, Journal of Semantics, 12 (1), 15–67. (In English)
Deerwester, S., Dumais, S. T., Furnas, G. W., Landauer, T. K. and Harshman, R. (1990). Indexing by latent semantic analysis, Journal of the American Society for Information Science, 41(6), 391–407. (In English)
Demyankov, V. Z. (2016). Language techniques of knowledge transfer, in Feschenko, V. V. (ed.) Linguistics and semiotics of cultural transfers: methods, principles, technologies. Moscow, Cultural revolution, 61–85. (In Russian)
Derbisheva, Z. K. (2017). Language and ethnos. Moscow, Flinta. (In Russian)
Derbisheva, Z. K. (2019). The basics of linguocognitive exploring languages in contrast. Moscow, Flinta.(In Russian)
Feschenko, V. V., Bochaver, S. Yu. (2016). Theory of cultural transfers: from translation studies – through cultural studies – to the theory of language, in Feschenko, V. V. (ed.) Linguistics and semiotics of cultural transfers: methods, principles, technologies. Moscow, Cultural revolution, 5–35. (In Russian)
Gallagher, S., Hutto, D. D. (2008). Understanding others through primary interaction and narrativepractice, in J. Zlatev, T. Racine, C. Sinha and E. Itkonen (eds.), The shared mind: Perspectives of intersubjectivity. Amsterdam, John Benjamins, 17–38. (In English)
Haspelmath, M. (2009). Lexical borrowings: Concepts and issues, in M. Haspelmath, U. Tadmor (eds.). Loanwords in the world’s languages. Berlin, De Gruyter, 35–54. (In English)
Iriskhanova, O. K., Kiose, M. I. (2016). Technologies of interdisciplinary terms transfer into language studies, in Feschenko, V. V. (ed.) Linguistics and semiotics of cultural transfers: methods, principles, technologies. Moscow, Cultural revolution, 151–180. (In Russian)
Jain, A. K., Murty, M. N., Flynn, P. J. (1999). Data clustering: a review. ACM computing surveys (CSUR), 31(3), 264–323. (In English)
Jezek, E., Quochi, V., Calzolari, N. (2009). Relevance of Qualia Relations in Coercive Contexts: Conference paper, 5th International Conference on Generative Approaches to the Lexicon. URL: https://www.researchgate.net/
publication/228960165_Relevance_of_Qualia_Relations_in_Coercive_Contexts. (Accessed 18 August 2015). (In English)
Kambaralieva, U. D., Sternin, I. A. (2021). Russian and Kyrgyz communicative behaviour. Voronezh, Ritm. (In Russian)
Kuhn, A., Ducasse, S., Gîrba, T. (2007). Semantic clustering: Identifying topics in source code, Information and Software Technology, 49 (3), 230–243. URL: https://doi.org/10.1016/j.
infsof.2006.10.017 (Accessed 25 July 2015). (In English)
Langacker, R. W. (2007). Cognitive grammar, in D. Geeraerts, H. Cuyckens (eds.), The Oxford handbook of cognitive linguistics. Oxford, Oxford University Press, 421–507. (In English)
Lee, C., Chang, C., Hsu, W., Hsieh, S. (2010). Qualia modification in noun-noun compounds: A cross language survey, Proceedings of the 22nd Conference on Computational Linguistics and Speech Processing (ROCLING-2010), 379–390. (In English)
Lee, S., Kim, H., Hwang, J., Park, E. and Ok, J. (2025). Efficient Latent Semantic Clustering for Scaling Test-Time Computation of LLMs. URL: https://doi.org/arXiv preprint arXiv:2506.00344. (In English)
Mikolov, T., Chen, K., Corrado, G., Dean, J. (2013). Efficient estimation of word representations in vector space. URL: https://doi.org/arXiv preprint arXiv:1301.3781. (In English)
Myers-Scotton, C. (2002). Contact linguistics: Bilingual encounters and grammatical outcomes. Oxford, Oxford University Press. (In English)
Myers-Scotton, C. (2006). Multiple voices: An introduction to bilingualism. Malden, MA, Blackwell. (In English)
Pérez-Serrano, M., Nogueroles-López, M. and Duñabeitia, J. A. (2022). Effects of semantic clustering and repetition on incidental vocabulary learning. Frontiers in Psychology, 13, 997951.(In English)
Postovalova, V. I. Routes and principles of knowledge transfer in the humanities, in Feschenko, V. V. (ed.) Linguistics and semiotics of cultural transfers: methods, principles, technologies. Moscow, Cultural revolution, 36–60. (In Russian)
Pustejovsky, J. (1991) The generative lexicon, Computational Linguistics, 17 (4), 409–441. (In English)
Pustejovsky, J. (2006). Generative lexicon, Encyclopedia of language and linguistics. Amsterdam, Elsevier, 138–147.(In English)
Pustejovsky, J., Hanks, P., Rumshisky, A. (2004). Automated induction of sense in context, COLING, 924–931. (In English)
Pustejovsky, J., Jezek, E. (2016). A Guide to Generative Lexicon Theory. Oxford, Oxford University Press. (In English)
Shaffer, C., Andreano, J. M., Touroutoglou, A., Barrett, L. F., Dickerson, B. C. and Wong, B. (2024). Semantic clustering during verbal episodic memory encoding and retrieval in older adults: One cognitive mechanism of superaging. Brain Sciences, 14 (2), 171. (In English)
Shipilov, A. V. (2018). Russian – Kyrgyzstan: the historical experience of cross-cultural discourse construal (second half of XIX – XXI c.). Bishkek. (In Russian)
Siew, C. S., Wulff, D. U., Beckage, N. M., Kenett, Y. N. (2019). Cognitive network science: A review of research on cognition through the lens of network representations, processes, and dynamics. Complexity, 2019 (1), 2108423. (In English)
Song, Z. and Qiu, L. (2013). Qualia relations in Chinese nominal compounds containing verbal elements, International Journal of Knowledge and Language Processing, 4 (1), 1–15. (In English)
Talmy, L. (2000). Toward a cognitive semantics. Vol. 2, Typology and process in concept structuring. Cambridge, MA, MIT Press. (In English)
Tinkham, T. (1993). The effects of semantic clustering on the learning of second language vocabulary. System, 21, 371–380. (In English)
Thomason, S. G. (2001). Language contact. Washington, D.C.: Georgetown University Press. (In English)
Troyer, A. K., Moscovitch, M., Winocur, G. (1997). Clustering and switching as two components of verbal fluency: Evidence from younger and older healthy adults. Neuropsychology, 11 (1), 138–146. (In English)
Verhagen, A. (2007). Construal and perpectivization, in D. Geeraerts, H. Cuyckens (Eds.), The Oxford handbook of cognitive linguistics. Oxford, Oxford University Press, 48–80.(In English)
Winford, D. (2005). Contact-induced changes: Classification and processes. Diachronica, 22 (2), 373–427. (In English)
Witschard, D., Jusufi, I., Martins, R. M., Kucher, K., Kerren, A. (2022). Interactive optimization of embedding-based text similarity calculations, Information Visualization, 21 (4), 335–353. URL: https://doi.org/10.1177/14738716221114372 (Accessed 25 September 2025). (In English)
Corpus materials
Karasaev, Kh. K. (1986). The Dictionary of Loanwords: 5100 words. Frunze: Kyrgyz Sovet. (In Kyrgyz)
Kyrgyz Web Text Corpus. Leipzig Corpora, available at: https://wortschatz.uni-leipzig.de/en (Accessed 6 September 2025). (In English)
Python Documentation, 5. Data Structures (2025). Python Software Foundation. URL: https://docs.python.org/3/tutorial/datastructures.html (Accessed 25 September 2025). (In English)
The Kyrgyz News Corpus dataset (2025). Hugging Face AI community. URL: https://huggingface.co/datasets/the-cramer-project/Kyrgyz_News_Corpus#kyrgyz_news_corpus (Accessed 6 September 2025). (In English)
Yudakhin, K. K. (1985). Kyrgyz-Russian Dictionary. Frunze: Glavnaya redaktsiya Kirgyzskoy Sovetskoy Entsiklopedii. (InRussian)
The research is part of the FSFU-2025-0004 Project “Assessment of cultural integration and disintegration processes in CIS countries: the study of communicative practices” which is being carried out at Moscow State Linguistic University.