CORPUS-BASED MODELLING OF SCRIPT VARIATION AND DIGITAL LEXICAL INNOVATION IN MODERN UZBEK: A FUNCTIONAL APPROACH TO NORMATIVE STABILIZATION

Authors

  • Rakhmatullayeva Dilafruzkhon Shukhratovna PhD, Associate Professor, Department of Uzbek Language and Literature Kokand State University

DOI:

https://doi.org/10.17605/

Keywords:

Uzbek linguistics; corpus linguistics; script variation; digital communication; lexical borrowing; literary norm; language policy; normative stabilization.

Abstract

This article investigates one of the most urgent problems of contemporary Uzbek linguistics: the functional stabilization of the literary norm under conditions of digital communication, script coexistence, rapid lexical borrowing and corpus-based language description. The central argument is that modernization of Uzbek cannot be reduced to orthographic regulation alone, because real linguistic development is now formed at the intersection of education, official communication, digital journalism, social media, machine transliteration, speech technologies and academic terminology. The study proposes a functional model for analysing modern Uzbek through five interrelated dimensions: script variation, lexical innovation, morpho-syntactic regularity, digital-register differentiation and educational codification. The research is based on qualitative synthesis of corpus-linguistic, sociolinguistic and applied-linguistic approaches, with particular attention to the challenges faced by an agglutinative, historically multi-script and comparatively low-resource language in the digital environment. The findings indicate that Latin-Cyrillic coexistence, English-based digital borrowings and inconsistent orthographic habits should not be treated merely as signs of disorder. They constitute empirical material through which linguists can identify actual usage patterns and design more precise lexicographic, educational and technological solutions. The article concludes that the future of Uzbek literary language development depends on evidence-based codification, balanced terminology policy, high-quality annotated corpora and systematic integration of linguistic research with digital humanities and language technologies.

References

1. Abjalova, M., & colleagues. Educational Corpus of the Uzbek Language and Its Opportunities. Proceedings of the International Conference on Computer Science and Engineering, 2023.

2. Biber, D., Conrad, S., & Reppen, R. Corpus Linguistics: Investigating Language Structure and Use. Cambridge: Cambridge University Press, 1998.

3. Crystal, D. Internet Linguistics: A Student Guide. London: Routledge, 2011.

4. Fishman, J. A. Reversing Language Shift: Theoretical and Empirical Foundations of Assistance to Threatened Languages. Clevedon: Multilingual Matters, 1991.

5. Gries, S. T. Statistics for Linguistics with R: A Practical Introduction. Berlin: De Gruyter Mouton, 2021.

6. Haugen, E. Dialect, Language, Nation. American Anthropologist, 68(4), 1966, 922–935.

7. Herring, S. C. A Faceted Classification Scheme for Computer-Mediated Discourse. Language@Internet, 4, 2007.

8. McEnery, T., & Hardie, A. Corpus Linguistics: Method, Theory and Practice. Cambridge: Cambridge University Press, 2012.

9. Mansurov, B., & Mansurov, A. Uzbek Cyrillic-Latin-Cyrillic Machine Transliteration. arXiv preprint arXiv:2101.05162, 2021.

10. Povey, A., & Povey, K. FeruzaSpeech: A 60 Hour Uzbek Read Speech Corpus with Punctuation, Casing, and Context. Proceedings of the 7th International Conference on Natural Language and Speech Processing, 2024, 360–364.

11. Salaev, U., Kuriyozov, E., & Gómez-Rodríguez, C. A Machine Transliteration Tool Between Uzbek Alphabets. CEUR Workshop Proceedings, Vol. 3315, 2022.

12. Salaev, U. UzMorphAnalyser: A Morphological Analysis Model for the Uzbek Language Using Inflectional Endings. arXiv preprint arXiv:2405.14179, 2024.

13. Spolsky, B. Language Policy. Cambridge: Cambridge University Press, 2004.

14. Wardhaugh, R., & Fuller, J. M. An Introduction to Sociolinguistics. Oxford: Wiley-Blackwell, 2015.

15. Ўзбек тилининг изоҳли луғати. 5 жилдлик. Тошкент: Ўзбекистон миллий энциклопедияси, 2006–2008.

16. Ҳожиев, А. Тилшунослик терминларининг изоҳли луғати. Тошкент: Ўзбекистон миллий энциклопедияси, 2002.

17. Қўнғуров, Р., Бегматов, Э., & Тожиев, Ё. Нутқ маданияти ва услубият асослари. Тошкент: Ўқитувчи, 1992.

18. Jamolxonov, H. Hozirgi o‘zbek adabiy tili. Toshkent: Talqin, 2005.

Downloads

Published

2026-06-09

Issue

Section

Articles