Publication Details
Issue: Vol 7, No 3 (2026)
Pages: 441-448
ISSN: 2660-6828

Abstract

This article analyzes the corpus-linguistic foundations for presenting Uzbek language phraseological units in multilingual electronic dictionaries. In recent decades, the rapid development of electronic lexicography, particularly the emergence of national language corpora, has opened up new methodological possibilities for phraseographical research: it has now become possible to determine the meaning, frequency of use, stylistic coloring, and contextual adaptability of a phraseological unit not by abstract linguistic intuition, but by relying on large-scale electronic databases—corpora, which consist of actual texts. The article provides a comparative review of global corpus-linguistic lexicography and the research being conducted on the basis of the National Corpus of the Uzbek Language (uzbekcorpus.uz). As a result of the research, a phased model for presenting Uzbek phraseology as a multilingual electronic dictionary article was developed: extracting concordances from the corpus, frequency and collocation analysis, semantic-stylistic annotation, determining the type of interlingual equivalence (full, partial, and null equivalence), and designing the microstructure of the dictionary article. For the analysis, samples of somatic and zoonymic phraseologisms in Uzbek were selected and compared with their Turkish, Russian, and English equivalents. Research has shown that a corpus-based approach provides objectivity, representativeness, and scalability compared to traditional intuitive phraseography, however, for low-frequency and dialectal phraseologisms, it is necessary to increase the corpus size and the quality of morphosyntactic annotation. The results of the article have theoretical and practical significance for creating a phraseological module based on the national Uzbek language corpus, improving machine translation systems, and the practice of teaching Uzbek as a foreign language.

Keywords
corpus linguistics phraseology phraseological unit multilingual electronic dictionary corpus-based lexicography collocation national corpus semantic annotation cross-linguistic equivalence lingvistic-cultural connotation