The article analyzes the role of the corpus as a multifaceted linguistic source, the two main types of information a corpus contains, and the types of corpora. It identifies the main practical strengths of a corpus: it reduces the time spent on text analysis, and it makes it possible to illustrate the features of language units in speech with thousands of examples. Within the field of computational linguistics, the article describes the national corpus, the educational corpus, and the parallel corpus. It emphasizes that linguistic and extralinguistic tagging of these corpora, the development of a corpus creation algorithm, and the creation of linguistic support for the corpus are a social necessity. It also recognizes the urgency of developing the foundations for an Uzbek language corpus and of conducting research in computational linguistics as a scientific and theoretical source.
inLibrary is a scientific electronic library built on the open science paradigm. Its main tasks are popularizing science and scientific activity, public quality control of scientific publications, developing interdisciplinary research, supporting a modern institution of scientific review, increasing the citation of Uzbek science, and building a knowledge infrastructure.