The constructive nature and functions of the corpus
DOI:
https://doi.org/10.32523/2616-678X-2025-150-1-172-185
Keywords:
computational linguistics, corpus, corpus composition, National Corpus, corpus assembly, lexicographic studies
Abstract
The study aims to explore the concept of corpora and parallel corpora, their purpose, structural features, and role in machine translation. It provides an overview of corpus creation and proposes ways to leverage computational linguistics effectively for machine translation. Information on corpus structure and operational algorithms serves as a solid theoretical and practical foundation for research in this field. The research employs methods such as theoretical analysis, generalization, and questionnaires. The main findings indicate a high demand for the National Corpus of the Kazakh Language's capabilities. However, the corpus currently supports only a limited range of text processing functions and has several basic and technical shortcomings. Machine translation should become a key tool in enhancing the corpus’s functionality. To advance this, it is essential to develop parallel corpora within the existing framework. The study significantly contributes to the improvement of the National Corpus of the Kazakh Language. Its practical value lies in offering guidance on corpus structure, algorithms, and functionality, which can aid future researchers in conducting practical work on machine translation.
License
The academic journal “Bulletin of L.N. Gumilyov Eurasian National University. Philology Series” adheres to an Open Access policy for all published materials, based on the principle of free and equitable dissemination of scholarly knowledge. The Editorial Board believes that open access to research results contributes to the advancement of philological science, strengthens academic communication, and promotes the integration of national research into the international scientific community.
1. Free and Open Access
All articles published in the journal are made openly available on the official website of the journal and are accessible to all users without restrictions, registration, or payment.
Users are entitled to:
- freely read and download materials;
- copy and distribute the texts of publications;
- print articles;
- use materials for scientific and educational purposes, provided that proper attribution is given to the author(s) and the original source of publication.
2. Licensing
Journal materials are distributed under the terms of the Creative Commons Attribution-NonCommercial 4.0 International (CC BY-NC 4.0) license:
https://creativecommons.org/licenses/by-nc/4.0/
This license permits the use, copying, distribution, and adaptation of the materials for non-commercial purposes, provided that appropriate credit is given to the author(s) and a link to the original publication is included.
3. Benefits of Open Access
The Open Access policy ensures:
- increased visibility and citation of scholarly publications;
- prompt dissemination of research findings in the fields of philology, linguistics, literary studies, and translation studies;
- expansion of international academic cooperation;
- access for readers to up-to-date scientific information without financial or technical barriers.
The Editorial Board is committed to ensuring transparency in editorial processes, maintaining high standards of peer review, and providing broad accessibility to research outcomes in the field of philological studies.





