Development of Corpus-Based Tatar-Russian Socio-Political Dictionary of Collocations

A. Galieva, O. Nevzorova


This paper discusses the main sources and the methodology used to compile the Tatar-Russian Socio-Political Dictionary of Collocations. Collocations occupy a particularly important place in the language system, and their well-known language-specificity calls for bilingual collocation dictionaries. The socio-political domain is one of the most dynamically developing spheres of present-day life: its vocabulary is growing rapidly and is being enriched with new lexical items and senses that reflect the realities of the time. The Dictionary is based on data from the available corpora of the Tatar language and is built as a collocation dictionary. Linguistic data are selected by both objective criteria (frequency in the corpus) and subjective ones (assessment of a word's thematic, stylistic and collocational value). The main unit of the Dictionary is the noun phrase formed by filling one of the possible semantic-syntactic positions of the headword and meeting the criterion of semantic completeness. As an exception, we also include certain combinations of headwords with postpositions derived from nouns, provided that the corresponding collocations are typical of socio-political discourse. Special attention is paid to a distinguishing feature of the contemporary Tatar lexicon: synonymy.


ISSN: 2307-8162