Modeling a system for automatic processing of text in the Tajik language

Khurshed Khudoyberdiev

Abstract


The article deals with the modeling of processes in the system of automatic processing of text information in the Tajik language. It is proposed a methodology for the formation of text information processing processes in the TajLINGVO system. Also, it is proposed a logical structure of the system with a detailed description of each sub-process. The structure of the information model of the system is deciphered, which consists of a set of interconnected text elements in natural language. Functional model of the TajLINGVO system is proposed with a detailed description of the diagrams of use cases, activities and classes based on the UML modeling language. The utilization effectiveness of the proposed models has acquired their evidence and implementation in solving specific applied problems, such as the development of a computer thesaurus, automatic spelling, speech synthesis and machine translation for the Tajik language. The results obtained and the developed information systems are available on the Internet at www.tajlingo.tj.


Full Text:

PDF (Russian)

References


A.G.Sboev, R.B.Rybka, I.I.Ivanov. Chislennoe modelirovanie protsedury sintaksicheskogo razbora s ispol'zovaniem neironnykh setei. Vestnik VGU. Seriya: Lingvistika i mezhkul'turnaya kommunikatsiya. 2015. № 3. – pp. 28-33.

L.A. Kuznetsov, A.V. Kapnin. Tekhnologiya avtomaticheskogo formirovaniya tezaurusa russkogo yazyka. Informatsionnye sistemy i tekhnologii. 2012. № 4 (72). – pp. 14-19.

Mikhailov D.V. Teoreticheskie osnovy postroeniya otkrytykh voprosno-otvetnykh sistem. Semanticheskaya ehkvivalentnost' tekstov i modeli ikh raspoznavaniya: monografiya / D.V. Mikhailov, G.M. Emel'yanov; NoVGU im. Yaroslava Mudrogo. Novgorod, 2010. – 286p.

Voronina I.E. Komp'yuternoe modelirovanie lingvisticheskikh ob"ektov: monografiya / I. E. Voronina. –Voronezh: Izdatel'sko- poligraficheskii tsentr VGU, 2007. – 177p.

Zagoruiko N. G. Prikladnye metody analiza dannykh i znanii. Novosibirsk: IM SO RAN, 1999. – 270p.

Tsirul'nik, L.I. Algoritmy sinteza prosodicheskikh kharakteristik rechi po tekstu v sisteme «Mul'tifoN» / L.I. Tsirul'nik, D.V. Zhadinets, B.M. Lobanov, O.G. Sizonov // Komp'yuternaya lingvistika i intellektual'nye tekhnologii: trudy mezhdunarodnoi konferentsii DialoG’2007, Bekasovo, 30 maya – 3 iyunya 2007 g. – M.: Izdatel'skii tsentr RGGU, 2007. – pp. 550-558.

Anisimov A.V., Marchenko A.A. Sistema obrabotki tekstov na estestvennom yazyke // Iskusstvennyi intellekt. – 2002. – № 4. – pp. 157-163.

Klyshinskiy EH.S. Nachal'nye ehtapy analiza teksta / EH.S. Klyshinskii // Avtomaticheskaya obrabotka tekstov na estestvennom yazyke i komp'yuternaya lingvistika: ucheb. posobie. – M.: MIEHM, 2011. – pp. 106-140.

Avtomaticheskaya obrabotka tekstov na estestvennom yazyke i komp'yuternaya lingvistika: ucheb. posobie / E.I. Bol'shakova, EH.S. Klyshinskiy, D.V. Landeh, A.A. Noskov, O.V. Peskova, E.V. Yagunova. – M.: MIEHM, 2011. -272p.

Belonogov, G.G. Komp'yuternaya lingvistika i perspektivnye informatsionnye tekhnologii / G.G. Belonogov. – M.: Russkii mir, 2004. – 248p.

Palagin A.V. Ontologicheskie metody i sredstva obrabotki predmetnykh znanii. – [monografiya] – Lugansk: izd-vo VNU im. V. Dalya, 2012. – 323p.

Formal'nye modeli i sistemy v vychislitel'noi lingvistike. D.SH. Suleimanov, O.A. Nevzorova, P.I. Sosnin, L.N. Belyaeva, N.V. Lukashevich, S.G. Tatevosov: Nauchnoe izdanie / Pod redaktsiei P. I. Sosnina, O. A. Nevzorovoi – Akademiya nauk RT, Institut prikladnoi semiotiki AN RT. – Kazan': 2016. – 187p.

Seleznev K., Vladimirov A. Lingvistika i obrabotka tekstov // Otkrytye sistemy. -2013. №04. – pp.46-49. https://www.osp.ru/os/2013/04/1303556 (data obrashcheniya 20.10.2022)

Usmanov Z.D. Ob odnom tsifrovom portrete teksta i ego prilozhenii / Z. D. Usmanov // Politekhnicheskii vestnik. Seriya: Intellekt. Innovatsii. Investitsii. – 2019. – № 3(47). – pp. 35-38.

Usmanov Z.D. Avtomaticheskii poisk i statisticheskie zakonomernosti mnozhestva anagramm / Z.D. Usmanov. – Dushanbe: "Donish", 2020. – 75p.

Usmanov Z.D., Khudoyberdiev KH.A. Nizomhoi khudkori korkardi ma"lumot bo zaboni tojiki. Monografiya. Khujand. «Irfon», 2022, – 186p.

Khudoyberdiev KH.A. Web-prilozhenie «Avtomaticheskie sistemy obrabotki informatsii na tadzhikskom yazyke» www.tajlingvo.tj. – Svidetel'stvo o gosudarstvennoi registratsii informatsionnogo resursa, Respublika Tadzhikistan. №4202200496, 28/04/2022.


Refbacks

  • There are currently no refbacks.


Abava  Кибербезопасность IT Congress 2024

ISSN: 2307-8162