Automated grammar checking of text in Korean

Anna Elfimova, Irina Polyakova

Abstract


Recently, more and more attention has been attracted to Asian countries, in particular South Korea. As a result, learning the Korean language is becoming more popular in society. However, not everyone is ready to attend language learning courses or hire a tutor. In this regard, applications for independent study of foreign languages began to develop rapidly. There are very few such applications for the Korean language, and their tools are poor. In this paper, an attempt was made to develop and implement a program that automatically checks text in Korean for compliance with grammar rules. With the help of such program, anyone can independently check their text and correct possible errors in it. To automate the verification of texts in Korean for compliance with grammar rules, it was decided to build a model of the grammar of the language, select the most suitable morphological analyzer, and also select the most suitable algorithm for syntactic parsing of texts. In addition, syntactic parser should be modified to work with Korean language. Thus, it was decided to adapt it for the correct syntactic analysis of the Korean language according to the constructed grammar and implement it. The resulting program allows you to analyze a text in Korean and automate its verification for compliance with grammar rules. With the help of such program, students studying Korean or anyone who wants to check themselves can independently study their text and correct possible errors. To improve the program in the future, the set of grammar rules can be expanded. The more rules, the better the program will cope with more complex sentences. By expanding the model, you can make the program work with a larger number of grammar rules and peculiars of Korean language. Also, you can try to improve the result of morphological analysis by combining the use of morphological analyzers. In addition, one of the possible ways to improve and develop the resulting analyzer is to detect typical errors in the text.  The program will be more user-friendly if it also displays a message about where the error was made, what type of error, and how it can be corrected.

 


Full Text:

PDF (Russian)

References


RPORC [electronic resource] – Electronic data – URL: https://wciom.ru/analytical-reviews/analiticheskii-obzor/inostrannyj-yazyk-perspektivnaya-investicziya (Date of access: 05.02.2025)

Duolingo [electronic resource] – Electronic data - URL: https://blog.duolingo.com/2023-duolingo-language-report/ (Date of access: 03.11.2024)

Modern Language Association [electronic resource] – Electronic data – URL: https://www.mla.org/Resources/Guidelines-and-Data/Reports-and-Professional-Guidelines/Enrollments-in-Languages-Other-Than-English-in-United-States-Institutions-of-Higher-Education (Date of access: 05.10.2024)

Foltz, P.W., Laham, D. and Landauer, T.K. (1999). The intelligent essay assessor: Applications to educational technology. Interactive Multimedia Electronic Journal of Computer-Enhanced Learning (IMEJ) 1(2), 939–944 p.

Aho. A. V., Lam M. S., Sethi R., Ulman J. D. Compilers: Principles, Techniques, and Tools – Publisher «Pearson Education, Inc», 2006. – 1128 p.

Chomsky N. Syntactic Structures – Publisher «Mouton & Co». – 1957 – 157 p.

Volkova I.A., Vylitok A.A., Rudenko T.V. Formal Grammars and Languages. Elements of Translation Theory: A Textbook for 2nd-Year Students. – M.: Publisher «MAX Press», 2009 – 114 p.

Ahn. J., Lee K., Han H. Korean Grammar in Use Beginning to Early Intermediate. – Darakwon, 2010. – 345 p.

Cook D. J., Bays G. Computer mathematics. – M.: Publisher «Nauka», 1990. – 385 p.

KoNLPy documentation [electronic resource] – Electronic data – URL: https://konlpy.org/en/latest/ (Date of access: 15.09.2024)

Park, E.L.; Cho, S. KoNLPy: Korean Natural Language Processing in Python - In Proceedings of the Annual Conference on Human and Language Technology, Chuncheon, Republic of Korea, 10–11 October 2014; - 133–136 p.

Lee, D.; Yeon, J.; Hwang, I.; Lee, S. KKMA : A Tool for Utilizing Sejong Corpus Based on Relational Database – 2010 - 1046–1050 p.

Wumaier, A.; Yibulayin, T.; Kadeer, Z.; Tian, S. Conditional Random Fields Combined FSM Stemming Method for Uyghur - In Proceedings of the 2009 2nd IEEE International Conference on Computer Science and Information Technology, ICCSIT - 2009, Beijing, China, 8–11 August 2009 - 295–299 c.

Hopcroft J., Motwani R., Ullman J.D. Introduction to Automata Theory, Languages, and Computation. – Publisher: «Addison-Wesley», 2001. — 521 p.

Aho A., Ulman J.D. The Theory of Parsing, Translation, and Compiling. – Publisher «

Prentice-Hall, Inc. », 1972. – 2051 p.

Younger D., Daniel H. Recognition and parsing of context-free languages in time n3 // Information and Control. – 1967. Т. 10,vol. 2. – p. 189–202.

Serebryakov V., Galochkin V., Gochar D., Furugyan M. Theory and implementation of programming languages - M.: Publisher: «MZ Press», 2006. – 358 p.

Earley J. An efficient context-free parsing algorithm // Communications of the ACM. – 1970. Т. 13, vol. 2. – p. 94–102.

Parsing: a timeline // Ocean of Awareness blog [electronic resource]. – 2014 – URL: http://jeffreykegler.github.io/Ocean-of-Awareness-blog/individual/2014/09/chron.html (Date of access: 05.12.2024)

Kuzina L.N., Polyakova I.N. Object-oriented programming - 2013.

Gamma E. Design Patterns: Elements of Reusable Object-Oriented Software. – Addison-Wesley Professional, 1994.– 395 с.

Python documentation [electronic resource] - Electronic data – URL: https://docs.python.org/3/index.html (Date of access: 08.11.2024)

Baklanova M.A., Khokhlova E.A., Yu Zho Yeon. Korean Language Basic Course: Study Guide. – M.: Publisher: HSE, 2021. – 360 p.

Ahn Chinmyeon, Lee Kyunga, Han Huyeon. Korean Grammar for Beginners. – M.: Publisher AST, 2021. – 384 p.

Lee Yeondo. Dragon Raja. – Publisher: «Golden Branch», 2012.

Hyun Jing-gon. Lucky Day. – M.: Publisher: «AST», 2019. – 224 p.

Jeong Yu-jeong. Seven Years' Night. – M.: Publisher: «AST», 2021. – 528 p.

Samsung. [electronic resource]. – Electronic data – URL: https://www.samsung.com/global/galaxy/galaxy-z-flip4. (Date of access: 21.09.2024)

Daum. [electronic resource]. – Electronic data – URL: https://www.daum.net. (Date of access: 03.02.2025)

Naver News [electronic resource] – Electronic data –URL: https://news.naver.com. (Date of access: 05.01.2025)


Refbacks

  • There are currently no refbacks.


Abava  Кибербезопасность ИБП для ЦОД СНЭ

ISSN: 2307-8162