Development of the lexico-conceptual knowledge resource for Myanmar NLP applications

Authors

  • Soe Lai Phyue, University of Computer Studies, Mandalay, Myanmar

Keywords:

Lexico-conceptual, Myanmar, NLP

Abstract

Computational lexicons and semantic resources such as WordNets are essential knowledge resources for Natural Language Processing (NLP). A computational lexicon matters even more for languages with rich morphology, where it is expected to give morphological analyzers enough information to process intricately inflected forms correctly. This paper presents the semi-automatic construction of a multipurpose lexico-conceptual knowledge resource for NLP systems, acquiring lexical and conceptual knowledge from the English WordNet and Myanmar<->English Machine Readable Dictionaries (MRDs). To build the WordNet-like lexicon, translation links, semantic meanings, and synset links are collected from the existing bilingual MRDs and the English WordNet. The collected links and their meanings are then verified manually. In addition, morphological processors (analyzers and generators) are provided to handle inflectional morphology. The resulting resources are efficient and effective for NLP applications, and the methodology used to build the Myanmar<->English computational lexicon and the Myanmar WordNet lexical database reduces the human effort and time required.
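As a rough illustration of the link-collection step described in the abstract, the following minimal sketch proposes candidate synset links for each Myanmar headword by passing its English translations through an English synset index. It is not the authors' implementation: the dictionary entries, the synset identifiers, and the function name `candidate_links` are all hypothetical placeholders, and the ranking by translation support is an assumed heuristic. Every candidate it produces would still need the manual verification the abstract calls for.

```python
from collections import Counter

# Toy bilingual MRD: Myanmar headword -> English translations.
# All keys and values are hypothetical placeholders, not data from the paper.
MRD = {
    "MM_WORD_A": ["dog", "hound"],
    "MM_WORD_B": ["bank"],
}

# Toy English index: English lemma -> synset identifiers.
# The identifiers are invented for illustration; they are not real WordNet IDs.
EN_SYNSETS = {
    "dog": ["SYN_CANINE", "SYN_PURSUE"],
    "hound": ["SYN_CANINE", "SYN_HUNTING_DOG"],
    "bank": ["SYN_RIVERSIDE", "SYN_FINANCE"],
}


def candidate_links(mrd, en_synsets):
    """Return {myanmar_word: [(synset_id, support), ...]} ranked by support.

    support counts how many English translations of the headword
    belong to that synset (an assumed ranking heuristic).
    """
    links = {}
    for mm_word, translations in mrd.items():
        support = Counter()
        for en_lemma in translations:
            # Lemmas missing from the index contribute no candidates.
            for synset_id in en_synsets.get(en_lemma, []):
                support[synset_id] += 1
        # Highest support first; ties broken by identifier for determinism.
        links[mm_word] = sorted(support.items(), key=lambda kv: (-kv[1], kv[0]))
    return links


links = candidate_links(MRD, EN_SYNSETS)
for word, candidates in links.items():
    print(word, candidates)
```

In this sketch, a synset reached through several translations of the same headword ranks above one reached through a single translation, which gives a human verifier a natural order in which to review candidates. Ambiguous headwords such as the second toy entry keep all of their candidates with equal support, so the choice is left to manual checking.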

Published

2020-03-20

How to Cite

Lai Phyue, S. (2020). Development of the lexico-conceptual knowledge resource for Myanmar NLP applications. Scientific Journal of Pure and Applied Sciences, 9(3), 915-925. Retrieved from http://sjournals.com/index.php/sjpas/article/view/178

Section

Computer and Information Science