International Journal of Computer Applications |
Foundation of Computer Science (FCS), NY, USA |
Volume 185 - Number 33 |
Year of Publication: 2023 |
Authors: Ikechukwu Ignatius Ayogu |
10.5120/ijca2023922810 |
Ikechukwu Ignatius Ayogu . An Exploratory Study of Stacked Multilingual SMT Systems for Low Resource Languages. International Journal of Computer Applications. 185, 33 ( Sep 2023), 1-5. DOI=10.5120/ijca2023922810
The indigenous capacity for the development of computational linguistics tools for Nigerian languages is yet low compared to what has been achieved in other multi-ethno-linguistic nations such as India. Effective communication among Nigerian citizens of different tongues, and who are unable to use English has been continuously hampered. Thus the need to inter-translate Nigerian languages has become increasingly urgent. Though machine translation (MT) research has achieved state-of-the-art for English and some few privileged languages of the world, the lack of datasets for many Nigerian languages further increases the difficulty of developing MT systems for them. This paper proposes a model for rapidly developing MT system for a new language in a multilingual setup. The overall aim of this research is to establish a scalable platform for the continuous development of MT systems for Nigerian languages using English language as a pivot language. For ease of adaptation and inclusion of a new language, purely datadriven approaches that carefully avoids absolute dependence on the availability of linguistic expertise is adopted. This paper presents a multilingual translation system for English, Igbo, and Yor`ub´a language mix. Using a research dataset, an overall best BLEU score of 35.62 was obtained for the English-Igbo system, 32.10 for English- Yor`ub´a system, and 21.03 for Igbo-Yor`ub´a. These results are encouraging, given the size of the training corpora used.