Skip to main content

Research Repository

Advanced Search

Design of Named Entity Recognition Model for Awingi Language Using Deep Learning Approach

Kassahun, Abebe Belew; Derb, Eshetie; Ayalew, Amogne Andualem; Salau, Ayodeji Olalekan; Braide, Sepiribo Lucky; Buhari, Murtala Aminu

Authors

Abebe Belew Kassahun

Eshetie Derb

Amogne Andualem Ayalew

Ayodeji Olalekan Salau

Sepiribo Lucky Braide

Murtala Aminu Buhari



Abstract

Named entity recognition is a field of NLP that aims to detect and extract relevant information from unstructured text documents. However, some Ethiopian languages were getting started in different research fields. One of the languages spoken in Ethiopia is called Awingi language has a rich morphology and it is highly lacking in computational linguistic tools. There is some NER research done in Ethiopian language using different approaches and classification techniques, but since there was a lack of study conducted on NER for the Awingi language. This study aims to design a named entity recognition model for the Awngi Language using a deep learning approach with word2vect word embedding techniques. The data collected from AMICO Hiber Awiꬼgi program was posted on social media pages, Awngi Language and Literature Department of Injibara University, and Awi Zone Educational and Training Department. Therefore, the study uses a newly tagged dataset with 37,133 tokens. Presently, researchers focused on discovering four main named entities such as persons, locations, Dates, and organizations from unstructured Awiꬼgi text by using the three algorithms namely convolutional neural networks, long short-term memory, and bi-directional long-short-term memory. For the feature engineering task, Word2vec and one-hot encoding are utilized. To identify the best-performing model, the researchers have conducted several experiments. Over other algorithms, the Bi-LSTM algorithm before sampling and word embedding performed better results, i.e 95.4%, 92.4%, 95.3%, and 91.2% Accuracy, precision, recall, and f1-score, respectively, and after sampling and word embedding it performed with 99.4%, 97.3%, 99.2%, and 97.2% Accuracy, precision, recall, and f1-score, respectively.

Presentation Conference Type Conference Paper (published)
Conference Name 2024 IEEE 5th International Conference on Electro-Computing Technologies for Humanity (NIGERCON)
Start Date Nov 26, 2024
End Date Nov 28, 2024
Publication Date Mar 24, 2025
Deposit Date Apr 16, 2025
Journal Proceedings of the International Conference on Emerging & Sustainable Technologies for Power & ICT in a Developing Society
Electronic ISSN 2377-2697
Peer Reviewed Peer Reviewed
Book Title 2024 IEEE 5th International Conference on Electro-Computing Technologies for Humanity (NIGERCON)
ISBN 979-8-3315-4256-6
DOI https://doi.org/10.1109/nigercon62786.2024.10927049