Abebe Belew Kassahun
Design of Named Entity Recognition Model for Awingi Language Using Deep Learning Approach
Kassahun, Abebe Belew; Derb, Eshetie; Ayalew, Amogne Andualem; Salau, Ayodeji Olalekan; Braide, Sepiribo Lucky; Buhari, Murtala Aminu
Authors
Eshetie Derb
Amogne Andualem Ayalew
Ayodeji Olalekan Salau
Sepiribo Lucky Braide
Murtala Aminu Buhari
Abstract
Named entity recognition is a field of NLP that aims to detect and extract relevant information from unstructured text documents. However, some Ethiopian languages were getting started in different research fields. One of the languages spoken in Ethiopia is called Awingi language has a rich morphology and it is highly lacking in computational linguistic tools. There is some NER research done in Ethiopian language using different approaches and classification techniques, but since there was a lack of study conducted on NER for the Awingi language. This study aims to design a named entity recognition model for the Awngi Language using a deep learning approach with word2vect word embedding techniques. The data collected from AMICO Hiber Awiꬼgi program was posted on social media pages, Awngi Language and Literature Department of Injibara University, and Awi Zone Educational and Training Department. Therefore, the study uses a newly tagged dataset with 37,133 tokens. Presently, researchers focused on discovering four main named entities such as persons, locations, Dates, and organizations from unstructured Awiꬼgi text by using the three algorithms namely convolutional neural networks, long short-term memory, and bi-directional long-short-term memory. For the feature engineering task, Word2vec and one-hot encoding are utilized. To identify the best-performing model, the researchers have conducted several experiments. Over other algorithms, the Bi-LSTM algorithm before sampling and word embedding performed better results, i.e 95.4%, 92.4%, 95.3%, and 91.2% Accuracy, precision, recall, and f1-score, respectively, and after sampling and word embedding it performed with 99.4%, 97.3%, 99.2%, and 97.2% Accuracy, precision, recall, and f1-score, respectively.
Presentation Conference Type | Conference Paper (published) |
---|---|
Conference Name | 2024 IEEE 5th International Conference on Electro-Computing Technologies for Humanity (NIGERCON) |
Start Date | Nov 26, 2024 |
End Date | Nov 28, 2024 |
Publication Date | Mar 24, 2025 |
Deposit Date | Apr 16, 2025 |
Journal | Proceedings of the International Conference on Emerging & Sustainable Technologies for Power & ICT in a Developing Society |
Electronic ISSN | 2377-2697 |
Peer Reviewed | Peer Reviewed |
Book Title | 2024 IEEE 5th International Conference on Electro-Computing Technologies for Humanity (NIGERCON) |
ISBN | 979-8-3315-4256-6 |
DOI | https://doi.org/10.1109/nigercon62786.2024.10927049 |
You might also like
A Comparative Study on the Characteristics and Behaviours of Social Media Users in Response to Fake News on X(Twitter)
(2024)
Presentation / Conference Contribution
Performance Analysis of Different Shape Planner Inverted F Antenna for Unmanned Aerial Vehicle
(2025)
Presentation / Conference Contribution
Downloadable Citations
About USIR
Administrator e-mail: library-research@salford.ac.uk
This application uses the following open-source libraries:
SheetJS Community Edition
Apache License Version 2.0 (http://www.apache.org/licenses/)
PDF.js
Apache License Version 2.0 (http://www.apache.org/licenses/)
Font Awesome
SIL OFL 1.1 (http://scripts.sil.org/OFL)
MIT License (http://opensource.org/licenses/mit-license.html)
CC BY 3.0 ( http://creativecommons.org/licenses/by/3.0/)
Powered by Worktribe © 2025
Advanced Search