Skip to main content

Research Repository

Advanced Search

Arabic text generation : deep learning for poetry synthesis

Hejazi, HD; Khamees, AA; Alshurideh, M; Salloum, SA

Authors

HD Hejazi

AA Khamees

M Alshurideh

SA Salloum



Contributors

H-E Hassanien
Editor

K-C Chang
Editor

T Mincong
Editor

Abstract

Text Generation, especially poetry synthesis, is a promising and challenging AI task. We have used LSTM and word2vec methods to explore this area. We do forward and backward word training with different word sequences lengths. Two Datasets of Arabic poems were used. Preprocessing for unification and cleaning was done too, but the Data size was big and required very high memory and processing, so we used a sub-Datasets for training; this affected our experiments since the model is trained on fewer data. A user-supplied keyword was implemented. We have found the shorter training sequence models were better in generating more meaningful text, and longer models prefer most frequent words, repeat text, and use small words. Best predicted sentences were selected by measuring each of its words conditional probability and multiply them; this avoids local maxima if we used a greedy method that chooses the best next-word only. Moreover, the AraVecword2vec module was not very helpful since it was provided synonyms much more that related words. Many enhancements can be done in the future, such as Arabic prosody constraints, and overcome the hardware issue.

Citation

Hejazi, H., Khamees, A., Alshurideh, M., & Salloum, S. Arabic text generation : deep learning for poetry synthesis. Advances in Intelligent Systems and Computing, 1339, 104-116. https://doi.org/10.1007/978-3-030-69717-4_11

Journal Article Type Conference Paper
Conference Name International Conference on Advanced Machine Learning Technologies and Applications
Conference Location Cairo, Egypt
End Date Mar 22, 2021
Online Publication Date Mar 5, 2021
Deposit Date Jun 22, 2021
Journal Advanced Machine Learning Technologies and Applications : proceedings of AMLTA 2021
Electronic ISSN 2194-5365
Publisher Springer
Volume 1339
Pages 104-116
Series Title Advances in Intelligent Systems and Computing
Series Number 1339
Book Title Advanced Machine Learning Technologies and Applications : proceedings of AMLTA 2021
ISBN 9783030697167-(print);-9783030697174-(ebook)
DOI https://doi.org/10.1007/978-3-030-69717-4_11
Publisher URL https://doi.org/10.1007/978-3-030-69717-4_11
Related Public URLs https://doi.org/10.1007/978-3-030-69717-4
Additional Information Event Type : Conference