A Bagheri
Sentiment classification in Persian: Introducing a mutual information-based method for feature selection
Bagheri, A; Saraee, MH; de Jong, F
Abstract
With the enormous growth of online reviews in Internet, sentiment analysis has received more and more attention in information retrieval and natural language processing community. Up to now there are very few researches conducted on sentiment analysis for Persian documents. This paper considers the problem of sentiment classification for online customer reviews in Persian language. One of the challenges of Persian language is using of a wide variety of declensional suffixes. Another common problem of Persian text is word spacing. In Persian in addition to white space as interwords space, an intra-word space called pseudo-space separates word's part. One more noticeable challenge in customer reviews in Persian language is that of utilizing many informal or colloquial words in text. In this paper we study these challenges by proposing a model for sentiment classification of Persian review documents. The proposed model is based on a lemmatization approach for Persian language and is employed Naive Bayes learning algorithm for classification. Additionally we present a new feature selection method based on the mutual information method to extract the best feature collection from the initial extracted features. Finally we evaluate the performance of the model on a manually gathered collection of cellphone reviews, where the results show the effectiveness of the proposed model.
Citation
Bagheri, A., Saraee, M., & de Jong, F. (2013, May). Sentiment classification in Persian: Introducing a mutual information-based method for feature selection. Presented at 21st Iranian Conference on Electrical Engineering (ICEE), 2013, Mashhad, Iran
Presentation Conference Type | Other |
---|---|
Conference Name | 21st Iranian Conference on Electrical Engineering (ICEE), 2013 |
Conference Location | Mashhad, Iran |
Start Date | May 14, 2013 |
End Date | May 16, 2013 |
Publication Date | Jan 1, 2013 |
Deposit Date | Nov 27, 2013 |
Book Title | 2013 21st Iranian Conference on Electrical Engineering (ICEE) |
DOI | https://doi.org/10.1109/IranianCEE.2013.6599671 |
Publisher URL | http://dx.doi.org/10.1109/IranianCEE.2013.6599671 |
Related Public URLs | http://ieeexplore.ieee.org/xpl/articleDetails.jsp?tp=&arnumber=6599671&queryText%3Dsaraee+bagheri |
Additional Information | Event Type : Conference |
You might also like
Features in extractive supervised single-document summarization: case of Persian news
(2024)
Journal Article
Deriving Environmental Risk Profiles for Autonomous Vehicles From Simulated Trips
(2023)
Journal Article
DeepClean : a robust deep learning technique for autonomous vehicle camera data privacy
(2022)
Journal Article
Machine learning-based optimized link state routing protocol for D2D communication in 5G/B5G
(2022)
Presentation / Conference