Call for Paper - January 2024 Edition
IJCA solicits original research papers for the January 2024 Edition. Last date of manuscript submission is December 20, 2023. Read More

Building a New Tourism Sentiment Lexicon Containing Descriptive Words in Modern Standard and Colloquial Arabic

International Journal of Computer Applications
Foundation of Computer Science (FCS), NY, USA
Year of Publication: 2022
Mohammed Alkoli, B. Sharada, Sami A.M. Alquhali

Mohammed Alkoli, B Sharada and Sami A M Alquhali. Building a New Tourism Sentiment Lexicon Containing Descriptive Words in Modern Standard and Colloquial Arabic. International Journal of Computer Applications 183(53):29-31, February 2022. BibTeX

	author = {Mohammed Alkoli and B. Sharada and Sami A.M. Alquhali},
	title = {Building a New Tourism Sentiment Lexicon Containing Descriptive Words in Modern Standard and Colloquial Arabic},
	journal = {International Journal of Computer Applications},
	issue_date = {February 2022},
	volume = {183},
	number = {53},
	month = {Feb},
	year = {2022},
	issn = {0975-8887},
	pages = {29-31},
	numpages = {3},
	url = {},
	doi = {10.5120/ijca2022921951},
	publisher = {Foundation of Computer Science (FCS), NY, USA},
	address = {New York, USA}


In tourism industry, sentiment analysis is emerging as a technology that can be used to assess the sentiments of the tourists based on their responses on different social media sites or platforms. Sentiment analysis is an important and helpful technique for decision makers to evaluate services and identify problems and deficiencies.

Too many studies have been done on this field in other languages, but in Arabic the number of studies is limited. In addition, such studies on Arabic examine each dialect of Arabic separately and no single study includes sentiment analysis examination of a group or some varieties of Arabic dialects along with Modern Standard Arabic (MSA). The previous studies also do not address the different Arabic dialects, therefore the researches here think there should be a study that includes sentiment analysis of a number of Arabic dialects along with Modern Standard Arabic, the current research paper is an example, as most Arabs (Arab people) express their opinions in their dialects of Arabic and few use Modern Standard Arabic.

The main goal of this research paper is to build a new sentiment analysis lexicon based on the opinions of Arab tourists visiting India. This lexicon includes lexemes (words or vocabularies) taken of three Arabic dialects (namely: Gulf, Levantine and Egyptian dialects of Arabic) along with Modern Standard Arabic. The lexicon will be also evaluated by comparing it to the existing one, namely SemEval2016, using a machine learning technique called Support Vector Classifier for obtaining better results. Thus, building a new dictionary will be effective in sentiment analysis in modern Arabic and most Arabic dialects.


  1. Alaei, A. R., Becken, S., & Stantic, B. (2019). Sentiment analysis in tourism: capitalizing on big data. Journal of Travel Research, 58(2), 175-191.
  2. Altawaier, M. M., & Tiun, S. (2016). Comparison of machine learning approaches on arabic twitter sentiment analysis. International Journal on Advanced Science, Engineering and Information Technology, 6(6), 1067-1073.
  3. Dave, K., Lawrence, S., & Pennock, D. M. (2003, May). Mining the peanut gallery: Opinion extraction and semantic classification of product reviews. In Proceedings of the 12th international conference on World Wide Web (pp. 519-528).
  4. Boudad, N., Faizi, R., Thami, R. O. H., & Chiheb, R. (2018). Sentiment analysis in Arabic: A review of the literature. Ain Shams Engineering Journal, 9(4), 2479-2490.
  5. Al-Kabi M. N., Gigieh A. H., Alsmadi I. M., Wahsheh H. A., and Haidar M. M., "Opinion Mining and Analysis for Arabic Language", International Journal of Advanced Computer Science and Applications (IJACSA), 5: 181- 195, (2014)
  6. Hamed O. and Zesch T. "The Role of Diacritics in Designing Lexical Recognition Tests for Arabic", In: Proceedings of the 3rd International Conference on Arabic Computational Linguistics, ACLing 2017, Dubai, United Arab Emirates, 119-128, (2017)
  7. Al-Twairesh, N., Al-Khalifa, H., Al-Salman, A., & Al-Ohali, Y. (2017). Arasenti-tweet: A corpus for arabic sentiment analysis of saudi tweets. Procedia Computer Science, 117, 63-72.
  8. Abdulla, N. A., Ahmed, N. A., Shehab, M. A., & Al-Ayyoub, M. (2013, December). Arabic sentiment analysis: Lexicon-based and corpus-based. In 2013 IEEE Jordan conference on applied electrical engineering and computing technologies (AEECT) (pp. 1-6). IEEE.
  9. Nabil, M., Aly, M., & Atiya, A. (2015, September). Astd: Arabic sentiment tweets dataset. In Proceedings of the 2015 conference on empirical methods in natural language processing (pp. 2515-2519).
  10. El-Beltagy, S. R., & Ali, A. (2013, March). Open issues in the sentiment analysis of Arabic social media: A case study. In 2013 9th International Conference on Innovations in Information Technology (IIT) (pp. 215-220). IEEE.
  11. El-Beltagy, S. R. (2016, May). Nileulex: A phrase and word level sentiment lexicon for egyptian and modern standard arabic. In Proceedings of the Tenth International Conference on Language Resources and Evaluation (LREC'16) (pp. 2900-2905).
  12. Abidi, K., & Smaïli, K. (2018, May). An automatic learning of an algerian dialect lexicon by using multilingual word embeddings. In 11th edition of the Language Resources and Evaluation Conference, LREC 2018.
  13. Kiritchenko, S., Mohammad, S., & Salameh, M. (2016, June). Semeval-2016 task 7: Determining sentiment intensity of english and arabic phrases. In Proceedings of the 10th international workshop on semantic evaluation (SEMEVAL-2016) (pp. 42-51).


Arabic Dialects; Modern Standard Arabic; Sentiment Analysis; Tourism; Support Vector Classifier.