CFP last date
22 April 2024
Reseach Article

Performance Analysis of Machine Learning Algorithms for Movie Review

by Arafat Habib Quraishi
International Journal of Computer Applications
Foundation of Computer Science (FCS), NY, USA
Volume 177 - Number 36
Year of Publication: 2020
Authors: Arafat Habib Quraishi
10.5120/ijca2020919839

Arafat Habib Quraishi . Performance Analysis of Machine Learning Algorithms for Movie Review. International Journal of Computer Applications. 177, 36 ( Feb 2020), 7-10. DOI=10.5120/ijca2020919839

@article{ 10.5120/ijca2020919839,
author = { Arafat Habib Quraishi },
title = { Performance Analysis of Machine Learning Algorithms for Movie Review },
journal = { International Journal of Computer Applications },
issue_date = { Feb 2020 },
volume = { 177 },
number = { 36 },
month = { Feb },
year = { 2020 },
issn = { 0975-8887 },
pages = { 7-10 },
numpages = {9},
url = { https://ijcaonline.org/archives/volume177/number36/31137-2020919839/ },
doi = { 10.5120/ijca2020919839 },
publisher = {Foundation of Computer Science (FCS), NY, USA},
address = {New York, USA}
}
%0 Journal Article
%1 2024-02-07T00:47:53.384080+05:30
%A Arafat Habib Quraishi
%T Performance Analysis of Machine Learning Algorithms for Movie Review
%J International Journal of Computer Applications
%@ 0975-8887
%V 177
%N 36
%P 7-10
%D 2020
%I Foundation of Computer Science (FCS), NY, USA
Abstract

In this paper, we have evaluated the performance of four machine learning algorithms in terms of sentiment analysis in the IMDB review dataset. Among these algorithms, two are neural network based and two are non-neural network based. We used binary classification for sentiment analysis in IMDB reviews and examined all the four algorithms to detect whether the sentiment of the text is positive or negative. Among the neural network based approaches, we applied LSTM and GRU. We found that GRU performed better than LSTM. Among the non-neural network based algorithms, we applied Multinomial Naïve Bayes and Support Vector Machine. We found that SVM outperformed Multinomial Naïve Bayes. Among these four algorithms, GRU performed the best with an accuracy of 89.0%.

References
  1. Agarwal, A., Xie, B., Vovsha, I., Rambow, O., & Passonneau, R. (2011). Sentiment analysis of Twitter data. In Proceedings of the Workshop on Languages in Social Media, LSM '11, ACL, 30-38.
  2. Basiri, M., Ghasem-Aghae, N., & Naghsh-Nilchi, A. (2014). Exploiting reviewers’ comment histories for sentiment analysis. Journal of Information Science, 40(3), 313-328.
  3. Bengio, Y., Simard, P., & Frasconi, P. (1994). Learning long-term dependencies with gradient descent is difficult. IEEE Transactions on Neural Networks, 157-166.
  4. Hutto, C., & Gilbert, E. (2014). VADER: A Parsimonious Rule-Based Model for Sentiment Analysis of Social Media Text. In International AAAI Conference on Weblogs and Social Media, AAAI, 216-225.
  5. Kouloumpis, E., Wilson, T. & Moore, J., (2011). Twitter Sentiment Analysis: The Good the Bad and the OMG! In Proceedings of the Fifth International Conference on Weblogs and Social Media, 538-541.
  6. Lin, C., & He, Y. (2009). Joint sentiment/topic model for sentiment analysis. In Proceedings of the 18th ACM conference on Information and knowledge management, CIKM '09, ACM, 375-384.
  7. Liu, Y., Huang, X., An, A., & Yu, X. (2008). Modeling and Predicting the Helpfulness of Online Reviews, in Eighth IEEE International Conference on Data Mining, Pisa, 443-452.
  8. Maas, A. L., Daly, R. E., Pham, P. T., Huang, D., Ng, A. Y., & Potts, C. (2011). Learning word vectors for sentiment analysis. In Proceedings of the 49th Annual Meeting of the Association for Computational Linguistic, ACL, 142-150.
  9. Melville, P., Gryc, W., & Lawrence R. D. (2009). Sentiment analysis of blogs by combining lexical knowledge with text classification. In Proceedings of the 15th ACM SIGKDD international conference on Knowledge discovery and data mining, KDD '09, ACM, 1275-1284.
  10. Paltoglou, G., & Thelwall, M. (2010). A study of information retrieval weighting schemes for sentiment analysis. In Proceedings of the 48th Annual Meeting of the Association for Computational Linguistics, ACL '10. ACL, 1386-1395.
  11. Pang, B., Lee, L., & Vaithyanathan, S. (2002). Thumbs up?: sentiment classification using machine learning techniques. In Proceedings of the ACL-02 conference on Empirical methods in natural language processing, EMNLP '02, ACL, 79-86.
  12. Prabowo, R., & Thelwall, M. (2009). Sentiment analysis: A combined approach. Journal Of Informetrics, 3(2), 143-157.
  13. Turney, P. D. (2002). Thumbs up or thumbs down?: semantic orientation applied to unsupervised classification of reviews. In Proceedings of the 40th Annual Meeting on Association for Computational Linguistics, ACL '02, ACL, 417-424.
  14. Turney, P. D., & Littman, M. L. (2003). Measuring praise and criticism: Inference of semantic orientation from association. ACM Transactions on Information Systems (TOIS), 21(4), 315-346.
  15. Wilson, T., Wiebe, J., & Hoffmann, P. (2005). Recognizing contextual polarity in phrase-level sentiment analysis. In Proceedings of the conference on Human Language Technology and Empirical Methods in Natural Language Processing, HLT '05, ACL, 347-354.
  16. Laskar, M. T. R., Hossain, M. T., Kamal, A. R. M., & Rashid, N. (2016). Automated Disease Prediction System (ADPS): A User Input-based Reliable Architecture for Disease Prediction. International Journal of Computer Applications, 975, 8887.
  17. Yadav, S., Ekbal, A., Saha, S., & Bhattacharyya, P. (2018, May). Medical sentiment analysis using social media:towards building a patient assisted system. In Proceedings of the Eleventh International Conference on Language Resources and Evaluation (LREC 2018).
  18. Devlin, J., Chang, M. W., Lee, K., & Toutanova, K. (2018). Bert: Pre-training of deep bidirectional transformers for language understanding. arXiv preprint arXiv:1810.04805.
  19. Pennington, J., Socher, R., & Manning, C. (2014, October). Glove: Global vectors for word representation. In Proceedings of the 2014 conference on empirical methods in natural language processing (EMNLP) (pp. 1532-1543).
Index Terms

Computer Science
Information Sciences

Keywords

IMDB Reviews Sentiment Analysis Neural Network LSTM GRU SVM Naïve Bayes.