CFP last date
20 May 2024
Reseach Article

Arabic Sentences Classification via Deep Learning

by Dania Sagheer, Fadel Sukkar
International Journal of Computer Applications
Foundation of Computer Science (FCS), NY, USA
Volume 182 - Number 5
Year of Publication: 2018
Authors: Dania Sagheer, Fadel Sukkar
10.5120/ijca2018917555

Dania Sagheer, Fadel Sukkar . Arabic Sentences Classification via Deep Learning. International Journal of Computer Applications. 182, 5 ( Jul 2018), 40-46. DOI=10.5120/ijca2018917555

@article{ 10.5120/ijca2018917555,
author = { Dania Sagheer, Fadel Sukkar },
title = { Arabic Sentences Classification via Deep Learning },
journal = { International Journal of Computer Applications },
issue_date = { Jul 2018 },
volume = { 182 },
number = { 5 },
month = { Jul },
year = { 2018 },
issn = { 0975-8887 },
pages = { 40-46 },
numpages = {9},
url = { https://ijcaonline.org/archives/volume182/number5/29762-2018917555/ },
doi = { 10.5120/ijca2018917555 },
publisher = {Foundation of Computer Science (FCS), NY, USA},
address = {New York, USA}
}
%0 Journal Article
%1 2024-02-07T01:10:31.210157+05:30
%A Dania Sagheer
%A Fadel Sukkar
%T Arabic Sentences Classification via Deep Learning
%J International Journal of Computer Applications
%@ 0975-8887
%V 182
%N 5
%P 40-46
%D 2018
%I Foundation of Computer Science (FCS), NY, USA
Abstract

This paper presents a Convolutional Neural Network CNN Models to classify Arabic sentences into three topics. These sentences are derived from Essex Arabic Summaries Corpus (EASC) corpus, tokenized to words and transformed to sequences of word indices. All sequences are padded to be in the same length. The models of Convolution Neural Network are built on top of word embedding layer. The word embedding layer is either pre-trained or jointed into the model. Dropout and l2 weight regularization are used to overcome the overfitting during training. The CNN models achieve high performance in accuracy for Arabic sentences classification.

References
  1. Sagheer Dania, Sukkar Fadel, "A hybrid Intelligent System for Abstractive summarization”, International Journal of computer Applications, vol (168),No (9), June, 2017.
  2. Xiang Zhang,Yann LeCun ,“Text understanding from scratch”, arXiv:1502.01710v5 [cs.LG], Apr, 2016.
  3. Aris Kosmopoulos, “large scale hierarchical text classification”, Ph.D. thesis department of informatics Athens university of economics and business, 2015.
  4. Mrs. Manisha Pravin Mali, Dr. Mohammad Atique, "Applications of Text Classification using Text Mining", International Journal of Engineering Trends and Technology (IJETT), Volume 13, Number 5, Jul 2014.
  5. Ian Goodfellow, Yoshua Bengio, Aaron Courville, “Deep Learning”, An MIT Press book, 2016
  6. Marc Moreno, Lopez,Jugal Kalita "Deep Learning applied to NLP", arXiv:1703.03091v1 [cs.CL] 9 Mar 2017
  7. Francois chollet, “Deep learning with python”, Manning publication Co, 2018
  8. PengWang, JiamingXu, BoXu,Cheng-LinLiu, HengZhang FangyuanWang, HongweiHao, “Semantic Clustering and Convolutional Neural Network for Short Text Categorization”, Proceedings of the 53rd Annual Meeting of the Association for Computational Linguistics and the 7th International Joint Conference on Natural Language Processing (Short Papers), pages 352–357, Beijing, China, July 26-31, 2015.
  9. Daojian Zeng, Kang Liu, Siwei Lai, Guangyou Zhouand JunZhao, “Relation Classification via Convolutional Deep Neural Network”, Proceedings of COLING 2014, the 25th International Conference on Computational Linguistics: Technical Papers, pages 2335–2344, Dublin, Ireland, 2014.
  10. Yoon Kim, “Convolutional Neural Networks for Sentence Classification”, arXiv:1408.5882v2 [cs.CL], 2016
  11. Ye Zhang, Byron C. Wallace, "A Sensitivity Analysis of (and Practitioners’ Guide to) Convolutional Neural Networks for Sentence Classification", arXiv:1510.03820v4 [cs.CL], 2016
  12. NalK alchbrenner, Edward Grefenstette, Phil Blunsom, “A Convolutional Neural Network for Modelling Sentences”, arXiv:1404.2188v1, [cs.CL], 2014
  13. Rie Johnson, Tong Zhang, "Semi-supervised Convolutional Neural Networks for Text Categorization via Region Embedding", arXiv:1504.01255v3 [stat.ML], 2015
  14. Xiang Zhang Junbo Zhao Yann Le Cun, “Character-level Convolutional Networks for Text Classification”, arXiv:1509.01626v3 [cs.LG] , 2016
  15. Alexis Conneau, Holger Schwenk, Yann Le Cun, Lo¨ıc Barrault , “Very Deep Convolutional Networks for Text Classification”, arXiv:1606.01781 [cs.CL], 2017
  16. Rie Johnson, Tong Zhang Baidu, “Effective Use of Word Order for Text Categorization with Convolutional Neural Networks”, Human Language Technologies: Annual Conference of the North American Chapter of the ACL, pages 103–112, Denver, Colorado, May 31 – June 5, 2015
  17. Zichao Yang, Diyi Yang, Chris Dyer, Xiaodong He, Alex Smola, Eduard Hovy, “Hierarchical Attention Networks for Document Classification”, Proceedings of NAACL-HLT 2016, pages 1480–1489, San Diego, California, June 12-17, 2016.
  18. Abdelghani Dahou,Shengwu Xiong, Junwei Zhou, Mohamed Houcine Haddoud and Pengfei Duan “Word Embeddings and Convolutional Neural Network for Arabic Sentiment Classification”, Proceedings of COLING 2016, the 26th International Conference on Computational Linguistics: Technical Papers, pages 2418–2427, Osaka, Japan, 2016.
  19. Arabic Corpora. Essex Arabic Summaries Corpus: https://www.essex.ac.uk/linguistics/research/arabic/arabiccorpora/easc.aspx
  20. Tomas Mikolov, Kai Chen, Greg Corrado, Jeffry Dean, “Efficient Estimation of Word Representations in Vector Space”, arXiv:1301.3781v3 [cs.CL] 7 Sep 2013.
  21. Abu Bakr Soliman, Kareem Eissa, Samhaa R. El-Beltagy “AraVec: A set of Arabic Word Embedding Models for use in Arabic NLP”, 3rd International Conference on Arabic Computational Linguistics, ACLing 2017, 5-6 November 2017, Dubai, United Arab Emirates
  22. Josh Patterson and Gibson, “Deep learning, A practitioner’s approach”, O’Reilly, Media, Inc., 2017
  23. Hamed Habibi Aghdam, Elnaz Jahani Heravi, “Guide to Convolutional Neural Networks. A Practical Application to Traffic-Sign Detection and Classification”, Springer International Publishing AG 2017.
Index Terms

Computer Science
Information Sciences

Keywords

Classification Convolutional neural network Word Embedding