CFP last date
22 April 2024
Reseach Article

Machine Learning based Approach for Detection of Cyberbullying Tweets

by Rashi Shah, Srushti Aparajit, Riddhi Chopdekar, Rupali Patil
International Journal of Computer Applications
Foundation of Computer Science (FCS), NY, USA
Volume 175 - Number 37
Year of Publication: 2020
Authors: Rashi Shah, Srushti Aparajit, Riddhi Chopdekar, Rupali Patil
10.5120/ijca2020920946

Rashi Shah, Srushti Aparajit, Riddhi Chopdekar, Rupali Patil . Machine Learning based Approach for Detection of Cyberbullying Tweets. International Journal of Computer Applications. 175, 37 ( Dec 2020), 52-57. DOI=10.5120/ijca2020920946

@article{ 10.5120/ijca2020920946,
author = { Rashi Shah, Srushti Aparajit, Riddhi Chopdekar, Rupali Patil },
title = { Machine Learning based Approach for Detection of Cyberbullying Tweets },
journal = { International Journal of Computer Applications },
issue_date = { Dec 2020 },
volume = { 175 },
number = { 37 },
month = { Dec },
year = { 2020 },
issn = { 0975-8887 },
pages = { 52-57 },
numpages = {9},
url = { https://ijcaonline.org/archives/volume175/number37/31695-2020920946/ },
doi = { 10.5120/ijca2020920946 },
publisher = {Foundation of Computer Science (FCS), NY, USA},
address = {New York, USA}
}
%0 Journal Article
%1 2024-02-07T00:40:33.530332+05:30
%A Rashi Shah
%A Srushti Aparajit
%A Riddhi Chopdekar
%A Rupali Patil
%T Machine Learning based Approach for Detection of Cyberbullying Tweets
%J International Journal of Computer Applications
%@ 0975-8887
%V 175
%N 37
%P 52-57
%D 2020
%I Foundation of Computer Science (FCS), NY, USA
Abstract

In today's technologically sound world the use of social media is inevitable. Along with benefits of social media there are serious negative impacts as well. An important issue that needs to be addressed here is cyberbullying. An effective solution for resolving this issue is the detection of the cyber-bullying content by Machine Learning. This manuscript aims to put forward ideas regarding cyber-bullying detection on the social media platform twitter. The outcome of this manuscript is that whichever tweet is a bully tweet that is represented by the value 1, thus all the bully tweets are detected. The Twitter dataset is equally distributed into bully and non-bully tweets and fed to different machine learning models. The logistic regression classifier provides accurate classification of bully and non-bully tweets with precision of 91%, recall 94% and F1-score 93%. This work will help curb cyber-bullying, so that the users can stay at bay from victimization.

References
  1. Zhao, R., Zhou, A. and Mao, K., 2016, January. Automatic detection of cyber-bullying on social networks based on bullying features. In Proceedings of the 17th international conference on distributed computing and networking (pp. 1-6).
  2. Van Hee, C., Lefever, E., Verhoeven, B., Mennes, J., Desmet, B., De Pauw, G., Daelemans, W. and Hoste, V., 2015. Automatic detection and prevention of cyber-bullying. In International Conference on Human and Social Analytics (HUSO 2015) (pp. 13-18). IARIA.
  3. Saravanaraj, A., Sheeba, J.I. and Devaneyan, S.P., 2016. Automatic detection of cyber-bullying from twitter. Int. J. Comput. Sci. Info. Technol. Secur, 6.
  4. Van Hee, C., Jacobs, G., Emmery, C., Desmet, B., Lefever, E., Verhoeven, B., De Pauw, G., Daelemans, W. and Hoste, V., 2018. Automatic detection of cyber-bullying in social media text. PloS one, 13(10).
  5. Dadvar, M., Trieschnigg, D. and de Jong, F., 2014, May. Experts and machines against bullies: A hybrid approach to detect cyberbullies. In Canadian Conference on Artificial Intelligence (pp. 275-281). Springer, Cham.
  6. Zhang, L., Ghosh, R., Dekhil, M., Hsu, M. and Liu, B., 2011. Combining lexicon-based and learning-based methods for Twitter sentiment analysis. HP Laboratories, Technical Report HPL-2011, 89.
  7. Amolik, A., Jivane, N., Bhandari, M. and Venkatesan, M., 2016. Twitter sentiment analysis of movie reviews using machine learning techniques. international Journal of Engineering and Technology, 7(6), pp.1-7.
  8. Dinakar, K., Reichart, R. and Lieberman, H., 2011, July. Modeling the detection of textual cyber-bullying. In fifth international AAAI conference on weblogs and social media.
  9. Nahar, V., Li, X. and Pang, C., 2013. An effective approach for cyber-bullying detection. Communications in Information Science and Management Engineering, 3(5), p.238
  10. Sardar Hamidian and Mona Diab. Rumor Detection and Classification for Twitter Data, IARIA (2015), 71-77, SOTICS2015: The Fifth International Conference on Social Media Technologies, Communication, and Informatics.ISBN:978-1-61208-443-5.
  11. K. Reynolds, A. Kontostathis, and L. Edwards, “December 18–21, 2011, Honolulu, Hawaii. IEEE Computer Society, Dec. 2011, pp. 241–244, IEEE, ISBN: 978-0-7695-4607-0,
  12. https://en.wikipedia.org/wiki/Logistic_regression#:~:text=Logistic%20regression%20is%20a%20statistical,a%20form%20of%20binary%20regression
  13. https://en.wikipedia.org/wiki/Supportvectormachine
  14. https://towardsdatascience.com/how-to-make-sgd-classifier-perform-as-well-as-logistic-regression-using-parfit-cc10bca2d3c4
  15. https://en.wikipedia.org/wiki/Random_forest
  16. https://scikitlearn.org/stable/modules/generated/sklearn.naive_bayes.MultinomialNB.html#:~:text=The%20multinomial%20Naive%20Bayes%20classifier,tf%2Didf%20may%20also%20work
  17. https://www.kaggle.com/vkrahul/twitter-hate-speech
  18. https://www.jmir.org/
Index Terms

Computer Science
Information Sciences

Keywords

Cyber Bullying Machine Learning Natural Language Processing (NLP) Twitter Logistic Regression Bully tweets Non-bully tweets Victimization Classification.