Machine Learning based Approach for Detection of Cyberbullying Tweets

Rashi Shah; Srushti Aparajit; Riddhi Chopdekar; Rupali Patil

Call for Paper

September Edition

IJCA solicits high quality original research papers for the upcoming September edition of the journal. The last date of research paper submission is 20 August 2025

Submit your paper

Know more

The week's pick

Real-time Synchronization Mechanisms Between Batch-oriented Legacy Systems and Modern Interfaces in the Retirement Domain

Balamurugan Krishnaswamy Gnanasekaran

Random Articles

Trust Enhancing Model for Cloud Environment

December

2015

Fuzzy Crime Investigation Framework for Tracking Data Theft based on USB Storage

December

2013

A New Ranking Algorithm for Search Engine: Content’s Weight based Page Ranking

Oct

2016

Online Customer Care: An Android Application for Mobile Customers using Speech Synthesis

Jul

2016

Reseach Article

Machine Learning based Approach for Detection of Cyberbullying Tweets

by Rashi Shah, Srushti Aparajit, Riddhi Chopdekar, Rupali Patil

International Journal of Computer Applications

Foundation of Computer Science (FCS), NY, USA

Volume 175 - Number 37

Year of Publication: 2020

Authors: Rashi Shah, Srushti Aparajit, Riddhi Chopdekar, Rupali Patil

10.5120/ijca2020920946

Rashi Shah, Srushti Aparajit, Riddhi Chopdekar, Rupali Patil . Machine Learning based Approach for Detection of Cyberbullying Tweets. International Journal of Computer Applications. 175, 37 ( Dec 2020), 52-57. DOI=10.5120/ijca2020920946

@article{ 10.5120/ijca2020920946,

author = { Rashi Shah, Srushti Aparajit, Riddhi Chopdekar, Rupali Patil },

title = { Machine Learning based Approach for Detection of Cyberbullying Tweets },

journal = { International Journal of Computer Applications },

issue_date = { Dec 2020 },

volume = { 175 },

number = { 37 },

month = { Dec },

year = { 2020 },

issn = { 0975-8887 },

pages = { 52-57 },

numpages = {9},

url = { https://ijcaonline.org/archives/volume175/number37/31695-2020920946/ },

doi = { 10.5120/ijca2020920946 },

publisher = {Foundation of Computer Science (FCS), NY, USA},

address = {New York, USA}

}

%0 Journal Article

%1 2024-02-07T00:40:33.530332+05:30

%A Rashi Shah

%A Srushti Aparajit

%A Riddhi Chopdekar

%A Rupali Patil

%T Machine Learning based Approach for Detection of Cyberbullying Tweets

%J International Journal of Computer Applications

%@ 0975-8887

%V 175

%N 37

%P 52-57

%D 2020

%I Foundation of Computer Science (FCS), NY, USA

Abstract

In today's technologically sound world the use of social media is inevitable. Along with benefits of social media there are serious negative impacts as well. An important issue that needs to be addressed here is cyberbullying. An effective solution for resolving this issue is the detection of the cyber-bullying content by Machine Learning. This manuscript aims to put forward ideas regarding cyber-bullying detection on the social media platform twitter. The outcome of this manuscript is that whichever tweet is a bully tweet that is represented by the value 1, thus all the bully tweets are detected. The Twitter dataset is equally distributed into bully and non-bully tweets and fed to different machine learning models. The logistic regression classifier provides accurate classification of bully and non-bully tweets with precision of 91%, recall 94% and F1-score 93%. This work will help curb cyber-bullying, so that the users can stay at bay from victimization.

References

Zhao, R., Zhou, A. and Mao, K., 2016, January. Automatic detection of cyber-bullying on social networks based on bullying features. In Proceedings of the 17th international conference on distributed computing and networking (pp. 1-6).
Van Hee, C., Lefever, E., Verhoeven, B., Mennes, J., Desmet, B., De Pauw, G., Daelemans, W. and Hoste, V., 2015. Automatic detection and prevention of cyber-bullying. In International Conference on Human and Social Analytics (HUSO 2015) (pp. 13-18). IARIA.
Saravanaraj, A., Sheeba, J.I. and Devaneyan, S.P., 2016. Automatic detection of cyber-bullying from twitter. Int. J. Comput. Sci. Info. Technol. Secur, 6.
Van Hee, C., Jacobs, G., Emmery, C., Desmet, B., Lefever, E., Verhoeven, B., De Pauw, G., Daelemans, W. and Hoste, V., 2018. Automatic detection of cyber-bullying in social media text. PloS one, 13(10).
Dadvar, M., Trieschnigg, D. and de Jong, F., 2014, May. Experts and machines against bullies: A hybrid approach to detect cyberbullies. In Canadian Conference on Artificial Intelligence (pp. 275-281). Springer, Cham.
Zhang, L., Ghosh, R., Dekhil, M., Hsu, M. and Liu, B., 2011. Combining lexicon-based and learning-based methods for Twitter sentiment analysis. HP Laboratories, Technical Report HPL-2011, 89.
Amolik, A., Jivane, N., Bhandari, M. and Venkatesan, M., 2016. Twitter sentiment analysis of movie reviews using machine learning techniques. international Journal of Engineering and Technology, 7(6), pp.1-7.
Dinakar, K., Reichart, R. and Lieberman, H., 2011, July. Modeling the detection of textual cyber-bullying. In fifth international AAAI conference on weblogs and social media.
Nahar, V., Li, X. and Pang, C., 2013. An effective approach for cyber-bullying detection. Communications in Information Science and Management Engineering, 3(5), p.238
Sardar Hamidian and Mona Diab. Rumor Detection and Classification for Twitter Data, IARIA (2015), 71-77, SOTICS2015: The Fifth International Conference on Social Media Technologies, Communication, and Informatics.ISBN:978-1-61208-443-5.
K. Reynolds, A. Kontostathis, and L. Edwards, “December 18–21, 2011, Honolulu, Hawaii. IEEE Computer Society, Dec. 2011, pp. 241–244, IEEE, ISBN: 978-0-7695-4607-0,
https://en.wikipedia.org/wiki/Logistic_regression#:~:text=Logistic%20regression%20is%20a%20statistical,a%20form%20of%20binary%20regression
https://en.wikipedia.org/wiki/Supportvectormachine
https://towardsdatascience.com/how-to-make-sgd-classifier-perform-as-well-as-logistic-regression-using-parfit-cc10bca2d3c4
https://en.wikipedia.org/wiki/Random_forest
https://scikitlearn.org/stable/modules/generated/sklearn.naive_bayes.MultinomialNB.html#:~:text=The%20multinomial%20Naive%20Bayes%20classifier,tf%2Didf%20may%20also%20work
https://www.kaggle.com/vkrahul/twitter-hate-speech
https://www.jmir.org/

Index Terms

Computer Science

Information Sciences

Keywords

Cyber Bullying Machine Learning Natural Language Processing (NLP) Twitter Logistic Regression Bully tweets Non-bully tweets Victimization Classification.