CFP last date
20 May 2024
Reseach Article

Anomaly Detection in Time Series using Unsupervised Machine Learning Approach

by Shruti Devlekar, Venkatesh Rallapalli, Sangeeta Oswal
International Journal of Computer Applications
Foundation of Computer Science (FCS), NY, USA
Volume 184 - Number 35
Year of Publication: 2022
Authors: Shruti Devlekar, Venkatesh Rallapalli, Sangeeta Oswal
10.5120/ijca2022922348

Shruti Devlekar, Venkatesh Rallapalli, Sangeeta Oswal . Anomaly Detection in Time Series using Unsupervised Machine Learning Approach. International Journal of Computer Applications. 184, 35 ( Nov 2022), 7-13. DOI=10.5120/ijca2022922348

@article{ 10.5120/ijca2022922348,
author = { Shruti Devlekar, Venkatesh Rallapalli, Sangeeta Oswal },
title = { Anomaly Detection in Time Series using Unsupervised Machine Learning Approach },
journal = { International Journal of Computer Applications },
issue_date = { Nov 2022 },
volume = { 184 },
number = { 35 },
month = { Nov },
year = { 2022 },
issn = { 0975-8887 },
pages = { 7-13 },
numpages = {9},
url = { https://ijcaonline.org/archives/volume184/number35/32538-2022922348/ },
doi = { 10.5120/ijca2022922348 },
publisher = {Foundation of Computer Science (FCS), NY, USA},
address = {New York, USA}
}
%0 Journal Article
%1 2024-02-07T01:23:10.270076+05:30
%A Shruti Devlekar
%A Venkatesh Rallapalli
%A Sangeeta Oswal
%T Anomaly Detection in Time Series using Unsupervised Machine Learning Approach
%J International Journal of Computer Applications
%@ 0975-8887
%V 184
%N 35
%P 7-13
%D 2022
%I Foundation of Computer Science (FCS), NY, USA
Abstract

Anomaly detection is the process of identifying data points, observations or events which deviate from normal behavior. To detect anomalies, we have used the PyCaret machine learning library. PyCaret is a low code, open-source library that automates our machine learning processes and helps us detect outliers/anomalies Its Anomaly detection and Regression modules contain various machine learning algorithms and frameworks such as XGBoost, CatBoost, Isolation Forest, DBSCAN, etc. It is a deployment-ready library that is easy to use and helps users to perform end-to-end experiments efficiently. In this paper, we applied clustering and regression-based methods on the NAB Twitter dataset for time series anomaly detection. In the regression method, we predicted the tweets, calculated the difference by comparing them with actual tweets, and used the thresholding technique for anomaly detection.

References
  1. Prokhorenkova L, Gusev G, Vorobev A, Dorogush A, Gulin A. CatBoost: unbiased boosting with categorical features.
  2. Astha G, Wenyu, Jules S, Ramasamy S, Chuan-Sheng F. An Evaluation of Anomaly Detection and Diagnosis in Multivariate Time Series. IEEE Transactions on Neural Networks and Learning Systems
  3. Yu Qin, YuanSheng Lou. Hydrological Time Series Anomaly Pattern Detection based on Isolation Forest. 2019 IEEE 3rd Information Technology, Networking, Electronic and Automation Control Conference (ITNEC 2019)
  4. Yiyang Niu. Walmart Sales Forecasting using XGBoost algorithm and Feature engineering. 2020 International Conference on Big Data & Artificial Intelligence & Software Engineering (ICBASE)
  5. Subutai Ahmad, Alexander Lavin, Scott Purdy, Zuha Agha, “Unsupervised real-time anomaly detection for streaming data”, Neurocomputing, Volume 262, 1 November 2017, Pages 134-147, ISSN 0925-2312.
  6. S. Zhong, S. Fu, L. Lin, X. Fu, Z. Cui, and R. Wang, "A novel unsupervised anomaly detection for gas turbine using Isolation Forest," 2019 IEEE International
  7. Conference on Prognostics and Health Management (ICPHM), 2019, pp. 1-6, DOI: 10.1109/ICPHM.2019.8819409.
  8. Fattah, Jamal & Ezzine, Latifa & Aman, Zineb & Moussami, Haj & Lachhab, Abdeslam. (2018). Forecasting of demand using ARIMA model. International Journal of Engineering Business Management. 10. 184797901880867. 10.1177/1847979018808673.
  9. Y. Wang, J. Wong and A. Miner, "Anomaly intrusion detection using one-class SVM," Proceedings from the Fifth Annual IEEE SMC Information Assurance Workshop, 2004., 2004, pp. 358-364, DOI: 10.1109/IAW.2004.1437839.
  10. Chun-Hsiang Lee, Xu Lu, Xiunao Lin, Hongfeng Tao, Yaolei Xue, and Chao Wu. 2020. Anomaly Detection of Storage Battery Based on Isolation Forest and Hyperparameter Tuning. In Proceedings of the 2020 5th International Conference on Mathematics and Artificial Intelligence (ICMAI 2020). Association for Computing Machinery, New York, NY, USA, 229–233. https://doi.org/10.1145/3395260.3395271
  11. Jebb, A. T., Tay, L., Wang, W., & Huang, Q. (2015). Time series analysis for psychological research: examining and forecasting change. Frontiers in Psychology. https://doi.org/10.3389/fpsyg.2015.00727
  12. Y. Jin, C. Qiu, L. Sun, X. Peng and J. Zhou, "Anomaly detection in time series via robust PCA," 2017 2nd IEEE International Conference on Intelligent Transportation Engineering (ICITE), 2017, pp. 352-355, DOI: 10.1109/ICITE.2017.8056937.
  13. Wentai Wu, Ligang He, Weiwei Lin, Yi Su, Yuhua Cui, Carsten Maple, Stephen A. Jarvis, "Developing an Unsupervised Real-time Anomaly Detection Scheme for Time Series with Multi-seasonality," in IEEE Transactions on Knowledge and Data Engineering, DOI: 10.1109/TKDE.2020.3035685.
Index Terms

Computer Science
Information Sciences

Keywords

Anomaly Detection Time Series Data Anomalies PyCaret Clustering Regression Machine Learning Isolation Forest XGBoost One Class SVM CatBoost Extra Trees Regressor