CFP last date
22 April 2024
Reseach Article

Mining Spatio-Temporal Data of Fatal Accident

by Aina Musdholifah, Siti Zaiton Mohd Hadhim
International Journal of Computer Applications
Foundation of Computer Science (FCS), NY, USA
Volume 63 - Number 8
Year of Publication: 2013
Authors: Aina Musdholifah, Siti Zaiton Mohd Hadhim
10.5120/10490-5243

Aina Musdholifah, Siti Zaiton Mohd Hadhim . Mining Spatio-Temporal Data of Fatal Accident. International Journal of Computer Applications. 63, 8 ( February 2013), 40-46. DOI=10.5120/10490-5243

@article{ 10.5120/10490-5243,
author = { Aina Musdholifah, Siti Zaiton Mohd Hadhim },
title = { Mining Spatio-Temporal Data of Fatal Accident },
journal = { International Journal of Computer Applications },
issue_date = { February 2013 },
volume = { 63 },
number = { 8 },
month = { February },
year = { 2013 },
issn = { 0975-8887 },
pages = { 40-46 },
numpages = {9},
url = { https://ijcaonline.org/archives/volume63/number8/10490-5243/ },
doi = { 10.5120/10490-5243 },
publisher = {Foundation of Computer Science (FCS), NY, USA},
address = {New York, USA}
}
%0 Journal Article
%1 2024-02-06T21:13:50.254805+05:30
%A Aina Musdholifah
%A Siti Zaiton Mohd Hadhim
%T Mining Spatio-Temporal Data of Fatal Accident
%J International Journal of Computer Applications
%@ 0975-8887
%V 63
%N 8
%P 40-46
%D 2013
%I Foundation of Computer Science (FCS), NY, USA
Abstract

Traffic accidents are an important concern of today's governments and societies, due to the high cost of human and economical resources involved. Data mining has been proven able to significantly help in improving traffic safety. Among several data mining tasks, clustering technique is mostly applied on spatio-temporal data, especially for the traffic data. A number of traffic related works proposed different clustering techniques for mining the spatio-temporal of traffic accident. However, some difficulties appeared when analyzing these datasets, such as the size of data, the lack of statistical evaluation methods, and interpreting the valuable patterns. With regard to solving this problem, this paper proposes a clustering approach for mining spatio-temporal data of fatal accident using local triangular kernel clustering (LTKC) algorithm. LTKC is kernel-density-based clustering algorithm that has the ability to determine the number of clusters automatically. We also propose three visualization techniques for use to interpret and present the optimal clustering result in an easy-understanding form. From the experimental results, LTKC approach was found to be able to discover responsible clusters within fatal accident data, which had proven by silhouette and Dunn index values close to 1. In addition, using visual techniques, we can state that the clustering results were well-separated and compact clusters.

References
  1. Beshah, T. and S. Hill. Mining road traffic accident data to improve safety: Role of road-related factors on accident severity in Ethiopia. 2010.
  2. Yin, J. , et al. High-dimensional shared nearest neighbor clustering algorithm. 2005. Changsha.
  3. Figuera, C. , et al. Multivariate spatial clustering of traffic accidents for local profiling of risk factors. in Intelligent Transportation Systems (ITSC), 2011 14th International IEEE Conference on. 2011.
  4. Lavrac, N. , et al. , Mining spatio-temporal data of traffic accidents and spatial pattern visualization. Metodoloski zveski, 2008. 5(1): p. 45-63.
  5. Tran, T. N. , R. Wehrens, and L. M. C. Buydens, KNN-kernel density-based clustering for high-dimensional multivariate data. Computational Statistics & Data Analysis, 2006. 51: p. 513-525.
  6. Ester, M. , et al. , A density-based algorithm for discovering clusters in large spatial database with noise, in 2nd International Conference on Knowledge Discovery and Data Mining. 1996.
  7. Hinneburg, A. and D. A. Keim. An efficient approach to clustering in large multimedia databases with noise. in The fourth international conference on knowledge discovery and data mining (KDD'98). 1998. Menlo Park, CA: AAAI Press.
  8. Zhang, D. and S. Chen. Kernel-based fuzzy and probabilistic c-means clustering. in The International Conference on Artificial Neural Networks. 2003. Istanbul, Turkey.
  9. Hotelling, H. , Analysis of a complex of statistical variables into principal components. Journal of Educational Psychology, 1933. 24: p. 417–441.
  10. Inselberg, A. and B. Dimsdale. Parallel coordinates: A tool for visualizing multi-dimensional geometry. in IEEE Visualization. 1990.
  11. Tobler, W. , Am Cartogr, 1979. 6: p. 101-106.
  12. Everitt, B. S. , Cluster Analysis. 3rd ed. 2000.
  13. Tan, P. N. , M. Steinbach, and V. Kumar, Introduction to Data Mining. 2006: Addison Wesley.
  14. Knorr-Held, L. and G. Raber, Bayesian detection of clusters and discontinuities in disease maps. Biometrics, 2000. 56(1): p. 13-21.
  15. Fukunaga, K. and L. Hostetler, The estimation of the gradient of a density function, with applications in pattern recognition. Information Theory, IEEE Transactions on, 1975. 21(1): p. 32-40.
  16. Hoffman, F. M. , et al. Multivariate Spatio-Temporal Clustering (MSTC) as a Data Mining Tool for Environmental Applications. in iEMSs 2008:International Congress on Environmental Modeling and Software Integrating Sciences and Information Technology for Environmental Assessment and Decision Making. 2008: International Environmental Modeling and Software Society (iEMSs).
  17. Das, S. , M. Lazarewicz, and L. H. Finkel. Principal Component Analysis of Temporal and Spatial Information for Human Gait Recognition. in The 26th Annual International Conference of IEEE EMBS. 2004. San Francisco, CA, USA: IEEE.
  18. Zhou, H. , et al. , Visual Clustering in Parallel Coordinates. Journal Compilation, 2008. 27(3).
  19. Inselberg, A. , The plane with parallel coordinates. The Visual Computer, 1985: p. 69–92.
  20. Guo, D. , et al. , Multivariate Analysis and Geovisualization with an Integrated Geographic Knowledge Discovery Approach. Cartographic and Geographic Information Science, 2005. 32(2): p. 113-132.
  21. Skupin, A. , The world of geography: Visualizing a knowledge domain with cartographic means. 2004, PNAS. p. 5274-5278.
  22. Gorricha, J. and V. Lobo, Improvements on the visualization of clusters in geo-referenced data using Self-Organizing Maps. Computers and Geosciences, 2012. 43: p. 177-186.
  23. Rousseeuw, P. J. , Silhouettes: A graphical aid to the interpretation and validation of cluster analysis. Journal of Computational and Applied Mathematics, 1987. 20(C): p. 53-65.
  24. Bezdek, J. C. and J. C. Dunn, Optimal Fuzzy Partitions: A Heuristic for Estimating The Parameters in A Mixture of Normal Distributions. IEEE Transactions on Computers, 1975. C-24(8): p. 835-840.
  25. NCSA, N. C. f. S. a. A. Fatality analysis reporting system (FARS) web-based encyclopedia. 2004 [cited; Available from: http://www-fars. nhtsa. dot. gov/.
  26. TESSMER, J. M. , FARS Analytic Reference Guide 1975 to 2002. 2002, National Highway Traffic Safety Administration, Department of Transportation, Washington, D. C.
  27. FARS, Coding and validation manual (2004) 2004, National Center for Statistics and Analysis, National Highway Traffic Safety Administration, Department of Transportation, Washington, D. C.
  28. Michalski, R. S. and W. D. Seeman, Recent Advances in Conceptual Clustering: CLUSTER3. Studies in Classification, Data Analysis, and Knowledge Organization, 2007: p. 285-297.
Index Terms

Computer Science
Information Sciences

Keywords

Fatal accident spatio-temporal data cluster analysis data mining