
Enhanced Web Mining Technique To Clean Web Log File

International Journal of Computer Applications
© 2014 by IJCA Journal
Volume 96 - Number 16
Year of Publication: 2014
Authors:
Rachit Goel
DOI: 10.5120/16880-6882

Rachit Goel. Enhanced Web Mining Technique To Clean Web Log File. International Journal of Computer Applications 96(16):25-29, June 2014.

@article{key:article,
	author = {Rachit Goel},
	title = {Article: Enhanced Web Mining Technique To Clean Web Log File},
	journal = {International Journal of Computer Applications},
	year = {2014},
	volume = {96},
	number = {16},
	pages = {25-29},
	month = {June},
	note = {Full text available}
}

Abstract

The arrival of computer technology has made it possible to produce and store massive amounts of data. The world is no longer confined to manually generated files or reports; it has become a giant store where vast amounts of data are collected and exchanged daily. Web pages typically contain a large amount of information that is not part of their main content, e.g. banner ads, navigation bars, and copyright notices. Such noise usually leads to poor results in web mining, which depends mainly on web page content. It therefore becomes essential to extract information from this bulk of data and structure it into useful knowledge. This need led to the birth of data mining. Web usage mining is the subfield of data mining that deals with the discovery and analysis of usage patterns from web data, specifically web logs, in order to improve web-based applications. The aim of mining is to find users' access models, such as frequent access paths, frequent access page groups, and user clusters, automatically and quickly from vast web log data. Through web usage mining, the server log, registration information, and other related information left by users provide a foundation for organizational decision making.
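To make the idea of cleaning a web log concrete, the following is a minimal sketch and not the technique proposed in the paper. It assumes entries in the Common Log Format and treats three things as noise: requests for embedded resources (images, stylesheets, scripts), requests that did not succeed, and requests for `/robots.txt`. The names `clean_log`, `page_frequencies`, `NOISE_SUFFIXES`, and the sample entries are hypothetical and exist only for illustration.

```python
import re
from collections import Counter

# Assumed input: Common Log Format,
#   host ident user [time] "METHOD path PROTOCOL" status size
LOG_PATTERN = re.compile(
    r'^(?P<host>\S+) \S+ \S+ \[(?P<time>[^\]]+)\] '
    r'"(?P<method>[A-Z]+) (?P<path>\S+)[^"]*" (?P<status>\d{3}) \S+'
)

# Assumed noise list: resources fetched automatically along with a page.
NOISE_SUFFIXES = (".gif", ".jpg", ".jpeg", ".png", ".css", ".js", ".ico")


def clean_log(lines):
    """Return (host, path) pairs for successful GET requests to pages."""
    kept = []
    for line in lines:
        match = LOG_PATTERN.match(line)
        if match is None:
            continue  # malformed entry
        if match["method"] != "GET":
            continue  # keep only page retrievals
        if not match["status"].startswith("2"):
            continue  # drop failed or redirected requests
        path = match["path"].split("?", 1)[0]  # ignore the query string
        if path.lower().endswith(NOISE_SUFFIXES) or path == "/robots.txt":
            continue  # embedded resource or crawler probe
        kept.append((match["host"], path))
    return kept


def page_frequencies(entries):
    """Count how often each page appears in the cleaned entries."""
    return Counter(path for _, path in entries)


# Hypothetical sample entries, invented for this sketch.
SAMPLE = [
    '10.0.0.1 - - [10/Jun/2014:10:00:00 +0000] "GET /index.html HTTP/1.1" 200 1043',
    '10.0.0.1 - - [10/Jun/2014:10:00:01 +0000] "GET /logo.png HTTP/1.1" 200 2326',
    '10.0.0.2 - - [10/Jun/2014:10:00:05 +0000] "GET /missing.html HTTP/1.1" 404 512',
    '10.0.0.3 - - [10/Jun/2014:10:00:09 +0000] "GET /robots.txt HTTP/1.1" 200 68',
    '10.0.0.2 - - [10/Jun/2014:10:00:12 +0000] "GET /index.html?ref=ad HTTP/1.1" 200 1043',
    'not a log line',
]

if __name__ == "__main__":
    cleaned = clean_log(SAMPLE)
    print(cleaned)
    print(page_frequencies(cleaned))
```

Only the two successful requests for `/index.html` survive the filter in this sample. Real logs may use other formats (for example, the combined format with referrer and user-agent fields), and the noise rules would need to be adapted accordingly.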
