CFP last date
20 May 2024
Call for Paper
June Edition
IJCA solicits high quality original research papers for the upcoming June edition of the journal. The last date of research paper submission is 20 May 2024

Submit your paper
Know more
Reseach Article

An Efficient Preprocessing Methodology of Log File for Web Usage Mining

Published on June 2015 by A. Deepa, P. Raajan
National Conference on Research Issues in Image Analysis and Mining Intelligence
Foundation of Computer Science USA
NCRIIAMI2015 - Number 2
June 2015
Authors: A. Deepa, P. Raajan
fa5c66f5-0cc9-4c78-a2c4-a343009a4124

A. Deepa, P. Raajan . An Efficient Preprocessing Methodology of Log File for Web Usage Mining. National Conference on Research Issues in Image Analysis and Mining Intelligence. NCRIIAMI2015, 2 (June 2015), 15-16.

@article{
author = { A. Deepa, P. Raajan },
title = { An Efficient Preprocessing Methodology of Log File for Web Usage Mining },
journal = { National Conference on Research Issues in Image Analysis and Mining Intelligence },
issue_date = { June 2015 },
volume = { NCRIIAMI2015 },
number = { 2 },
month = { June },
year = { 2015 },
issn = 0975-8887,
pages = { 15-16 },
numpages = 2,
url = { /proceedings/ncriiami2015/number2/21025-4025/ },
publisher = {Foundation of Computer Science (FCS), NY, USA},
address = {New York, USA}
}
%0 Proceeding Article
%1 National Conference on Research Issues in Image Analysis and Mining Intelligence
%A A. Deepa
%A P. Raajan
%T An Efficient Preprocessing Methodology of Log File for Web Usage Mining
%J National Conference on Research Issues in Image Analysis and Mining Intelligence
%@ 0975-8887
%V NCRIIAMI2015
%N 2
%P 15-16
%D 2015
%I International Journal of Computer Applications
Abstract

Now a day, WWW has become important and huge data storage. All users' activities will be stored in log file. The log file shows the interest on the particular website. With a wide usage of internet, the log file size is growing rapidly. Web mining is the process of extracting information from web data. The raw log file won't reveal the users' accessing pattern. Thus, preprocessing has become an important process in web mining. Web Usage Mining is the important domain area of web mining to extract and analyze the usage pattern of users from the server log file. The quality of the input decides the quality of the output. Preprocessing is the noteworthy process before mining the interesting information from data. In this paper we have implemented the preprocessing techniques to convert the log file into user sessions which are suitable for mining and reduce the size of session file by filtering the least requested pages.

References
  1. Raiyani, ashwin g. , and sheetal s. Pandya. "Discovering User Identification Mining Technique for Preprocessed Web Log Data. "
  2. Maideen, C. M. , & Palanivel, M. K. MS Log Cleaner: A framework to discover efficient use of web service.
  3. Castellano, G. , Fanelli, A. M. , and Torsello, M. A. 2007. Log data preparation for mining web usage patterns. In IADIS International Conference Applied Computing (pp. 371-378).
  4. Sumathi, C. P. , et al. , 2011. An Overview of Preprocessing Of Web Log Files For Web Usage Mining. Journal of Theoretical and Applied Information Technology, ISSN: 1992-8645.
  5. Pamutha, T. , Chimphlee, S. , Kimpan, C. , and Sanguansat, P. 2012. Data Preprocessing on Web Server Log Files for Mining Users Access Patterns. International Journal of Research and Reviews in Wireless Communications (IJRRWC) Vol, 2.
  6. Smith, K. A. , & Ng, A. (2003). Web page clustering using a self-organizing map of user navigation patterns. Decision Support Systems, 35(2), 245-256.
  7. Ramya, C. , and Kavitha, G. 2011. An Efficient Preprocessing Methodology for Discovering Patterns and Clustering of Web Users using a Dynamic ART1 Neural Network. In Computer Networks and Intelligent Computing (pp. 198-204). Springer Berlin Heidelberg.
  8. Eltahir, M. , & Dafa-Alla, A. F. (2013, August). Extracting knowledge from web server logs using web usage mining. In Computing, Electrical and Electronics Engineering (ICCEEE), 2013 International Conference on (pp. 413-417). IEEE.
  9. Bhawsar, S. , Pathak, K. , Mariya, S. , & Parihar, S. Extraction of Business Rules from Web logs to Improve Web Usage Mining.
  10. Langhnoja, S. G. , Barot, M. P. , & Mehta, D. B. (2013). Web Usage Mining Using Association Rule Mining on Clustered Data for Pattern Discovery. International Journal.
  11. https://www. microsoft. com/technet/prodtechnol/WindowsServer2003/Library/IIS/be22e074-72f8-46da-bb7e-e27877c85bca. mspx?mfr=true
  12. Common Log Format – Wikipedia: http://en. wikipedia. org/wiki/Common_Log_Format
  13. A Log File Types Supported by Clickstream Intelligence, Oracle9iAS Clickstream Intelligence Administrator's Guide Release 2 (9. 0. 2) Part Number A90500-02 http://docs. oracle. com/cd/A97329_03/bi. 902/a90500/admin-05. htm
  14. Markov, Z. , & Larose, D. T. (2007). Data mining the Web: uncovering patterns in Web content, structure, and usage. John Wiley & Sons.
Index Terms

Computer Science
Information Sciences

Keywords

Web Usage Mining Www Log File Preprocess Filter Session.