CFP last date
20 November 2025
Call for Paper
December Edition
IJCA solicits high quality original research papers for the upcoming December edition of the journal. The last date of research paper submission is 20 November 2025

Submit your paper
Know more
Random Articles
Reseach Article

An Survey of Approaches for Mining Documents on Web based on User based Analysis

Published on June 2013 by S.senthilkumar, G.tholkappia Arasu
International Conference on Innovation in Communication, Information and Computing 2013
Foundation of Computer Science USA
ICICIC2013 - Number 3
June 2013
Authors: S.senthilkumar, G.tholkappia Arasu

S.senthilkumar, G.tholkappia Arasu . An Survey of Approaches for Mining Documents on Web based on User based Analysis. International Conference on Innovation in Communication, Information and Computing 2013. ICICIC2013, 3 (June 2013), 13-17.

@article{
author = { S.senthilkumar, G.tholkappia Arasu },
title = { An Survey of Approaches for Mining Documents on Web based on User based Analysis },
journal = { International Conference on Innovation in Communication, Information and Computing 2013 },
issue_date = { June 2013 },
volume = { ICICIC2013 },
number = { 3 },
month = { June },
year = { 2013 },
issn = 0975-8887,
pages = { 13-17 },
numpages = 5,
url = { /proceedings/icicic2013/number3/12274-0155/ },
publisher = {Foundation of Computer Science (FCS), NY, USA},
address = {New York, USA}
}
%0 Proceeding Article
%1 International Conference on Innovation in Communication, Information and Computing 2013
%A S.senthilkumar
%A G.tholkappia Arasu
%T An Survey of Approaches for Mining Documents on Web based on User based Analysis
%J International Conference on Innovation in Communication, Information and Computing 2013
%@ 0975-8887
%V ICICIC2013
%N 3
%P 13-17
%D 2013
%I International Journal of Computer Applications
Abstract

Web documents contain information that include image, video. The retrieved information are oriented towards the user's input for search. But the popular search engines like google uses algorithms for prioritizing and retrieving the results. However the information accessed also depends on the cookies as part of the local system. In real requirements based on the category , type and interest of users it is required to provide contents. This paper has a complete survey of different approaches/algorithms that are in existence for analyzing the web documents and providing to the user. The paper has a complete study of the different performance parameters that can be used for analyzing the results. The paper also draws conclusions for selecting the appropriate approach based on different scenarios and different user input criteria provided by users.

References
  1. Manjusha, R. ,"Web mining framework for security in e-commerce" International Conference on Recent Trends in Information Technology (ICRTIT),PP. 1043-1048,2011
  2. Sharma Kavita,Shrivastava,Gulshan Kumar, Vikas, "Web mining: Today and tomorrow" ,International Conference on Electronics Computer Technology (ICECT), Vol. 1,PP. 399 - 403,2011.
  3. Buche P. , Dibie-Barthelemy J. , Ibanescu L. , Soler L. ," Fuzzy Web Data Tables Integration Guided by an Ontological and Terminological Resource",IEEE Transactions on Knowledge and Data Engineering, Volume: 25 ,PP. 805 - 819,2013
  4. Guan Ziyu,Miao Gengxin,McLoughlin Russell,Yan Xifeng , Cai Deng,"Co-Occurrence-Based Diffusion for Expert Search on the Web"IEEE Transactions on Knowledge and Data Engineering, Vol. 25,PP. 1001 - 1014,2013.
  5. Skabar Andrew,Abdalgader,Khaled,"Clustering Sentence-Level Text Using a Novel Fuzzy Relational Clustering Algorithm" ,IEEE Transactions on Knowledge and Data Engineering,Vol. 25 , PP. 62-75,2013
  6. Giatsoglou M. , Vakali A. ,"Capturing Social Data Evolution Using Graph Clustering" IEEE Internet Computing, Vol. 17,PP. 74 - 79,2013
  7. Lim Edward H Y , Tam Hillman W K , Wong Sandy W K , Liu James Nga-Kwok , Lee Raymond S T,"Collaborative content and user-based web ontology learning system", IEEE International Conference on Fuzzy Systems,PP. 1050 - 1055,2009.
  8. Cui Hang,Wen Ji-Rong,Nie Jian-Yun Y. ,Ma Wei-Ying Y. ,"Query expansion by mining user logs",IEEE Transactions on Knowledge and Data Engineering, Vol. 15,PP. 829 - 839,2003
  9. Myat Nyeint Nyeint,Hla Khin Haymar Saw,"Organizing Web Documents Resulting from an Information Retrieval System Using Formal Concept Analysis",Proceedings of the sixth Asia-Pacific Symposium on Information and Telecommunication Technologies PP. 198 - 203,2005.
  10. Huang Chien-Chung,Lin Kuan-Ming,Chien Lee-Feng,"Automatic training corpora acquisition through Web mining",IEEE/WIC/ACM International Conference on Web Intelligence, PP. 193 - 199 ,2005.
  11. Chen Ming-Syan Syan,Park Jong Soo,Yu Phillip S. ,"Efficient data mining for path traversal patterns",IEEE Transactions on Knowledge and Data Engineering, Vol. 10,PP. 209 - 221,1998.
  12. Chengzhi Zhang,Qingguo Zhang, "Topic Navigation Generation Using Topic Extraction and Clustering", International Symposium on Knowledge Acquisition and Modeling, PP. 333 - 339 ,2008
  13. Manavoglu Eren,Pavlov Dmitry,Giles C. Lee, "Probabilistic user behavior models", Third IEEE International Conference on Data Mining,PP. 203-210,2003
  14. Bollegala D. , Weir D. , Carroll J. ,Cross-Domain Sentiment Classification using a Sentiment Sensitive Thesaurus,IEEE Transactions on Knowledge and Data Engineering,2012
  15. Witte R , Krestel R , Kappler T , Lockemann P,"Converting a Historical Encyclopedia of Architecture into a Semantic Knowledge Base", IEEE Intelligent Systems,2009.
  16. Chen Jian,Shtykh Roman Y. ,Jin Qun,"Gradual Adaption Model for Estimation of User Information Access Behavior",3rd International Conference on Systems and Networks Communications,PP. 378 - 383,2008
  17. Gang Xiao ,Jiancang Xie,"Performance Analysis of Chinese Webpage Categorizing Algorithm Based on Support Vector Machines (SVM)",Fifth International Conference on Information Assurance and Security, Vol. 1,PP. 231 - 235,2009.
  18. Othman Zulaiha Hj Ali,Bakar Azuraliza Abu,Hamdan Abdul Razak,Omar Khairuddin Bin,Shuib Nor Liyana Mohd,"Agent based preprocessing",International Conference on Intelligent and Advanced Systems,PP. 219-223,2007
  19. Magdalini Eirinaki,Michalis Vazirgiannis,"Web mining for web personalization",ACM Transactions on Internet Technology,Vol. 3,PP. 1-27,2003.
  20. Chunyu Kit, Jessica Yee Ha Ng,"An Intelligent Web Agent to Mine Bilingual Parallel Pages via Automatic Discovery of URL Pairing Patterns",Proceedings of the 2007 IEEE/WIC/ACM International Conferences on Web Intelligence and Intelligent Agent Technology - Workshops,PP. 526-529,2007.
  21. Danielle Medeiros, Uirá Kulesza, André Mauricio Campos,"A framework for implementing web recommendation agents",Proceedings of the 18th Brazilian symposium on Multimedia and the web,2012.
  22. Leona F. Fass,"Some agent theory for the semantic web",SIGSOFT Software Engineering Notes,Vol. 30,2005.
  23. Nick Bassiliades,"Agents and knowledge interoperability in the semantic web era", Proceedings of the 2nd International Conference on Web Intelligence, Mining and Semantics,2012
  24. Ian Dickinson, Michael Wooldridge,"Towards practical reasoning agents for the semantic web",Proceedings of the second international joint conference on Autonomous agents and multiagent systems,July 2003.
  25. Jean Paul Sansonnet, Daniel Werner Correa, Patricia Jaques, Annelies Braffort, Cyril Verrecchia ,"Developing web fully-integrated conversational assistant agents",Proceedings of the 2012 ACM Research in Applied Computation Symposium,2012.
  26. Gabriel L. Somlo, Adele E. Howe,"Incremental clustering for profile maintenance in information gathering web agents",Proceedings of the fifth international conference on Autonomous agents,2001.
  27. Wolfgang Ketter, Arun Batchu, Gary Berosik, Dan McCreary,"A semantic web architecture for advocate agents to determine preferences and facilitate decision making", Proceedings of the 10th international conference on Electronic commerce(ICEC'08),2008.
  28. Andrea Paola Barraza, Angela Carrillo-Ramos ,"Basic requirements to keep in mind for an ideal agent-based web information retrieval system in ubiquitous environments", Proceedings of the 12th International Conference on Information Integration and Web-based Applications & Services(IIWAS'10),2010.
  29. Andreas Gerber, Matthias Klusch, Christian Ruß, Ingo Zinnikus,"Holonic agents for the coordination of supply webs",Proceedings of the fifth international conference on Autonomous agents(AGENTS'01),2001
  30. Kurt D. Bollacker, Steve Lawrence, C. Lee Giles,"CiteSeer: an autonomous Web agent for automatic retrieval and identification of interesting publications",Proceedings of the second international conference on Autonomous agents(AGENTS'98),1998
Index Terms

Computer Science
Information Sciences

Keywords

Data Model Cluster Threshold Filtering Classification Agent Learning