CFP last date
20 May 2024
Reseach Article

Identification and Classification of Web Pages with Specified Domain

by Poonam Nagale, Alka Vishwa
International Journal of Computer Applications
Foundation of Computer Science (FCS), NY, USA
Volume 118 - Number 11
Year of Publication: 2015
Authors: Poonam Nagale, Alka Vishwa
10.5120/20793-3454

Poonam Nagale, Alka Vishwa . Identification and Classification of Web Pages with Specified Domain. International Journal of Computer Applications. 118, 11 ( May 2015), 41-44. DOI=10.5120/20793-3454

@article{ 10.5120/20793-3454,
author = { Poonam Nagale, Alka Vishwa },
title = { Identification and Classification of Web Pages with Specified Domain },
journal = { International Journal of Computer Applications },
issue_date = { May 2015 },
volume = { 118 },
number = { 11 },
month = { May },
year = { 2015 },
issn = { 0975-8887 },
pages = { 41-44 },
numpages = {9},
url = { https://ijcaonline.org/archives/volume118/number11/20793-3454/ },
doi = { 10.5120/20793-3454 },
publisher = {Foundation of Computer Science (FCS), NY, USA},
address = {New York, USA}
}
%0 Journal Article
%1 2024-02-06T23:01:27.313767+05:30
%A Poonam Nagale
%A Alka Vishwa
%T Identification and Classification of Web Pages with Specified Domain
%J International Journal of Computer Applications
%@ 0975-8887
%V 118
%N 11
%P 41-44
%D 2015
%I Foundation of Computer Science (FCS), NY, USA
Abstract

Internet is very large source of information. But these flow of information need to be controlled in various organizations, i. e. in companies job portals and personal mail services are blocked, in colleges entertainment related websites are blocked. Consider college scenario, Admin have to keep watch on site the student accessing. He uses proxy services and firewall on sites that are not allowed to student access. But as per the growth of internet every day new sites launched in the market. It is not always feasible to admin to keep track on that, also every time we have to pay for proxy for each new site as well as it is somewhat time consuming. So, we are clustering the web links into five domain and the keywords must be preprocessed by Adaptive preprocessing technique to increase the performance of the system.

References
  1. Indre Zliobait e and Bogdan Gabrys "Adaptive Preprocessing for Streaming Data", IEEE transactions on knowledge and data engineering, vol. 26, no. 2, february 2014
  2. S. M. Kamruzzaman "Web Page Categorization Using Artificial Neural Networks",NetworksProceedings of the 4th International Conference on Electrical EngineeringJanuary, 2006.
  3. Aijun An and Xiangji Huang,"Feature selection with rough sets for webpage categorization", York University, Toronto, Ontario, Canada. 2009.
  4. Dou Shen, Zheng Chen, Qiang Yang, Hua-Jun Zeng, Benyu Zhang, Yuchang Lu, Wei-Ying Ma, "Web-page Classification through Summarization", SIGIR04, Sheffield, South Yorkshire, UK. Copyright 2004 ACM 1-58113-881-4/04/0007, July 2529, 2004.
  5. Arul Prakash Asirvatham Kranthi Kumar. Ravi,"Web Page Classification based on Document Structure", International Institute of Information Technology Hyderabad, INDIA 500019
  6. Makoto Tsukada, Takashi Washio, Hiroshi Motoda, "Automatic Web- Page Classification by Using Machine Learning Methods"
  7. Institute of Scientific and Industrial Research, Osaka University Mihogaoka, Ibaraki,Osaka 567-0047, JAPAN.
  8. Min-Yen Kan,"Web page categorization without the web page" WWW2004, New York, New York, USA. ACM 1-58113-912-8/04/0005. Osaka University Mihogaoka, Ibaraki, Osaka 567-0047, JAPAN, May 1722, 2004.
  9. A. Bifet, G. Holmes, B. Pfahringer, R. Kirkby, and R. Gavalda ,"New Ensemble Methods for Evolving Data Streams" , Proc. 15th ACM SIGKDD Intl Conf. Knowledge Discovery and Data Mining(KDD 09),pp. 139-148, 2009.
  10. E. Ikonomovska, J. Gama, and S. Dzeroski,"Learning Model Trees from Evolving Data Streams", Data Mining Knowledge Discovery,vol. 23, no. 1, pp. 128-168, 2011. .
  11. P. Kadlec and B. Gabrys,"Architecture for Development of Adaptive on-Line Prediction Models", Memetic Computing,vol. 1, no. 4, pp. 241- 269, 2009.
  12. M. Masud, J. Gao, L. Khan, J. Han, and B. Thuraisingham," Classification and Novel Class Detection in Concept-Drifting Data Streams under Time Constraints", IEEE Trans. Knowledge and Data Eng. ,vol. 23, no. 6, pp. 859-874, June 2011.
  13. D. Boley, M. Gini, R. Gross, E-H. S. Han, K. Hastings, G. Karypis, V. Kumar, B. Mobasher, and J. Moore,"Partitioning-based clustering for web document categorization", Decision Support System. ,1999.
  14. SPrabhakar Gold wasser and Eli Uphal Chandra Chekuri Michale,Stamford University,"Web Search Using Automatic Categorization", IBM alamden Research Center, 650 Harry Road,San Jose CA. s
Index Terms

Computer Science
Information Sciences

Keywords

Web mining Feature extraction ANN Preprocessing data.