CFP last date
20 June 2024
Reseach Article

A Three Stage Classifier for Efficient Website Categorization

Published on May 2016 by Dhanashri S. Hulavale, Saurabh H. Deshmukh
National Conference on Advancements in Computer & Information Technology
Foundation of Computer Science USA
NCACIT2016 - Number 2
May 2016
Authors: Dhanashri S. Hulavale, Saurabh H. Deshmukh
2f61370b-c52c-4519-b8ac-cd0981c5b126

Dhanashri S. Hulavale, Saurabh H. Deshmukh . A Three Stage Classifier for Efficient Website Categorization. National Conference on Advancements in Computer & Information Technology. NCACIT2016, 2 (May 2016), 22-24.

@article{
author = { Dhanashri S. Hulavale, Saurabh H. Deshmukh },
title = { A Three Stage Classifier for Efficient Website Categorization },
journal = { National Conference on Advancements in Computer & Information Technology },
issue_date = { May 2016 },
volume = { NCACIT2016 },
number = { 2 },
month = { May },
year = { 2016 },
issn = 0975-8887,
pages = { 22-24 },
numpages = 3,
url = { /proceedings/ncacit2016/number2/24708-3044/ },
publisher = {Foundation of Computer Science (FCS), NY, USA},
address = {New York, USA}
}
%0 Proceeding Article
%1 National Conference on Advancements in Computer & Information Technology
%A Dhanashri S. Hulavale
%A Saurabh H. Deshmukh
%T A Three Stage Classifier for Efficient Website Categorization
%J National Conference on Advancements in Computer & Information Technology
%@ 0975-8887
%V NCACIT2016
%N 2
%P 22-24
%D 2016
%I International Journal of Computer Applications
Abstract

Website categorization is one of the challenging tasks in the world of ever increasing web technologies. There are different way to categorization of web pages using different features and approach. Website contains lot of information like text, images, animation, video and links. So this information is call as features of website. For the website categorization purpose all Feature have most important role. The web has a lot of information in the form of images, video, animation and text etc present in the document. In proposed System uses number of feature of website and use three different classifier for website classification are naive bays classifier, linear classifier- perceptron and stochastic classifier. Here eight major categories of website have been selected for categorization; these are business & economy, job search, and science, education, sports, news & media, government, entertainment. Proposed system gives ranking to website. It will be more helpful for software developer or website designer for evolution of their site using our system so that they can judge that their website belongs to respective category or not.

References
  1. Arul Prakash, Kranti Kumar Ravi,Asirvhatam -Web Page Catego- rization based on Document Structure. In International Institute of IT, Hyderabad, India.
  2. J. B. Leela devi, 2dr. A. Sankar-IMPROVEMENTS IN NEURAL NETWORK FOR CLASSIFICATION OF WEB PAGES 1j. B. Leela devi, 2dr. A. Sankar 1university research scholar, anna university, Tamil Nadu, India 2. Associate Professor, PSG College of Tech- nology, Coimbatore, India.
  3. BRIAN D. DAVISON and XIAOGUANG QI. Web Page Classi- cation: Features and Algorithms. In ACM Computing Surveys, Article 12, Publication date: February 2009
  4. H. Yu, J. Han, and K. C. C. Chang. Pebl, -Positive example based learning for web page categorization. In Canada, KDD, Edmon- ton, Canada, 2002,
  5. In Iran University of Science Technology (IUST), Tehran, Iran. Arash Rezaei, and Behrouz Minaei-Bidgoli-Comparison be- tween the Classi?cation Methods using Type of Attributes and Sample Size.
  6. Sundus Hassan and Muhammad Shahid in Computer Science Department NUCES-FAST, Karachi Campus,Comparing NB Clas- si?ers SVM for Text Classi?cation.
  7. Tat-Seng Chua Hui Yang. Effectiveness of web page categoriza- tion on Finding List Answer, In National University of Singapore.
  8. E-H. S. Han, K. Hastings, D. Boley, M. Gini,V. Kumar, B. Mobasher, R. Gross, E-H. S. Han, J. Moore,K. Hastings and G. Karypis In 1999,DecisionSupportSystem, web document catego- rization using Partitioning-based clustering .
  9. G. W. F. S. Lawrence, W. P. Birmingham, A. Kruger , D. M. Pennock - Improving category speci?c web search by learning query modi?cations,san diego, Saint 2001, California.
  10. Arul Prakash, Kranti Kumar and Ravi,Asirvhatam -Web Page Categorization based on Document Structure. In International Institute of IT at Hyderabad.
  11. Komal Kumar,Pikakshi Manchanda and Sonali Gupta. The Auto- mated Classi?cation of Web Pages Using Arti?cial Neural Net- work . Department of Computer Science, Faridabad, India
Index Terms

Computer Science
Information Sciences

Keywords

Linear Classifier- Perceptron Machine Learning Naive Bayes Stochastic Classifier