Research Article

Study of Different Focused Web Crawler to Search Domain Specific Information

by Nisha N. Pawar, K. Rajeswari
International Journal of Computer Applications
Foundation of Computer Science (FCS), NY, USA
Volume 136 - Number 11
Year of Publication: 2016
Authors: Nisha N. Pawar, K. Rajeswari
10.5120/ijca2016908146

Nisha N. Pawar, K. Rajeswari. Study of Different Focused Web Crawler to Search Domain Specific Information. International Journal of Computer Applications. 136, 11 (February 2016), 1-4. DOI=10.5120/ijca2016908146

@article{ 10.5120/ijca2016908146,
author = { Nisha N. Pawar and K. Rajeswari },
title = { Study of Different Focused Web Crawler to Search Domain Specific Information },
journal = { International Journal of Computer Applications },
issue_date = { February 2016 },
volume = { 136 },
number = { 11 },
month = { February },
year = { 2016 },
issn = { 0975-8887 },
pages = { 1-4 },
numpages = {9},
url = { https://ijcaonline.org/archives/volume136/number11/24194-2016908146/ },
doi = { 10.5120/ijca2016908146 },
publisher = {Foundation of Computer Science (FCS), NY, USA},
address = {New York, USA}
}
%0 Journal Article
%1 2024-02-06T23:36:46.606105+05:30
%A Nisha N. Pawar
%A K. Rajeswari
%T Study of Different Focused Web Crawler to Search Domain Specific Information
%J International Journal of Computer Applications
%@ 0975-8887
%V 136
%N 11
%P 1-4
%D 2016
%I Foundation of Computer Science (FCS), NY, USA
Abstract

Using the correct medicinal plant to treat a disease is very important. Medicinal plants are an important part of the Indian Ayurvedic system, so there is a need for heuristic search for medicinal plants across India. A huge amount of information related to medicinal plants is available on the World Wide Web, and a focused web crawler is useful for collecting such domain-specific information from the internet. This paper proposes an efficient focused web crawler that uses a hybrid model of a Naïve Bayes classifier and a Decision Tree classifier. The classifier output determines whether a web page is relevant or not relevant. The proposed hybrid classification is expected to improve the accuracy of web crawling.
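The abstract does not say how the two classifiers are combined, so the following is only a minimal sketch of one possible hybrid relevance check, not the authors' implementation. It assumes a multinomial Naïve Bayes model with Laplace smoothing, a depth-1 decision tree (a "stump") that tests for a single word, and a rule that keeps a page only when both classifiers vote "relevant". The training pages, the class names and the `is_relevant` helper are all hypothetical and exist purely for illustration.

```python
import math
from collections import Counter


def tokenize(text):
    """Lowercase, split on whitespace, keep alphabetic tokens only."""
    return [w for w in text.lower().split() if w.isalpha()]


class NaiveBayes:
    """Multinomial Naive Bayes with Laplace (add-one) smoothing."""

    def fit(self, docs, labels):
        self.classes = sorted(set(labels))
        self.prior = {c: math.log(labels.count(c) / len(labels)) for c in self.classes}
        self.counts = {c: Counter() for c in self.classes}
        for doc, label in zip(docs, labels):
            self.counts[label].update(tokenize(doc))
        self.vocab = set().union(*self.counts.values())
        return self

    def predict(self, doc):
        words = [w for w in tokenize(doc) if w in self.vocab]  # ignore unseen words

        def score(c):
            total = sum(self.counts[c].values()) + len(self.vocab)
            return self.prior[c] + sum(math.log((self.counts[c][w] + 1) / total) for w in words)

        return max(self.classes, key=score)


class DecisionStump:
    """Depth-1 decision tree: one test on the presence of a single word."""

    def fit(self, docs, labels):
        token_sets = [set(tokenize(d)) for d in docs]
        best_correct = -1
        for word in sorted(set().union(*token_sets)):
            for polarity in (True, False):  # True: word present means relevant
                correct = sum(
                    ((word in toks) == polarity) == label
                    for toks, label in zip(token_sets, labels)
                )
                if correct > best_correct:
                    best_correct, self.word, self.polarity = correct, word, polarity
        return self

    def predict(self, doc):
        return (self.word in set(tokenize(doc))) == self.polarity


def is_relevant(page_text, nb, stump):
    # ASSUMPTION: the hybrid keeps a page only if both classifiers agree
    # that it is relevant; the paper's actual combination rule is not given.
    return bool(nb.predict(page_text) and stump.predict(page_text))


# Hypothetical toy training pages (True = relevant to medicinal plants).
docs = [
    "neem leaves treat skin infection ayurvedic herb",
    "tulsi herb ayurvedic remedy for cough",
    "ashwagandha root ayurvedic medicinal plant",
    "cricket match score india wins",
    "stock market shares fall today",
    "new phone launch price india",
]
labels = [True, True, True, False, False, False]

nb = NaiveBayes().fit(docs, labels)
stump = DecisionStump().fit(docs, labels)

print(is_relevant("ayurvedic herb for fever", nb, stump))   # True
print(is_relevant("india cricket score today", nb, stump))  # False
```

In a crawler, a check like this would be applied to each fetched page so that only links from pages judged relevant are followed. Requiring agreement favours precision over recall; other combinations, such as averaging the two scores, are equally plausible readings of "hybrid".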

Index Terms

Computer Science
Information Sciences

Keywords

Medicinal plants, Focused web crawler, Naïve Bayes classifier, Decision Tree