CFP last date
20 May 2024
Call for Paper
June Edition
IJCA solicits high quality original research papers for the upcoming June edition of the journal. The last date of research paper submission is 20 May 2024

Submit your paper
Know more
Reseach Article

Adapted Web Crawler for Mining Offline Web Data using AFHC

by S. Amudha
International Journal of Computer Applications
Foundation of Computer Science (FCS), NY, USA
Volume 81 - Number 8
Year of Publication: 2013
Authors: S. Amudha
10.5120/14030-1398

S. Amudha . Adapted Web Crawler for Mining Offline Web Data using AFHC. International Journal of Computer Applications. 81, 8 ( November 2013), 6-10. DOI=10.5120/14030-1398

@article{ 10.5120/14030-1398,
author = { S. Amudha },
title = { Adapted Web Crawler for Mining Offline Web Data using AFHC },
journal = { International Journal of Computer Applications },
issue_date = { November 2013 },
volume = { 81 },
number = { 8 },
month = { November },
year = { 2013 },
issn = { 0975-8887 },
pages = { 6-10 },
numpages = {9},
url = { https://ijcaonline.org/archives/volume81/number8/14030-1398/ },
doi = { 10.5120/14030-1398 },
publisher = {Foundation of Computer Science (FCS), NY, USA},
address = {New York, USA}
}
%0 Journal Article
%1 2024-02-06T21:55:31.777116+05:30
%A S. Amudha
%T Adapted Web Crawler for Mining Offline Web Data using AFHC
%J International Journal of Computer Applications
%@ 0975-8887
%V 81
%N 8
%P 6-10
%D 2013
%I Foundation of Computer Science (FCS), NY, USA
Abstract

Adaptive focused hyperlink crawler (AFHC) aim to search the entire inner level sub link of the web pages related to a specific topic and to download unique web pages to local disk. The download web page information searched in offline browsing and avoids the repeated searches in the web server to give a solution to problem. The major problem is to retrieve the maximal set of relevant and quality pages. Crawler software can be retrieve web pages by hyperlinks through internet. The number of web sites using web crawlers cannot retrieve the relevant pages. The system have browser and search engine. Browser using AFHC reduce time to finding accurate content in web pages in the hyperlinks and also restrict to download web pages to local disk. Search engine using extended cocitaion algorithm to retrieve accurate content in the local disk and search based on any word, all word and phrase matching in the local disk. It is useful to the student and organization. The research is easily found to personalize the crawl history and search history for knowing the user transactions.

References
  1. Debashis Hati, Amritesh Kumar, Lizashree Mishra, 2010, "Unvisited URL Relevancy Calculation in Focused Crawling Based on Naïve Bayesian Classification", International Journal of Computer Applications (0975 – 8887), Volume 3 – No. 9, July 2010. PP: 23-30
  2. Debashis Hati, Amritesh Kumar, 2010, "An Approach for Identifying URLs Based on Division Score and Link Score in Focused Crawler", International Journal of Computer Applications (0975 – 8887),Volume 2 – No. 3, May 2010. PP:48-53
  3. M. Sunil Kumar, P. Neelima, 2011, "Design and Implementation of Scalable, Fully Distributed Web Crawler for a Web Search Engine ",International Journal of Computer Applications(0975 – 8887), Volume 15– No. 7, February 2011. PP:8-13
  4. Niraj Singhal, Ashutosh Dixit, Dr. A. K. Sharma, 2010, "Design of a Priority Based Frequency Regulated Incremental Crawler",2010 International Journal of Computer Applications (0975 – 8887) Volume 1 – No. 1
  5. Parul Gupta, Dr. A. K. Sharma, 2010 ,"Context based Indexing in Search Engines using Ontology", ©2010 International Journal of Computer Applications (0975 – 8887), Volume 1 – No. 14. PP:49-52
  6. S. Thenmalar, T. V. Geetha, 2011," Concept based Focused Crawling using Ontology", International Journal of Computer Applications (0975 – 8887),Volume 26– No. 7, July 2011. PP:29-32
  7. Shekhar Mishra, Anurag Jain,Dr. A. K. Sachan, 2011,"A Query Based Approach To Reduce The Web Crawler Traffic Using HTTP Get Request And Dynamic Web Page", International Journal of Computer Applications (0975 – 8887), Volume 14– No. 3, January 2011. PP:8-14.
  8. Swati Mali, B. B. Meshram, 2011, "Focused Web Crawler with Page Change Detection Policy", 2nd International conference and workshop on Emerging Trends in Technology (ICWET) 2011, Proceedings published by International Journal of Computer Applications (IJCA). PP:51-57.
  9. Debashis Hati, Amritesh Kumar, 2010," UDBFC: An Effective Focused Crawling Approach Based On URL Distance Calculation", 978-1-4244-5539-3/10/$26. 00 ©2010 IEEE
  10. Swati Mali, B. B. Meshram, 2011," Focused Web Crawler with Page Change Detection Policy", 2nd International Conference and workshop on Emerging Trends in Technology (ICWET) 2011, Proceedings published by International Journal of Computer Applications® (IJCA)
Index Terms

Computer Science
Information Sciences

Keywords

Web Crawler Focused Crawler Web Mining.