Web Search Engines: Mining Right Information

Research Article

Published in May 2012 by Naveen, Dharmender Kumar

National Workshop-Cum-Conference on Recent Trends in Mathematics and Computing 2011

Foundation of Computer Science USA

RTMC - Number 2

May 2012

Authors: Naveen, Dharmender Kumar

Naveen, Dharmender Kumar. Web Search Engines: Mining Right Information. National Workshop-Cum-Conference on Recent Trends in Mathematics and Computing 2011. RTMC, 2 (May 2012), 25-27.

@article{
author = { Naveen and Dharmender Kumar },
title = { Web Search Engines: Mining Right Information },
journal = { National Workshop-Cum-Conference on Recent Trends in Mathematics and Computing 2011 },
issue_date = { May 2012 },
volume = { RTMC },
number = { 2 },
month = { May },
year = { 2012 },
issn = { 0975-8887 },
pages = { 25-27 },
numpages = { 3 },
url = { /proceedings/rtmc/number2/6632-1015/ },
publisher = { Foundation of Computer Science (FCS), NY, USA },
address = { New York, USA }
}

%0 Proceeding Article
%1 National Workshop-Cum-Conference on Recent Trends in Mathematics and Computing 2011
%A Naveen
%A Dharmender Kumar
%T Web Search Engines: Mining Right Information
%J National Workshop-Cum-Conference on Recent Trends in Mathematics and Computing 2011
%@ 0975-8887
%V RTMC
%N 2
%P 25-27
%D 2012
%I International Journal of Computer Applications

Abstract

A Web search engine maintains and catalogs the content of Web pages to make them easier to find and browse. Many search engines exist; they resemble one another but differ in their methods for scouring, storing, and retrieving information from the Web. Typically, a search engine searches Web pages for specified keywords and returns a list of documents containing them. This list is then sorted by relevance criteria that try to place the documents best matching the user's query in the first positions. For most people, the usefulness of a search engine depends on the relevance of the results it retrieves from the Web. Because the Web is growing rapidly, this paper addresses some of the major challenges faced by search engines.
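
The retrieve-then-rank flow described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' method: the page collection, the function names (build_index, search), and the scoring rule (total keyword occurrences) are all assumptions chosen for illustration, and real search engines use far richer relevance criteria.

```python
from collections import Counter, defaultdict

# Hypothetical page collection, invented for this illustration.
pages = {
    "a": "web search engine crawler",
    "b": "search search engine index",
    "c": "cooking recipes",
}

def build_index(pages):
    """Map each lowercase word to the set of page ids that contain it."""
    index = defaultdict(set)
    for page_id, text in pages.items():
        for word in text.lower().split():
            index[word].add(page_id)
    return index

def search(pages, index, query):
    """Return ids of pages containing every keyword, most relevant first."""
    keywords = query.lower().split()
    if not keywords:
        return []
    # Retrieval: keep only pages that contain all of the keywords.
    matches = set.intersection(*(index.get(k, set()) for k in keywords))

    # Ranking (assumed criterion): total number of keyword occurrences.
    def score(page_id):
        counts = Counter(pages[page_id].lower().split())
        return sum(counts[k] for k in keywords)

    # Sort by descending score; break ties by page id for a stable order.
    return sorted(matches, key=lambda p: (-score(p), p))

index = build_index(pages)
results = search(pages, index, "search engine")
```

Here the query "search engine" matches pages "a" and "b" but not "c", and "b" is placed first because it contains more keyword occurrences. Precomputing the word-to-pages index is what lets the retrieval step avoid scanning every page for every query.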

Index Terms

Computer Science

Information Sciences

Keywords

Web Search Engine, Clustering, Crawler, Hyper Text Transfer Protocol