CFP last date
20 May 2024
Call for Paper
June Edition
IJCA solicits high quality original research papers for the upcoming June edition of the journal. The last date of research paper submission is 20 May 2024

Submit your paper
Know more
Reseach Article

Implementation of MapReduce Algorithm and Nutch Distributed File System in Nutch

Published on August 2011 by Kowsalya N, Dr. C. Chandrasekar
International Conference on Advanced Computer Technology
Foundation of Computer Science USA
ICACT - Number 1
August 2011
Authors: Kowsalya N, Dr. C. Chandrasekar
a670f6fd-de5a-4bf3-bd8e-130ab6e60300

Kowsalya N, Dr. C. Chandrasekar . Implementation of MapReduce Algorithm and Nutch Distributed File System in Nutch. International Conference on Advanced Computer Technology. ICACT, 1 (August 2011), 6-11.

@article{
author = { Kowsalya N, Dr. C. Chandrasekar },
title = { Implementation of MapReduce Algorithm and Nutch Distributed File System in Nutch },
journal = { International Conference on Advanced Computer Technology },
issue_date = { August 2011 },
volume = { ICACT },
number = { 1 },
month = { August },
year = { 2011 },
issn = 0975-8887,
pages = { 6-11 },
numpages = 6,
url = { /proceedings/icact/number1/3230-icact079/ },
publisher = {Foundation of Computer Science (FCS), NY, USA},
address = {New York, USA}
}
%0 Proceeding Article
%1 International Conference on Advanced Computer Technology
%A Kowsalya N
%A Dr. C. Chandrasekar
%T Implementation of MapReduce Algorithm and Nutch Distributed File System in Nutch
%J International Conference on Advanced Computer Technology
%@ 0975-8887
%V ICACT
%N 1
%P 6-11
%D 2011
%I International Journal of Computer Applications
Abstract

This paper provides an in-depth description of MapReduce algorithm and Nutch Distributed File System in Nutch web search engine. Nutch is an open-source Web search engine that can be used at global, local, and even personal scale. To engineer a search engine is a challenging task. Search engines index tens to hundreds of millions of web pages involving a comparable number of distinct terms. They answer tens of millions of queries every day. Despite the importance of large-scale search engines on the web, very little academic research has been done on them. Furthermore, due to rapid advance in technology and web proliferation, creating a web search engine today is very different from ten years ago.

References
  1. White, Tom. 2006. “Introduction to Nutch, Part 1: Crawling”. Retrieved from http://today.java.net/pub/a/today/2006/01/10/introduction-to-nutch-1.html.
  2. Smart, John Ferguson. 2006. “Integrate Advanced Search Functionalities Into Your Apps”. Retrieved from
  3. Integrate advanced search functionalities into your apps at: http://www.javaworld.com/javaworld/jw-09-2006/jw-0925-lucene.html .
  4. Open source at: http://en.wikipedia.org/wiki/Open_Source
  5. Welcome to Apache Nutch at: http://lucene.apache.org/nutch/
  6. Welcome to Apache Nutch at: http://www.nutch.org/
  7. M. Cafarella and D. Cutting. Building Nutch: open source search. 2004.
  8. MapReduce and Simplified Data Processing on large Clusters, journal of Jeffrey Dean and Sanjay Ghemawat, Google, Inc.
Index Terms

Computer Science
Information Sciences

Keywords

MapReduce Algorithm Nutch Distributed Nutch