Research Article

Search Engine using Apache Lucene

by Mamatha Balipa, Balasubramani R.
International Journal of Computer Applications
Foundation of Computer Science (FCS), NY, USA
Volume 127 - Number 9
Year of Publication: 2015
Authors: Mamatha Balipa, Balasubramani R.
10.5120/ijca2015906476

Mamatha Balipa, Balasubramani R. Search Engine using Apache Lucene. International Journal of Computer Applications. 127, 9 (October 2015), 27-30. DOI=10.5120/ijca2015906476

@article{ 10.5120/ijca2015906476,
author = { Mamatha Balipa and Balasubramani R. },
title = { Search Engine using Apache Lucene },
journal = { International Journal of Computer Applications },
issue_date = { October 2015 },
volume = { 127 },
number = { 9 },
month = { October },
year = { 2015 },
issn = { 0975-8887 },
pages = { 27-30 },
numpages = {9},
url = { https://ijcaonline.org/archives/volume127/number9/22759-2015906476/ },
doi = { 10.5120/ijca2015906476 },
publisher = {Foundation of Computer Science (FCS), NY, USA},
address = {New York, USA}
}
%0 Journal Article
%1 2024-02-06T23:19:28.539264+05:30
%A Mamatha Balipa
%A Balasubramani R.
%T Search Engine using Apache Lucene
%J International Journal of Computer Applications
%@ 0975-8887
%V 127
%N 9
%P 27-30
%D 2015
%I Foundation of Computer Science (FCS), NY, USA
Abstract

The World Wide Web is a huge network of billions of workstations, and it contains billions of web pages covering a wide variety of topics. On social networking sites, people discuss many topics and share opinions and suggestions that interest users. Current search engines still suffer from low precision and low recall, so an effective search engine that applies Web mining technology has become very important. This paper discusses the technologies used to implement a search engine and its techniques, such as indexing and searching on the World Wide Web. The authors describe a method for creating a search engine using JSoup and the Apache Lucene API.
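
As a rough illustration of the indexing and searching steps named in the abstract, the following sketch builds a toy inverted index using only the Java standard library. It is not the authors' implementation and does not use the JSoup or Apache Lucene APIs; the class name `TinyInvertedIndex`, its methods, and the sample documents are hypothetical, and the AND-only query semantics are an assumption made to keep the example short.

```java
import java.util.ArrayList;
import java.util.HashMap;
import java.util.List;
import java.util.Locale;
import java.util.Map;
import java.util.TreeSet;

/** Toy inverted index (hypothetical): maps each lowercase term to the sorted ids of documents containing it. */
class TinyInvertedIndex {
    private final Map<String, TreeSet<Integer>> postings = new HashMap<>();

    /** Indexing: split the text into terms and record the document id under each term. */
    public void add(int docId, String text) {
        for (String term : tokenize(text)) {
            postings.computeIfAbsent(term, t -> new TreeSet<>()).add(docId);
        }
    }

    /** Searching: return ids of documents that contain every query term (AND semantics, an assumption). */
    public List<Integer> search(String query) {
        TreeSet<Integer> result = null;
        for (String term : tokenize(query)) {
            TreeSet<Integer> docs = postings.getOrDefault(term, new TreeSet<>());
            if (result == null) {
                result = new TreeSet<>(docs);   // first term seeds the candidate set
            } else {
                result.retainAll(docs);         // later terms narrow it
            }
        }
        return result == null ? new ArrayList<>() : new ArrayList<>(result);
    }

    /** Lowercase the text and split on anything that is not an ASCII letter or digit. */
    private static List<String> tokenize(String text) {
        List<String> terms = new ArrayList<>();
        for (String t : text.toLowerCase(Locale.ROOT).split("[^a-z0-9]+")) {
            if (!t.isEmpty()) {
                terms.add(t);
            }
        }
        return terms;
    }

    public static void main(String[] args) {
        TinyInvertedIndex index = new TinyInvertedIndex();
        // Sample documents are made up for illustration.
        index.add(1, "Apache Lucene builds an inverted index");
        index.add(2, "JSoup parses HTML pages fetched by a crawler");
        index.add(3, "Lucene searches the index built from crawled pages");
        System.out.println(index.search("lucene index")); // prints [1, 3]
    }
}
```

In a system like the one the abstract describes, a crawler and HTML parser would supply the text passed to `add`, and a library such as Apache Lucene would replace this toy structure with analyzers, persistent storage, and relevance ranking.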

Index Terms

Computer Science
Information Sciences

Keywords

Web crawler, searching, indexing, JSoup, Apache Lucene