Call for Paper - December 2020 Edition
IJCA solicits original research papers for the December 2020 Edition. Last date of manuscript submission is November 20, 2020. Read More

Plagiarism Checker: Text Mining

International Journal of Computer Applications
Foundation of Computer Science (FCS), NY, USA
Year of Publication: 2016
Anu Saini, Ankita Bahl, Supriya Kumari, Mitali Singh

Anu Saini, Ankita Bahl, Supriya Kumari and Mitali Singh. Article: Plagiarism Checker: Text Mining. International Journal of Computer Applications 134(3):8-11, January 2016. Published by Foundation of Computer Science (FCS), NY, USA. BibTeX

	author = {Anu Saini and Ankita Bahl and Supriya Kumari and Mitali Singh},
	title = {Article: Plagiarism Checker: Text Mining},
	journal = {International Journal of Computer Applications},
	year = {2016},
	volume = {134},
	number = {3},
	pages = {8-11},
	month = {January},
	note = {Published by Foundation of Computer Science (FCS), NY, USA}


In today’s world internet is the answer to every question. So at any time one can easily copy the content from web and use it. This is known as plagiarism .It is growing now days. Usually in plagiarism people reword the documents, copy them, do not give references. It is difficult to detect plagiarism as people rephrase the text do not copy it directly. To detect plagiarism apache lucene have been used. Firstly indexing of the original document is done and then used cosine similarity to compare the plagiarised document with set of documents which are there saved previously.


  1. S.A.Hiremath and M.S.Otari ,”Plagiarism Detection-Different Methods and Their Analysis: Review”, International Journal of Innovative Research in Advanced Engineering (IJIRAE) ISSN: 2349-2163 Volume 1 Issue 7, August 2014
  2. Ahmad Gull Liaqat & Aijaz Ahmad ,”Plagiarism Detection in Java Code “,Linnaeus University, June 2011
  3. Asim M. El Tahir Ali, Hussam M. Dahwa Abdulla, and V´aclav Sn´aˇsel ,”Overview and Comparison of Plagiarism Detection Tools” 161{172, ISBN 978-80-248-2391-1., 2011
  4. Daniele Anzelmi, Domenico Carlone, Fabio Rizzello, Robert Thomsen, D. M. Akbar Hussain,”Plagiarism Detection Based on SCAM Algorithm”, Proceedings of the International MultiConference of Engineers and Computer Scientists, March 2011
  5. Bela Gipp Norman Meuschke ,”Citation Pattern Matching Algorithms for Citation-based Plagiarism Detection: Greedy Citation Tiling, Citation Chunking and Longest Common Citation Sequence”, Mountain View, CA, USA, September 2011
  7. Romans Lukashenko, Vita Graudina, Janis Grundspenkis, "Computer-Based Plagiarism Detection Methods and Tools: An Overview”, International Conference on Computer Systems and Technologies - CompSysTech’07, 2007
  8. Reena Kharat, Preeti M. Chavan, Vaibhav Jadhav, Kuldeep Rakibe,”Semantically Detecting Plagiarism for Research Papers”, International Journal of Engineering Research and Applications (IJERA), May-Jun 2013


Apache Lucene, Indexing, Cosine similarity, Plagiarism