Plagiarism Checker: Text Mining

International Journal of Computer Applications
Foundation of Computer Science (FCS), NY, USA
Year of Publication: 2016
Anu Saini, Ankita Bahl, Supriya Kumari, Mitali Singh

In today’s world the internet is the answer to every question, so anyone can easily copy content from the web and use it at any time. This is known as plagiarism, and it is growing nowadays. Typically, plagiarists reword documents or copy them and do not give references. Plagiarism is difficult to detect because people rephrase the text rather than copying it directly. To detect plagiarism, Apache Lucene has been used. First, the original documents are indexed; then cosine similarity is used to compare the suspected plagiarised document with the set of previously saved documents.
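
The comparison step described above can be illustrated with a minimal sketch. It is not the authors' implementation and does not use Apache Lucene: it assumes plain term-frequency vectors built with the Python standard library, and the names `tokenize`, `cosine_similarity`, and `rank_sources` are hypothetical helpers introduced only for illustration. For two term-frequency vectors A and B, cosine similarity is the dot product of A and B divided by the product of their Euclidean norms.

```python
import math
import re
from collections import Counter


def tokenize(text):
    """Lowercase the text and split it into alphanumeric tokens (an assumed, simple analyzer)."""
    return re.findall(r"[a-z0-9]+", text.lower())


def cosine_similarity(text_a, text_b):
    """Cosine similarity between the raw term-frequency vectors of two texts."""
    tf_a = Counter(tokenize(text_a))
    tf_b = Counter(tokenize(text_b))
    # Only terms present in both texts contribute to the dot product.
    dot = sum(tf_a[t] * tf_b[t] for t in tf_a.keys() & tf_b.keys())
    norm_a = math.sqrt(sum(c * c for c in tf_a.values()))
    norm_b = math.sqrt(sum(c * c for c in tf_b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0  # an empty text shares nothing with any other text
    return dot / (norm_a * norm_b)


def rank_sources(suspect, corpus):
    """Score the suspect text against each stored document, highest similarity first.

    `corpus` is a hypothetical mapping of document name to document text.
    """
    scores = {name: cosine_similarity(suspect, text) for name, text in corpus.items()}
    return sorted(scores.items(), key=lambda item: item[1], reverse=True)
```

A stored document that shares most of its vocabulary with the suspect text scores close to 1, while an unrelated document scores close to 0. Raw term counts are an assumption of this sketch; a search library would typically apply its own term weighting and analysis, so its scores would differ.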


Apache Lucene, Indexing, Cosine similarity, Plagiarism