CFP last date
20 May 2024
Reseach Article

Automated Plagiarism Detection System for Malayalam Text Documents

by Sindhu.l, Bindu Baby Thomas, Sumum Mary Idicula
International Journal of Computer Applications
Foundation of Computer Science (FCS), NY, USA
Volume 106 - Number 15
Year of Publication: 2014
Authors: Sindhu.l, Bindu Baby Thomas, Sumum Mary Idicula
10.5120/18595-9843

Sindhu.l, Bindu Baby Thomas, Sumum Mary Idicula . Automated Plagiarism Detection System for Malayalam Text Documents. International Journal of Computer Applications. 106, 15 ( November 2014), 13-16. DOI=10.5120/18595-9843

@article{ 10.5120/18595-9843,
author = { Sindhu.l, Bindu Baby Thomas, Sumum Mary Idicula },
title = { Automated Plagiarism Detection System for Malayalam Text Documents },
journal = { International Journal of Computer Applications },
issue_date = { November 2014 },
volume = { 106 },
number = { 15 },
month = { November },
year = { 2014 },
issn = { 0975-8887 },
pages = { 13-16 },
numpages = {9},
url = { https://ijcaonline.org/archives/volume106/number15/18595-9843/ },
doi = { 10.5120/18595-9843 },
publisher = {Foundation of Computer Science (FCS), NY, USA},
address = {New York, USA}
}
%0 Journal Article
%1 2024-02-06T22:39:28.046125+05:30
%A Sindhu.l
%A Bindu Baby Thomas
%A Sumum Mary Idicula
%T Automated Plagiarism Detection System for Malayalam Text Documents
%J International Journal of Computer Applications
%@ 0975-8887
%V 106
%N 15
%P 13-16
%D 2014
%I Foundation of Computer Science (FCS), NY, USA
Abstract

In this paper, a plagiarism detection tool for plagiarism detection in Malayalam documents is presented. Many language-sensitive tools for detecting plagiarism in natural language documents have been developed, particularly for English. Detecting plagiarism in Malayalam documents is particularly a challenging task because of the complex linguistic structure of Malayalam. The plagiarism detection tool presented here has the mechanism of detecting similarity beyond exact words match of Malayalam documents. . The tool is based on a new comparison algorithm that uses some NLP techniques to compare suspect documents which may not be identified using existing methods for Malayalam document plagiarism detection.

References
  1. Paul Clough 2000 Plagiarism in natural and programming languages: an overview of current tools and technologies, Department of Computer Science, University of Sheffield, technical report
  2. Clough, P. and Stevenson, M. 2009. Developing A Corpus of Plagiarised Short Answers, Language Resources and Evaluation: Special Issue on Plagiarism and Authorship Analysis, In Press. Journal Language Resources and Evaluation.
  3. C. Manning and H. Schutze. 1999, Foundation of Statistical Natural Language Processing , The MIT Press, Massachusetts Institute of technology , Cambridge, USA, ISBN 0-262-13360-1.
  4. Lancaster, T. and Culwin, F. 2007. Preserving academic integrityfighting against nonoriginality agencies. British Journal of Educational Technology. 38, 1 , 153-157.
  5. Z. Ceska. 2008. Plagiarism detection based on Singular value decomposition: Advances in Natural Language Processing 5221, 108-119.
  6. Shivakumar, N. and Garcia-Molina, H. 1995. SCAM: A copy detection mechanism for digital documents. Proceedings of the Second Annual Conference on the Theory and Practice of Digital Libraries.
  7. Hoad, T. C. and Zobel, J. 2003. Methods for identifying versioned and plagiarized documents. Journal of the American Society for Information Science and Technology. 54, 3, 203–215.
  8. L Sindhu,. Bindu Baby Thomas, and Sumam Mary Idicula , 2013„A Copy detection Method for Malayalam Text Documents using n-grams Model?, National conference on Indian Language Computing ,Department of Computer Science, CUSAT.
  9. Sindhu. L, Thomas, B. B. , and Idicula, S. M. 2011. A Study of Plagiarism Detection Tools and Technologies. International Journal of Advanced Research in Technology, vol. 1. pp. 64-70.
Index Terms

Computer Science
Information Sciences

Keywords

Plagiarism Detection Malayalam Natural Language Processing Lemmatization