CFP last date
20 May 2024
Reseach Article

Multi-Agent based Sequence Algorithm for Detecting Plagiarism and Clones in Java Source Code using Abstract Syntax Tree

by D.poongodi, G.tholkkappia Arasu
International Journal of Computer Applications
Foundation of Computer Science (FCS), NY, USA
Volume 90 - Number 15
Year of Publication: 2014
Authors: D.poongodi, G.tholkkappia Arasu
10.5120/15796-4494

D.poongodi, G.tholkkappia Arasu . Multi-Agent based Sequence Algorithm for Detecting Plagiarism and Clones in Java Source Code using Abstract Syntax Tree. International Journal of Computer Applications. 90, 15 ( March 2014), 19-24. DOI=10.5120/15796-4494

@article{ 10.5120/15796-4494,
author = { D.poongodi, G.tholkkappia Arasu },
title = { Multi-Agent based Sequence Algorithm for Detecting Plagiarism and Clones in Java Source Code using Abstract Syntax Tree },
journal = { International Journal of Computer Applications },
issue_date = { March 2014 },
volume = { 90 },
number = { 15 },
month = { March },
year = { 2014 },
issn = { 0975-8887 },
pages = { 19-24 },
numpages = {9},
url = { https://ijcaonline.org/archives/volume90/number15/15796-4494/ },
doi = { 10.5120/15796-4494 },
publisher = {Foundation of Computer Science (FCS), NY, USA},
address = {New York, USA}
}
%0 Journal Article
%1 2024-02-06T22:11:07.227319+05:30
%A D.poongodi
%A G.tholkkappia Arasu
%T Multi-Agent based Sequence Algorithm for Detecting Plagiarism and Clones in Java Source Code using Abstract Syntax Tree
%J International Journal of Computer Applications
%@ 0975-8887
%V 90
%N 15
%P 19-24
%D 2014
%I Foundation of Computer Science (FCS), NY, USA
Abstract

Plagiarism and clone detection plays an important role in software security protection, software maintenance and license issues. Source-code similarity detection method can be classified as string-based, token-based, parse-tree-based and program-dependency-based. All of these approaches have certain limitations and can not meet the requirements when the source code is large and may produce false positives. But, parse-tree based detection improves the detection ability and efficiency. This paper describes method and statement based source code similarity detection, which detects the simple plagiarized code like exact match, near exact match and longest common sequence using multi-agent based detection which will perform the detection automatically. Automatic plagiarism detection will be helpful for code clone detection in software industry and plagiarism detection in projects.

References
  1. Roy, ChanchalKumar;Cordy, James R. . "A Survey on Software Clone Detection Research". School of Computing , Queen's University, Canada. Vol 115, TR2007-541, September 26, 2007.
  2. Baxter,I. D, Yahin,A. ; Moura, L. ; Sant'Anna,M; Bier, L. "Clone detection using abstract syntax trees", International conference on software maintenance 1998, 598, ISBN:0-8186-8779-7.
  3. Michel Chilowicz, Etienne Duris and Gilles Roussel"Syntax tree fingerprinting: a foundation for source code similarity detection", 17th IEEE International Conference on Program Comprehension (ICPC'09). Vancouver, BC, Canada. May 2009. pp. 243–247. IEEE Computer Society.
  4. IulianNeamtiu;Jeffrey S. Foster;Michael Hicks "Understanding Source Code Evolution Using Abstract Syntax Tree Matching" MSR '05 , Volume 30 Issue 4, Pages 1-5, ISBN:1-59593-123-6, July 2005.
  5. Lingxiao Jiang Ghassan andSt´ephaneGlondu"DECKARD: Scalable and Accurate Tree-based Detection of Code Clones" 29th International Conference on Software Engineering 2007, 96-105, May 2007.
  6. Matt G. Ellis, Claude W. Anderson "Plagiarism Detection in Computer Code", http://citeseerx. ist. psu. edu/viewdoc/download?doi=10. 1. 1. 92. 8027&rep=rep1&type=pdf , March 23, 2005.
  7. William S. Evans Christopher W. Fraser Fei Ma "Clone Detection via Structural Abstraction", Journal of Software QualityControl, Vol 17, Issue 4, 309-330, Dec 2009.
  8. Rainer Koschke, RaimarFalke, Pierre Frenzel "Clone Detection Using Abstract Syntax Suffix Trees" 13th Working Conference on Reverse Engineering (WCRE 2006), 253-262, ISBN:0-7695-2719-1, October 2006.
  9. Kevin Greenan, "Method-Level Code Clone Detection on Transformed Abstract Syntax Trees Using Sequence Matching Algorithms"University of California - Santa Cruz , 2005
  10. Baojiang Cui, Jun Guan, Tao Guo, Lifang Han, Jianxin Wang and Yupeng J "Code Syntax-Comparison Algorithm based on Type-Redefinition-Preprocessing and Rehash Classification" , Journal of Multimedia, Vol 6, No 4 (2011), 320-328, Aug 2011
  11. Young-Chul Kim and Jaeyoung Choi "A Program Plagiarism Evaluation System", ICCSA 2005 http://link. springer. com/bookseries/558Volume 3483, 2005.
  12. A. S. Bin-Habtoor and M. A. Zaher, "A Survey on Plagiarism Detection Systems", International Journal of Computer Theory and Engineering, Vol 4. No. 2, April 2012.
  13. Christian Arwin and S. M. M. Tahaghoghi, "Plagiarism Detection across Programming Lanugages", ACSC'06, Vol. 48, 277-286 , 2006, ACM.
  14. Tahira Khatoon, Priyansha Singh and Shikha shukla, "Abstract Syntax Tree Based Clone Detection for Java Projects", IOSR '12, Vol. 2, 45-47, Issue 12, Dec 2012.
Index Terms

Computer Science
Information Sciences

Keywords

Abstract syntax tree plagiarism detection source code plagiarism detection parse tree code clone.