CFP last date
20 March 2024
Reseach Article

Plagiarism Detection of C Program using Assembly Language

by Shashank Chauhan, Anuja Arora, Yash Singhal
International Journal of Computer Applications
Foundation of Computer Science (FCS), NY, USA
Volume 158 - Number 3
Year of Publication: 2017
Authors: Shashank Chauhan, Anuja Arora, Yash Singhal
10.5120/ijca2017912767

Shashank Chauhan, Anuja Arora, Yash Singhal . Plagiarism Detection of C Program using Assembly Language. International Journal of Computer Applications. 158, 3 ( Jan 2017), 17-22. DOI=10.5120/ijca2017912767

@article{ 10.5120/ijca2017912767,
author = { Shashank Chauhan, Anuja Arora, Yash Singhal },
title = { Plagiarism Detection of C Program using Assembly Language },
journal = { International Journal of Computer Applications },
issue_date = { Jan 2017 },
volume = { 158 },
number = { 3 },
month = { Jan },
year = { 2017 },
issn = { 0975-8887 },
pages = { 17-22 },
numpages = {9},
url = { https://ijcaonline.org/archives/volume158/number3/26888-2017912767/ },
doi = { 10.5120/ijca2017912767 },
publisher = {Foundation of Computer Science (FCS), NY, USA},
address = {New York, USA}
}
%0 Journal Article
%1 2024-02-07T00:04:29.051157+05:30
%A Shashank Chauhan
%A Anuja Arora
%A Yash Singhal
%T Plagiarism Detection of C Program using Assembly Language
%J International Journal of Computer Applications
%@ 0975-8887
%V 158
%N 3
%P 17-22
%D 2017
%I Foundation of Computer Science (FCS), NY, USA
Abstract

Source code plagiarism is becoming a common practice among higher education community. People duplicate and modify the source code of other people and show the program as their own program. In this paper, we want to draw researchers’ attention towards this problem and projected a novel approach which detects plagiarism in C language code by converting it into assembly language which is done with the help of GCC compiler. Assembly language converted by the compiler is not sensitive to all type of different code transformation, for example-swapping variable names, reformation of language, adding extra comment or blanks. Therefore, assembly language gives rise to reduced amount of variations, if there is a modification in the original code. Previous works in plagiarism compares the whole program but in this paper, we proposed a method which split the C program into assembly language code and divide each function of program into blocks and blocks are transformed into token strings. This method compares each function with other program function and provides a statistical output, according to the token string likeness of that function. If the output is above assigned specific plagiarism similarity threshold value then it counts under the case of plagiarism.

References
  1. http://www.historians.org/about-aha-andmembership/ governance/policies-and-documents/statement onplagiarism.
  2. J.A.W Faidhi and S. K. Robinson, ”An empirical approach fordetecting program similarity and plagiarism within a university programming environment,” Comput. Educ.. vol. 11. pp. 11-19, 1987.
  3. Ottenstein, K.J.: An Algorithmic Approach to the Detection and Prevention of Plagiarism.CSD-TR200 103(2), 32–39 (1976).
  4. Schleimer, S., Wilkerson, D., Aiken, A.:Winnowing:LocalAlgorithmsforDocumentFingerprinting. In: ACM SIGMOD 2003, pp. 204–212. ACM Press, SanDiego (2003)
  5. Wise, M.J.: YAP3: improved detection of similarities in computer program and other texts.In: Proceedings of the Twenty-Seventh SIGCSE Technical Symposium on ComputerScience Education, vol. 28(1), pp. 130–134. Association for Computing Machinery, NewYork (1996)
  6. G. Whale, “Plague : plagiarism detection using program structure,” Dept. of Computer Science Technical Report 8805, University of NSW,Kensington, Australia, 2008
  7. H. L. Berghel and D. L. Sallach. Measurements of program similarity in identical task environments. ACM SIGPLAN Notices, 19(8):65–76, August 1984.
  8. John L. Donaldson, Ann-Marie Lancaster, and Paul H. Sposato. A plagiarism detection system. ACM SIGSCE Bulletin (Proc. of 12th SIGSCE Technical Symp.), 13(1):21–25, February 1981
  9. M. J. Wise, “Detection of Similarities in Student Programs: YAP'ing may be Preferable to Plague'ing,” ACM SIGSCE Bulletin (proc. Of 23rd SIGCSE Technical Symp.), 2002.
  10. P. Lutz, M. Guido, and M. Phlippsen, “JPlag: Finding plagiarisms among a set of programs,”Fakultätfür Informatik Technical Report 2000-1, Universität Kalrsruhe, Karlsruhe, Germany, 2000. International Journal of Computer Theory and Engineering Vol. 4, No. 2, April 2012
  11. Sam Grier. A tool that detects plagiarism in Pascal programs. ACM SIGSCE Bulletin (Proc. of 12th SIGSCE Technical Symp.), 13(1):15– 20, February 1981.
  12. DickGrune website regarding to similarity measure URL:http://www.dickgrune.com/Programs/similarity_tester/
  13. Jplag tool site URL: http://jplag.ipd.kit.edu
  14. Divya Luke, Divya P.S, Sony L Johnson, Sreeprabha S, Elizabeth.B.Varghese, 2014, “Software Plagiarism Detection Techniques: A Comparative Study”, International Journal of Computer Science and Information Technologies, Vol. 5 (4), ISSN: 0975-9646
  15. Enrique Flores, Alberto Barr´on-Cede˜no, Paolo Rosso, Lidia Moreno , Jun 2012, “DeSoCoRe: Detecting Source Code Re-Use across Programming Languages” , NAACL-HLT 2012
  16. Shan S.,Guo F.,Ren J.:similarity detection method based on assmebly language and string matching
  17. Gupta A., Singh S.: lexical analysis for the measurement of conceptual duplicity between c programs , in proceedings of vol. 1 issue, AUGUST 2013.
  18. Moussiades L., and Vakali A.,:a clustering approach for detecting plagiarism in source code datasets,in proceeding of the computer journal(november 2005) 48(b):6551-661:10.1092/comjnl/bxh119 first published online/:june 24,2005.
  19. Haritha, N., Bhavani, M., & Thammi Reddy, K. (2011). C Code Plagiarism Detection System. International Journal of Science and Advanced Technology, 1(5), 198-203.
Index Terms

Computer Science
Information Sciences

Keywords

Plagiarism assmebly language string similarity Plagiarism detection method token string