CFP last date
22 April 2024
Reseach Article

Style based Authorship Attribution on English Editorial Documents

by N. V. Ganapathi Raju, Ch. Sadhvi, P. Tejaswini, Y. Mounica
International Journal of Computer Applications
Foundation of Computer Science (FCS), NY, USA
Volume 159 - Number 4
Year of Publication: 2017
Authors: N. V. Ganapathi Raju, Ch. Sadhvi, P. Tejaswini, Y. Mounica
10.5120/ijca2017912899

N. V. Ganapathi Raju, Ch. Sadhvi, P. Tejaswini, Y. Mounica . Style based Authorship Attribution on English Editorial Documents. International Journal of Computer Applications. 159, 4 ( Feb 2017), 5-8. DOI=10.5120/ijca2017912899

@article{ 10.5120/ijca2017912899,
author = { N. V. Ganapathi Raju, Ch. Sadhvi, P. Tejaswini, Y. Mounica },
title = { Style based Authorship Attribution on English Editorial Documents },
journal = { International Journal of Computer Applications },
issue_date = { Feb 2017 },
volume = { 159 },
number = { 4 },
month = { Feb },
year = { 2017 },
issn = { 0975-8887 },
pages = { 5-8 },
numpages = {9},
url = { https://ijcaonline.org/archives/volume159/number4/26987-2017912899/ },
doi = { 10.5120/ijca2017912899 },
publisher = {Foundation of Computer Science (FCS), NY, USA},
address = {New York, USA}
}
%0 Journal Article
%1 2024-02-07T00:04:49.537344+05:30
%A N. V. Ganapathi Raju
%A Ch. Sadhvi
%A P. Tejaswini
%A Y. Mounica
%T Style based Authorship Attribution on English Editorial Documents
%J International Journal of Computer Applications
%@ 0975-8887
%V 159
%N 4
%P 5-8
%D 2017
%I Foundation of Computer Science (FCS), NY, USA
Abstract

The aim of the authorship attribution is identification of the author/s of unknown document(s). Every author has a unique style of writing pattern. The present paper identifies the unique style of an author(s) using lexical stylometric features. The lexical feature vectors of various authors are used in the supervised machine learning algorithms for predicting the unknown document. The highest average accuracy achieved is 97.22 using SVM algorithm.

References
  1. Zheng, Rong, et al. "A framework for authorship identification of online messages:
  2. Writing‐style features and classification techniques." Journal of the American Society for Information Science and Technology 57.3 (2006): 378-393.
  3. Cheng, Na, Rajarathnam Chandramouli, and K. P. Subbalakshmi. "Author gender identification from text." Digital Investigation 8.1 (2011): 78-88
  4. Juola, Patrick. "Authorship attribution." Foundations and Trends in information Retrieval 1.3 (2006): 233-334.
  5. Stamatatos, Efstathios, Nikos Fakotakis, and George Kokkinakis. "Automatic authorship attribution." Proceedings of the ninth conference on European chapter of the Association for Computational Linguistics. Association for Computational Linguistics, 1999.
  6. Gamon, Michael. "Linguistic correlates of style: authorship classification with deep linguistic analysis features." Proceedings of the 20th international Conference on Computational Linguistics. Association for Computational Linguistics, 2004.
  7. Stamatatos, Efstathios, Nikos Fakotakis, and Georgios Kokkinakis. "Computer-based authorship attribution without lexical measures." Computers and the Humanities 35.2 (2001): 193-214.
  8. Khachatryan, R. Kh. "Desideratum of New reality: forensic authorship attribution." Вестник Московского государственного лингвистического университета 24 (2012).
  9. McMenamin, Gerald R. "Style markers in authorship studies." International Journal of Speech Language and the Law 8.2 (2007): 93-97.
  10. Grieve, Jack. "Quantitative authorship attribution: An evaluation of techniques." Literary and linguistic computing 22.3 (2007): 251-270.
  11. Chen, Zhili, et al. "More than Word Frequencies: Authorship Attribution via Natural Frequency Zoned Word Distribution Analysis." arXiv preprint arXiv:1208.3001 (2012).
  12. Dinu, Liviu P., and Marius Popescu. "Ordinal measures in authorship identification." Proc. SEPLN. 2009.
  13. Zhao, Ying, and Justin Zobel. "Searching with style: authorship attribution in classic literature." Proceedings of the thirtieth Australasian conference on Computer science-Volume 62. Australian Computer Society, Inc., 2007.
  14. Eder, Maciej. "Style-markers in authorship attribution a cross-language study of the authorial fingerprint." Studies in Polish Linguistics 6.1 (2011): 99-114.
  15. Van Halteren, Hans, et al. "New machine learning methods demonstrate the existence of a human stylome." Journal of Quantitative Linguistics 12.1 (2005): 65-77.
Index Terms

Computer Science
Information Sciences

Keywords

Style based Classification Lexical features Function words/Stop words Authorship Attribution/Profiling