CFP last date
22 April 2024
Reseach Article

The Search of Non-Standard Words in the Documents Written in Indonesian Language with Nazief and Adriani Algorithm

by Dewi Soyusiawaty, Oko Carono
International Journal of Computer Applications
Foundation of Computer Science (FCS), NY, USA
Volume 175 - Number 39
Year of Publication: 2020
Authors: Dewi Soyusiawaty, Oko Carono
10.5120/ijca2020920952

Dewi Soyusiawaty, Oko Carono . The Search of Non-Standard Words in the Documents Written in Indonesian Language with Nazief and Adriani Algorithm. International Journal of Computer Applications. 175, 39 ( Dec 2020), 26-32. DOI=10.5120/ijca2020920952

@article{ 10.5120/ijca2020920952,
author = { Dewi Soyusiawaty, Oko Carono },
title = { The Search of Non-Standard Words in the Documents Written in Indonesian Language with Nazief and Adriani Algorithm },
journal = { International Journal of Computer Applications },
issue_date = { Dec 2020 },
volume = { 175 },
number = { 39 },
month = { Dec },
year = { 2020 },
issn = { 0975-8887 },
pages = { 26-32 },
numpages = {9},
url = { https://ijcaonline.org/archives/volume175/number39/31709-2020920952/ },
doi = { 10.5120/ijca2020920952 },
publisher = {Foundation of Computer Science (FCS), NY, USA},
address = {New York, USA}
}
%0 Journal Article
%1 2024-02-07T00:40:51.970425+05:30
%A Dewi Soyusiawaty
%A Oko Carono
%T The Search of Non-Standard Words in the Documents Written in Indonesian Language with Nazief and Adriani Algorithm
%J International Journal of Computer Applications
%@ 0975-8887
%V 175
%N 39
%P 26-32
%D 2020
%I Foundation of Computer Science (FCS), NY, USA
Abstract

Indonesian language has a variety of affixed words used in a document. The words or sentences in a document must be written based on the Great Dictionary of Indonesian Language (KBBI). Errors often occur when writing a word in the document such as errors in writing the standard words. To find out the standard and non-standard forms of an affixed word needs the root. One of methods to find the root of an affixed word is by using Nazief & Adriani Stemming Algorithm. Searching the root words in a document by checking them one by one will take a long time and is not efficient. Therefore, an application that can search the root words is required to make the quick and more efficient search. This research is an implementation of the search of the root and standard words in the documents written in Indonesian language to ease in determining the standard and non-standard words. The method used is by checking the words in the documents then implementing Nazief & Adriani algorithm to find out the root words then checking in KBBI to determine the non-standard words and implementing spell checker method to recommend the standard ones. The testing used in this research is the accuracy testing by using 50 documents written in Indonesian language with 28,023 numbers of words and the result of the accuracy testing is 96.74%.

References
  1. Asian, J., Williams, H. E., & Tahaghoghi, S. M. M. (2007). Stemming Indonesian A confix-stripping Approach. Conferences in Research and Practice in Information Technology Series, 38(January), 307–314. https://doi.org/10.1145/1316457.1316459
  2. Bhaire, V. V., Jadhav, A. A., Pashte, P. A., & P.G, M. M. (2015). Spell check. Nursing Times, 5(4), 38–40. https://doi.org/10.12968/sece.2007.5.260
  3. Chaer, Abdul. (2011). KesantunanBerbahasa. Jakarta: RinekaCipta.
  4. Dini Nopiyanti, K. A. S. (2014). Aplikasi pencarian kata dasar dokumen berbahasa indonesia dengan metode stemming porter menggunakan php & mysql. Kommit, 8(Kommit), 215–222.
  5. Herwijayanti, B., Ratnawati, D. E., & Muflikhah, L. (2018). Klasifikasi berita online denganmenggunakan pembobotan TF-IDF dan cosine similarity. Jurnal PengembanganTeknologi Informasi Dan Ilmu Komputer, 2(1), 306–312.
  6. Joseph, S. R., Hloman, H., Letsholo, K., & Sedimo, K. (2016). Natural Language Processing: A Review. International Journal of Research in Engineering and Applied Sciences, 6(3), 1–8.
  7. Juang, D. (2016). Analisis spam dengan menggunakan naïve bayes. Jurnal Teknovasi, 03(1998), 51–57.
  8. Kosasih, E. dan Hermawan, Wawan. (2012). Bahasa Indonesia Berbasis Kepenulisan Karya Ilmiah dan Jurnal. Bandung: Thursina.
  9. Oeyliawan, R. F., & Gunawan, D. (2017). Aplikasi rekomendasi buku pada katalogperpustakaan Universitas Multimedia Nusantara menggunakan vector spacemodel. ULTIMATICS, Vol. IX, 97–105.
  10. Riyanto, (2014). VALIDASI & VERIFIKASI METODE UJI. Yogyakarta. Deeppublish.
  11. Sulhan, M., & Kurniawan, R. (2014). Metode Stemming Sebagai Preprocessing Pada Filter Kata Porno Melalui Aspek Pendidikan. Seminar Nasional Teknologi Informasi Dan Komunikasi, 2014(Sentika), 52–60.
  12. Soyusiawaty, Dewi, Anna Hendri Soleliza Jones, and Nora Lestari Lestariw. 2020. “The Stemming Application on Affixed Javanese Words by Using Nazief and Adriani Algorithm.” IOP Conference Series: Materials Science and Engineering 771(1).
  13. Wibowo, J. (2016). Aplikasi Penentuan Kata Dasar Berimbuhan Pada Kalimat Bahasa Indonesia Dengan Algoritma Stemming. Jurnal Riset Komputer (JURIKOM), 3(5), 346–350.
Index Terms

Computer Science
Information Sciences

Keywords

Indonesian language Nazief & Adriani Algorithm Non-Standard Words Stemming