CFP last date
20 May 2024
Reseach Article

A Context Free Spell Correction Method using Supervised Machine Learning Algorithms

by Ahmed Yunus, Md Masum
International Journal of Computer Applications
Foundation of Computer Science (FCS), NY, USA
Volume 176 - Number 27
Year of Publication: 2020
Authors: Ahmed Yunus, Md Masum
10.5120/ijca2020920288

Ahmed Yunus, Md Masum . A Context Free Spell Correction Method using Supervised Machine Learning Algorithms. International Journal of Computer Applications. 176, 27 ( Jun 2020), 36-41. DOI=10.5120/ijca2020920288

@article{ 10.5120/ijca2020920288,
author = { Ahmed Yunus, Md Masum },
title = { A Context Free Spell Correction Method using Supervised Machine Learning Algorithms },
journal = { International Journal of Computer Applications },
issue_date = { Jun 2020 },
volume = { 176 },
number = { 27 },
month = { Jun },
year = { 2020 },
issn = { 0975-8887 },
pages = { 36-41 },
numpages = {9},
url = { https://ijcaonline.org/archives/volume176/number27/31370-2020920288/ },
doi = { 10.5120/ijca2020920288 },
publisher = {Foundation of Computer Science (FCS), NY, USA},
address = {New York, USA}
}
%0 Journal Article
%1 2024-02-07T00:43:37.110621+05:30
%A Ahmed Yunus
%A Md Masum
%T A Context Free Spell Correction Method using Supervised Machine Learning Algorithms
%J International Journal of Computer Applications
%@ 0975-8887
%V 176
%N 27
%P 36-41
%D 2020
%I Foundation of Computer Science (FCS), NY, USA
Abstract

Spell correction is a modern day necessity for a system that lets a user extract the proper result while searching different things. Misspelled words are highly likely to occur while typing in queries to these systems and when users misspell query, the users may get inconclusive or false information returned by the system. Spell correction can be context-free or context-sensitive based on the usage. This paper traverses a spell correction method using supervised machine learning algorithms in which the wrong word does not rely on any context. Also this paper includes the comparison between different supervised machine learning algorithms for this case and additionally provides the best case and limitation of this spell correction method.

References
  1. K. Kukich, “Techniques for automatically correcting words in text,” ACM Computing Surveys, 24(4), 377–439, 1992.
  2. R. A. Wagner and M. J. Fisher, “The string to string correction problem,” Journal of Assoc. Comp. Mach., 21(1):168-173, 1974
  3. E. J. Yannakoudakis and D. Fawthrop, “An intelligent spelling error corrector,” Information Processing and Management, 19:1, 101-108,1983.
  4. Jin-ming Zhan, Xiaolong Mou, Shuqing Li, Ditang Fang, “A Language Model in a Large-Vocabulary Speech Recognition System,” in Proc. Of Int. Conf. ICSLP98, Sydney, Australia, 1998.
  5. K. Church and W. A. Gale, “Probability scoring for spelling correction,”Statistics and Computing, Vol. 1, No. 1, pp. 93–103, 1991.
  6. Golding, Andrew R.; Roth, Dan (1999). "Journal Article". Machine Learning. SpringerLink. 34: 107–130. doi:10.1023/A:1007545901558
  7. Revised N-Gram based Automatic Spelling Correction Tool to Improve Retrieval Effectiveness,December 2009, DOI: 10.17562/PB-40-6
  8. Personalized Spell Checking using Neural Networks by Tyler Garaas, Mei Xiao, and Marc Pomplun
  9. Arabic Spelling Correction using Supervised Learning, September 2014, DOI: 10.3115/v1/W14-3615
  10. https://medium.com/@BhashkarKunal/spelling-correction-using-deep-learning-how-bi-directional-lstm-with-attention-flow-works-in-366fabcc7a2f
  11. https://englishlive.ef.com/blog/language-lab/many-words-english-language/
  12. https://www.ef.com/wwen/english-resources/english-vocabulary/top-1000-words/
  13. https://ahrefs.com/blog/top-amazon-searches/
  14. https://machinelearningmastery.com/prepare-text-data-machine-learning-scikit-learn/
  15. https://www.geeksforgeeks.org/removing-stop-words-nltk-python/
Index Terms

Computer Science
Information Sciences

Keywords

Supervised Machine Learning Tf-idf Tokenization KNeighbour Classifier Multinomial Naive Bayes Decision Tree Classifier Random Forest Classifier Logistic Regression F1-score Accuracy Precision stop words QWERTY keyboard etc.