10.5120/ijca2017911900
Manika Gupta and Vineet Khanna. Syntatic Feature based Classification Algorithm to Detect Validity of Text. International Journal of Computer Applications 163(1):1-4, April 2017.
BibTeX
@article{10.5120/ijca2017911900,
  author = {Manika Gupta and Vineet Khanna},
  title = {Syntatic Feature based Classification Algorithm to Detect Validity of Text},
  journal = {International Journal of Computer Applications},
  issue_date = {April 2017},
  volume = {163},
  number = {1},
  month = {Apr},
  year = {2017},
  issn = {0975-8887},
  pages = {1-4},
  numpages = {4},
  url = {http://www.ijcaonline.org/archives/volume163/number1/27356-2017911900},
  doi = {10.5120/ijca2017911900},
  publisher = {Foundation of Computer Science (FCS), NY, USA},
  address = {New York, USA}
}
Abstract
Natural language is inherently complex and is not free from ambiguity, which makes it almost impossible to determine whether a given text makes sense. It has also become increasingly important to detect whether an input was produced by a human or by a machine. Valid, meaningful input is needed everywhere, from social media platforms to business intelligence. The classification algorithm presented here aims to detect whether a given input text is valid or was typed randomly on a keyboard. It returns a percentage value, where a lower value indicates valid text and a higher value indicates random text. The approach checks whether three features fall within their usual ranges: the proportion of unique characters, the proportion of vowels among the letters, and the word-to-character ratio (in %). It then calculates a "deviation score" from which the validity of the given input is assessed.
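The feature-and-range idea described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the abstract does not give the range bounds or the exact deviation formula, so the values in `USUAL_RANGES`, the helper names `features`, `deviation`, and `randomness_score`, and the choice to average the per-feature deviations are all assumptions made for illustration.

```python
# Hypothetical "usual" ranges in percent. These bounds are illustrative
# assumptions; the abstract does not state the values used in the paper.
USUAL_RANGES = {
    "unique_chars": (10.0, 70.0),
    "vowels": (25.0, 55.0),
    "word_char_ratio": (12.0, 30.0),
}

VOWELS = set("aeiou")


def features(text):
    """Return the three percentages named in the abstract, or None if
    the text contains no letters to measure."""
    chars = [c for c in text.lower() if not c.isspace()]
    letters = [c for c in chars if c.isalpha()]
    words = text.split()
    if not letters:
        return None
    return {
        "unique_chars": 100.0 * len(set(chars)) / len(chars),
        "vowels": 100.0 * sum(c in VOWELS for c in letters) / len(letters),
        "word_char_ratio": 100.0 * len(words) / len(chars),
    }


def deviation(value, low, high):
    """Percentage by which value lies outside [low, high]; 0 if inside."""
    if value < low:
        return 100.0 * (low - value) / low
    if value > high:
        return 100.0 * (value - high) / high
    return 0.0


def randomness_score(text):
    """Assumed scoring rule: mean of the per-feature deviations, capped
    at 100. Lower means more likely valid, higher more likely random."""
    feats = features(text)
    if feats is None:
        return 100.0
    devs = [deviation(feats[name], low, high)
            for name, (low, high) in USUAL_RANGES.items()]
    return min(100.0, sum(devs) / len(devs))


if __name__ == "__main__":
    print(randomness_score("This is a simple sentence written by a human being"))
    print(randomness_score("sdfkjhsdfkjhqwrtzxcv"))
```

With these assumed ranges, an ordinary sentence stays inside all three ranges, while a keyboard mash with no vowels and no word breaks falls well outside two of them. Very short inputs tend to have a high share of unique characters, so in practice the ranges would likely need to be tuned on real text and possibly adjusted for input length.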
Keywords
Data mining; text mining; text classification; sentence validation; pattern learning