Exploring the Field of Text Mining

International Journal of Computer Applications
Foundation of Computer Science (FCS), NY, USA
Year of Publication: 2017
Radha Guha

Radha Guha. Exploring the Field of Text Mining. International Journal of Computer Applications 177(4):11-17, November 2017. BibTeX

	author = {Radha Guha},
	title = {Exploring the Field of Text Mining},
	journal = {International Journal of Computer Applications},
	issue_date = {November 2017},
	volume = {177},
	number = {4},
	month = {Nov},
	year = {2017},
	issn = {0975-8887},
	pages = {11-17},
	numpages = {7},
	url = {},
	doi = {10.5120/ijca2017915682},
	publisher = {Foundation of Computer Science (FCS), NY, USA},
	address = {New York, USA}


Text mining is the technique of automatically deducing non-obvious but statistically supported novel information from text data sources written in natural languages. In today's era of big data and cloud computing, huge amounts of text data are generated online. Text mining is therefore becoming essential for extracting business intelligence, as the volume of data generated on the internet grows exponentially. Next-generation computing will see text mining alongside other disruptive technologies such as the semantic web, mobile computing, big data, and cloud computing. For text mining to be most effective, proven techniques need to be developed. Although structured data mining is an active and mature field, unstructured text mining has only recently emerged, and its challenges differ from those of structured data analytics. In this paper, I survey text mining techniques and a range of interesting and important applications of text mining that can increase business revenue. I give several examples of text mining to show how it can help extract business intelligence. Using text mining and machine learning techniques, new challenges in extracting business intelligence from text data can be solved effectively.




Keywords: Text mining, business intelligence (BI), unstructured data, data analytics, automatic text summary.