Content based Caption Generation for Images Embedded in News Articles

International Journal of Computer Applications
© 2014 by IJCA Journal
Volume 100 - Number 11
Year of Publication: 2014
Amit Kumar Kohakade
Emmanuel M

Amit Kumar Kohakade and Emmanuel M. Article: Content based Caption Generation for Images Embedded in News Articles. International Journal of Computer Applications 100(11):7-15, August 2014. Full text available. BibTeX

	author = {Amit Kumar Kohakade and Emmanuel M},
	title = {Article: Content based Caption Generation for Images Embedded in News Articles},
	journal = {International Journal of Computer Applications},
	year = {2014},
	volume = {100},
	number = {11},
	pages = {7-15},
	month = {August},
	note = {Full text available}


In today's digital world, content-based image retrieval is becoming a critical problem as the volume of data on the Internet grows rapidly. An image embedded in a news article is typically retrieved using the words annotated to it, the text surrounding it, and similar cues. This annotation and caption generation is often done manually, which reduces accuracy, takes more time, and makes the task difficult. We propose a new approach for generating captions for such images. The approach focuses on important terms occurring in the news text, such as named entities, and uses term weighting to find the weighted terms that help describe the news. On the image side, face recognition is used to find out who is in the picture, which helps in making an accurate caption and improves image retrieval. The experiments presented here show the performance of face recognition algorithms on standard datasets and on our own face dataset; we also train an NER model on Indian names, which gives better results. Because it covers both text and image content, the approach helps generate better captions and improves image retrieval accuracy.
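The text side of this pipeline can be illustrated with a minimal sketch. The abstract does not say which term weighting scheme is used, so the smoothed TF-IDF below is an assumption chosen for illustration, not the authors' method. The function names (`tf_idf_weights`, `top_terms`, `draft_caption`) and the sample sentences are hypothetical, and the names passed to `draft_caption` are assumed to come from a separate face recognition step that is not shown.

```python
import math
from collections import Counter


def tokenize(text):
    """Lowercase the text and split it on non-alphanumeric characters."""
    tokens, word = [], []
    for ch in text.lower():
        if ch.isalnum():
            word.append(ch)
        elif word:
            tokens.append("".join(word))
            word = []
    if word:
        tokens.append("".join(word))
    return tokens


def tf_idf_weights(article, background):
    """Weight each term of `article` by smoothed TF-IDF (an assumed scheme).

    `background` is a list of other documents used to estimate how
    common each term is.
    """
    tokens = tokenize(article)
    tf = Counter(tokens)
    docs = [set(tokenize(d)) for d in background] + [set(tokens)]
    n = len(docs)
    weights = {}
    for term, count in tf.items():
        df = sum(1 for d in docs if term in d)
        idf = math.log((1 + n) / (1 + df)) + 1.0
        weights[term] = (count / len(tokens)) * idf
    return weights


def top_terms(article, background, k=3):
    """Return the k highest-weighted terms, ties broken alphabetically."""
    weights = tf_idf_weights(article, background)
    return sorted(weights, key=lambda t: (-weights[t], t))[:k]


def draft_caption(recognized_names, terms):
    """Combine names from face recognition (assumed input) with weighted terms."""
    who = " and ".join(recognized_names) if recognized_names else "Unidentified person"
    return f"{who}: {', '.join(terms)}"


article = ("Sachin Tendulkar scored a century. "
           "Tendulkar thanked the crowd after the century.")
background = ["The minister scored a political point.",
              "The crowd gathered after the match."]
terms = top_terms(article, background, k=2)
print(draft_caption(["Sachin Tendulkar"], terms))
```

Terms that are frequent in the article but rare in the background documents rise to the top, which is why common words such as "the" fall below the subject-specific terms. A real system would restrict candidates to named entities and build a grammatical sentence rather than a keyword stub.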

