Content based Caption Generation for Images Embedded in News Articles

Amit Kumar Kohakade; Emmanuel M

Call for Paper

August Edition

IJCA solicits high quality original research papers for the upcoming August edition of the journal. The last date of research paper submission is 21 July 2025

Submit your paper

Know more

The week's pick

Navigating the Future of Cybersecurity: A Strategic Approach to Crypto Agility for Modern Enterprises

Aditya Gupta

Random Articles

Concept based Ranking of Results using an Ontology and Fuzzy Network for a Personalized Web Search Engine

November

2013

Comparing Soft Computing Techniques for Estimating Demand of Season Ticket Holders

Oct

2018

Distance Field based Haptic Rendering of Scattered Oriented Points

January

2013

Application Development Feasibility: DevOps or SRE?

Aug

2023

Reseach Article

Content based Caption Generation for Images Embedded in News Articles

by Amit Kumar Kohakade, Emmanuel M

International Journal of Computer Applications

Foundation of Computer Science (FCS), NY, USA

Volume 100 - Number 11

Year of Publication: 2014

Authors: Amit Kumar Kohakade, Emmanuel M

10.5120/17567-8231

Amit Kumar Kohakade, Emmanuel M . Content based Caption Generation for Images Embedded in News Articles. International Journal of Computer Applications. 100, 11 ( August 2014), 7-15. DOI=10.5120/17567-8231

@article{ 10.5120/17567-8231,

author = { Amit Kumar Kohakade, Emmanuel M },

title = { Content based Caption Generation for Images Embedded in News Articles },

journal = { International Journal of Computer Applications },

issue_date = { August 2014 },

volume = { 100 },

number = { 11 },

month = { August },

year = { 2014 },

issn = { 0975-8887 },

pages = { 7-15 },

numpages = {9},

url = { https://ijcaonline.org/archives/volume100/number11/17567-8231/ },

doi = { 10.5120/17567-8231 },

publisher = {Foundation of Computer Science (FCS), NY, USA},

address = {New York, USA}

}

%0 Journal Article

%1 2024-02-06T22:29:40.759890+05:30

%A Amit Kumar Kohakade

%A Emmanuel M

%T Content based Caption Generation for Images Embedded in News Articles

%J International Journal of Computer Applications

%@ 0975-8887

%V 100

%N 11

%P 7-15

%D 2014

%I Foundation of Computer Science (FCS), NY, USA

Abstract

In current digital world Content based Image retrieval is becoming critical problem as size of data on Internet increasing rapidly. When the image is embedded in news article it is retrieved by manipulating words annotated to that image, text placed surrounding to that image etc. Many times this annotation, caption generation is done manually. It reduces accuracy, increases time span and makes it as tough task. We proposed a new approach for generating caption for such images. Approach presented here focuses on important terms occurring in news like named entities, using term weighting find out weighted terms which helps in describing news. On other hand by image processing we find out who's in picture as it helps in making accurate caption by using face recognition and it will increase image retrieval. Some of experiments presented here shows performance of face recognition algorithms on standard datasets and also on own developed face dataset, also we train NER model on Indian names which gives better results. As it covers text and image content it helps in generating better caption and also for improving image retrieval accuracy.

References

Aleix M. MartõÂnez, Avinash C. Kak. 2001. "PCA versus LDA". IEEE Transactions On Pattern Analysis And Machine Intelligence. Vol. 23. no. 2. pp. 228-233.
Allan Hanbury. 2008. "A survey of methods for image annotation". Journal of Visual Languages and Computing Elsevier. Vol. 19, Issue 5. pp. 617–627.
Benjamin Z. Yao, Xiong Yang, Liang Lin, Mun Wai Lee, and Song-Chun Zhu. 2010. "I2T: Image Parsing to Text Description". Proceedings of the IEEE. Vol. 98. no. 8.
Bhattacharyya, Suman Kumar, and Kumar Rahul. 2013. "Face Recognition By Linear Discriminant Analysis". International Journal of Communication Network Security 2. Vol. 2. Issue 2. pp. 2231-1882.
Claire Gardent , Benjamin Got T E Sman, Laura Perez-Beltrachini. 2011. "Using Regular Tree Grammars to enhance Sentence Realisation". Published in "Natural Language Engineering 17. pp. 185-201.
Girish Kulkarni, Visruth Premraj, Sagnik Dhar, Siming Li, Yejin Choi, Alexander C Berg, Tamara L Berg. 2011. "Baby Talk: Understanding and Generating Image Descriptions". IEEE Conference on Computer Vision and Pattern Recognition (CVPR). pp. 1601 – 1608.
Hemant Singh Mittal, Harpreet Kaur. 2013. "Face Recognition Using PCA & Neural Network". International Journal of Emerging Science and Engineering (IJESE). Vol. 1. Issue 6. pp. 71-75.
http://nlp. stanford. edu/software/CRF-ER. shtml, 15/05/2014
http://vis-www. cs. umass. edu/~vidit/AI/dbase. html, 27/04/2014
http://www. cl. cam. ac. uk/research/dtg/attarchive/facedatabase. html, 25/04/2014
https://gate. ac. uk/, 10/052014
https://opennlp. apache. org/, 20/05/2014
Ilia Smirnov. (2008). "Overview of Stemming Algorithms". Mechanical Translation.
J. Jeon, V. Lavrenko and R. Manmatha. 2003. "Automatic Image Annotation and Retrieval using CrossMedia Relevance Models". Proceedings of the 26th annual international ACM SIGIR conference on Research and development in informaion retrieval.
Jia-Yu Pan, Hyung-Jeong Yang, Pinar Duygulu and Christos Faloutsos. 2004. "Automatic Image Captioning". IEEE International Conference on Multimedia and Expo, ICME '0. , Vol. 3. pp. 1987-1990.
Kim, Kyungnam. 1996. "Face recognition using principle component analysis". In International Conference on Computer Vision and Pattern Recognition. pp. 586-591.
L. Ferres, A. Parush, S. Roberts, and G. Lindgaard. 2006. "Helping People with Visual Impairments Gain Access to Graphical Information through Natural Language: The igraph System". Proc. 11th Int'l Conference On Computers Helping People with Special Needs. pp. 1122-1130.
M. -H. Yan, D. Kriegman, and N. Ahuja. 2002. "Detecting faces in images: A survey". IEEE Transaction on Pattern Analysis and Machine Intelligence 24. no. 1. pp. 34-58.
Man Lan, Chew Lim Tan, Jian Su. 2009. "Supervised and Traditional Term Weighting Methods for Automatic Text Categorization". IEEE Transactions on Pattern Analysis and Machine Intelligence. Vol. 31. Issue 4. pp. 21-735.
Michele Banko , Vibhu O. Mittal , Michael J. Witbrock. 2000. "Headline Generation Based on Statistical Translation". Proceedings of the 38th Annual Meeting on Association for Computational Linguistics. pp. 318-325.
Mr. Saurabh M Khatri, Emmanuel M. , Dr. Ramesh Babu D. R. . 2013. "A Novel scheme for Term Weighting in Text Categorization: Positive Impact factor". IEEE International Conference on Systems, Man, and Cybernetics (SMC). pp. 2292-2297.
Phillip Ian Wilson, Dr. John Fernandez. 2006. "Facial Feature Detection Using Haar Classifiers". Journal of Computing Sciences in Colleges archive, Vol. 21. Issue 4. pp 127-133.
Roi Blanco, Christina Lioma. 2012. "Graph-based term weighting for information retrieval". Journal Information Retrieval. Vol. 15. Issue 1. pp. 54-92.
Ruichao Wang, John Dunnion, Joe Carthy. 2005. "Machine Learning Approach To Augmenting News Headline Generation". In Proceedings of the International Joint Conference on Natural Language Processing.
Shengcai Liao, Anil K. Jain, Fello and Stan Z. Li. 2013. "Partial Face Recognition: Alignment-Free Approach". IEEE Transaction on Pattern Analysis and Machine Intelligence. Vol. 35. Issue 5. pp. 1193-1205.
V. Lavrenko, R. Manmatha, and J. Jeon. 2003. "A Model for Learning the Semantics of Pictures". Proc. 16th Conf. On Advances in Neural Information Processing Systems.
Yansong Feng, Member and Mirella Lapata. 2013. "Automatic Caption Generation for News Images". IEEE Transactions on Pattern Analysis and Machine Intelligence. Vol. 35. Issue 4. pp. 797-812
Zenghai Chen, Hong Fu, Zheru Chi and David Dagan Feng. 2012. "An Adaptive Recognition Model for Image Annotation", IEEE Transactions on Systems, Man, and Cybernetic Part C: Applications and Reviews. Vol. 42. Issue 6. pp. 1120-1127.

Index Terms

Computer Science

Information Sciences

Keywords

Caption generation Name entity recognition Text Processing Face Recognition.