International Journal of Computer Applications |
Foundation of Computer Science (FCS), NY, USA |
Volume 100 - Number 11 |
Year of Publication: 2014 |
Authors: Amit Kumar Kohakade, Emmanuel M |
10.5120/17567-8231 |
Amit Kumar Kohakade, Emmanuel M . Content based Caption Generation for Images Embedded in News Articles. International Journal of Computer Applications. 100, 11 ( August 2014), 7-15. DOI=10.5120/17567-8231
In current digital world Content based Image retrieval is becoming critical problem as size of data on Internet increasing rapidly. When the image is embedded in news article it is retrieved by manipulating words annotated to that image, text placed surrounding to that image etc. Many times this annotation, caption generation is done manually. It reduces accuracy, increases time span and makes it as tough task. We proposed a new approach for generating caption for such images. Approach presented here focuses on important terms occurring in news like named entities, using term weighting find out weighted terms which helps in describing news. On other hand by image processing we find out who's in picture as it helps in making accurate caption by using face recognition and it will increase image retrieval. Some of experiments presented here shows performance of face recognition algorithms on standard datasets and also on own developed face dataset, also we train NER model on Indian names which gives better results. As it covers text and image content it helps in generating better caption and also for improving image retrieval accuracy.