Image captioning has grown in importance in the digital age: automated systems now generate and deliver captions for images using deep neural network models. Image captioning is the process of producing a textual description of an image. It requires understanding the image's key elements, extracting their features, and recognizing the relationships between them, and it must produce sentences that are both grammatically and semantically correct. In this research, we introduce a deep learning model, based on computer vision and machine translation, that describes images and generates captions for them. The goal of this study is to detect several objects in an image, recognize the relationships between those objects, and produce a caption. The proposed experiment uses transfer learning with the Xception model together with a CNN and an LSTM to generate the image caption. Large datasets and substantial processing capacity are useful for developing models that automatically create captions for images; these capabilities are included in our Python-based application. Image caption generators can also be applied to video frames, so users could receive automatic captions when such systems are deployed on social media or other applications. They can do the work of a human interpreter in a matter of minutes, and the technology has the potential to greatly aid visually impaired individuals.
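The encoder-decoder pipeline outlined above can be sketched as follows. This is a minimal illustrative sketch, not the paper's actual implementation: the Xception encoder and the trained CNN-LSTM decoder are replaced by hypothetical stubs (`extract_features`, `decoder_step`), and all names, shapes, and the canned vocabulary are assumptions made for illustration only.

```python
# Minimal sketch of the encoder-decoder captioning loop described above.
# The Xception encoder and the trained LSTM decoder are stubbed out;
# all names and shapes here are illustrative, not the paper's code.

def extract_features(image):
    """Stand-in for a pretrained Xception encoder (transfer learning):
    a real model would return a 2048-dim feature vector for the image."""
    return [0.0] * 2048  # dummy feature vector

def decoder_step(features, partial_caption):
    """Stand-in for the CNN-LSTM decoder: given image features and the
    tokens generated so far, predict the next word. A real decoder would
    run an LSTM over the partial caption and merge it with the features."""
    canned = ["a", "dog", "runs", "on", "grass", "<end>"]  # fake vocabulary
    return canned[min(len(partial_caption) - 1, len(canned) - 1)]

def generate_caption(image, max_len=20):
    """Greedy decoding: start from <start>, repeatedly predict the next
    word until <end> or the length limit is reached."""
    features = extract_features(image)
    caption = ["<start>"]
    for _ in range(max_len):
        word = decoder_step(features, caption)
        caption.append(word)
        if word == "<end>":
            break
    return " ".join(caption[1:-1])  # drop the <start>/<end> markers

print(generate_caption("example.jpg"))  # → a dog runs on grass
```

In a real system, `extract_features` would be Xception with its classification head removed, and `decoder_step` would embed the partial caption, pass it through an LSTM, and combine the result with the image features before predicting the next word.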