English

Image Caption

Name: Image Caption
Price: 3863 INR
Availability: InStock
ISBN: 9786207647606

Meenatchi R

English

Paperback

₹3863

₹4067

5.02% OFF

(All inclusive*)

Delivery Options

Please enter pincode to check delivery time.

*COD & Shipping Charges may apply on certain items.

Review final details at checkout.

Looking to place a bulk order? SUBMIT DETAILS

Piracy-free

Assured Quality

Secure Transactions

About The Book

Description

Author

Image captioning with audio has emerged as a challenging yet promising task in the field of deep learning. This paper proposes a novel approach to address this task by integrating convolutional neural networks (CNNs) for image feature extraction and recurrent neural networks (RNNs) for sequential audio analysis. Specifically we leverage pre-trained CNNs such as VGG to extract visual features from images and employ spectrogram representations coupled with RNNs such as LSTM or GRU to process audio inputs. Our proposed model based not only on their visual content but also on accompanying audio cues. We evaluate the performance of our model on benchmark datasets and demonstrate its effectiveness in generating coherent and contextually relevant captions for images with corresponding audio inputs. Additionally we conduct tablation studies to analyze the contribution of each modality to the overall captioning performance our results show that the fusion of visual and auditory modalities significantly improves captioning quality compared to using either modality in isolation.

Piracy-free

Assured Quality

Secure Transactions

Delivery Options

Please enter pincode to check delivery time.

*COD & Shipping Charges may apply on certain items.

Review final details at checkout.

Details

ISBN 13

9786207647606

Publication Date

16-05-2024

Pages

Weight

96 grams

Dimensions

152x229x3.89 mm

Publisher

LAP LAMBERT Academic Publishing

Details

LOOKING TO PLACE A BULK ORDER?CLICK HERE