The major contributions of this paper are as follows: We propose MedFuseNet, an attention based multimodal deep learning model for answer categorization and answer generation tasks in medical domain ...
Video Question Answering (Video QA) integrates computer vision and natural language processing to enable systems to answer free-form or multiple-choice questions about dynamic visual content. Central ...