Skip to content

Latest commit

 

History

History
53 lines (36 loc) · 2.31 KB

README.md

File metadata and controls

53 lines (36 loc) · 2.31 KB

About the Project:

Problem : Question Answering for the Medical Images

Implementation: I have taken inspiration from Stacked Attention Network, and the slides as mentioned above, and implemented in Tensorflow 2.0, however, I have made some changes, because I did not understand those things, will improve in the near future.

Setup:

GOOGLE COLAB: What you need to do is:

Download :

    1. trainset.json
    2. testset.json
    3. VQA Image Folder
    4. Cache Folder  (contains the pickle file, for converting the answers to labels, and vice versa, and the mapping for dictionary and answer)

Upload these projects to your google drive, and then follow the instructions that are present in the VQA.ipynb notebook.

Results:

alt text alt text

📺 Some of my projects:

⚡ GitHub Stats

Anurag's github stats