Skip to content

Vision-Language, Solve GQA(Visual Reasoning in the Real World) dataset.

Notifications You must be signed in to change notification settings

leaderj1001/Vision-Language

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

51 Commits
 
 
 
 
 
 

Repository files navigation

GQA: Visual Reasoning in the Real World

Data structure

├── Question Number
    ├── Annotations
    |   ├── answer
    |   ├── full Answer
    |   └── question
    │   
    ├── answer
    ├── entailed
    ├── equivalent
    ├── fullAnswer
    ├── groups
    ├── imageId
    ├── isBalanced
    ├── question
    ├── semantic
    ├── semanticStr
    └── types
        ├── detailed
        ├── semantic
        └── structural
  • answer
  • imageId
  • question

Network Architecture

캡처

Image-Question Aggregator

캡처2

Requirements

  • tensorflow-gpu==1.13.1
  • numpy==1.16.2
  • tensorflow-hub==0.4.0
  • python==3.7.3
  • cv2==4.0.0
  • tqdm==4.31.1

About

Vision-Language, Solve GQA(Visual Reasoning in the Real World) dataset.

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published