Abstract: The diversity of VQA questions brings new challenges for VQA models in predicting the answer. Existing models focus on the construction of new attention mechanisms and object recognition, but ...