We use a multilingual dataset, mainly including the COCO2017 dataset and the AI Challenger image Chinese description dataset: ...