Van-Quang Nguyen, Masanori Suganuma, Takayuki Okatani (2022), “GRIT: Faster and Better Image Captioning Transformer Using Dual Visual Features”, European Conference on Computer Vision, July 2022.