Multi Modal Paper Conference VATEX:A Large-Scale, High-Quality Multilingual Dataset for Video-and-Language Research ICCV19 Multimodal Abstractive Summarization for How2 Videos ACL19 How2:A Large-scale Dataset for Multimodal Language Understanding NIPS18