Vision+Language

Grounded language learning by aligning text and video

People

Henry Kautz, Daniel Gildea, Jiebo Luo

Graduate Students: Young Song Chol, Iftekhar Naim, Qiguang Liu

Undergraduate Students: Khezan Chen

Support

ISTCPC: Intel Science & Technology Center for Pervasive Computing

Developing the next generation of pervasive computing systems.

Papers

Young Chol Song, Iftekhar Naim, Abdullah Al Mamun, Kaustubh Kulkarniy, Parag Singlay, Jiebo Luo, Daniel Gildea, Henry Kautz. Unsupervised Alignment of Actions in Video with Text Descriptions. 25th International Joint Conference on Artificial Intelligence (IJCAI-16), New York, NY.

Iftekhar Naim, Young C. Song, Qiguang Liu, Liang Huang, Henry Kautz, Jiebo Luo and Daniel Gildea. Discriminative Unsupervised Alignment of Natural Language Instructions with Corresponding Video Segments, 2015 Conference of the North American Chapter of the Association for Computational Linguistics – Human Language Technologies (NAACL HLT 2015), Denver, Colorado, 2015.

Iftekhar Naim, Young Chol Song, Qiguang Liu, Henry Kautz, Jiebo Luo, and Daniel Gildea (2014). Unsupervised Alignment of Natural Language Instructions with Video Segments. 28th AAAI Conference on Artificial Intelligence (AAAI-14), Quebec City, Canada, 2014.

Danning Zheng, Tianran Hu, Quanzeng You, Henry Kautz, and Jiebo Luo. Inferring Home Location from User's Photo Collections based on Visual Content and Mobility Patterns. ACM Multimedia Conference, Workshop on Geotagging in Multimedia (GeoMM),November 2014.

Young Chol Song, Henry Kautz, James Allen, Mary Swift, Yuncheng Li, Jiebo Luo, Ce Zhang (2013). A Markov Logic Framework for Recognizing Complex Events from Multimodal Data. 15th ACM International Conference on Multimodal Interaction (ICMI 2013), Sydney, Australia.

Back