Suivre
YoungJae Yu
YoungJae Yu
Allen Institute for AI, Yonsei University
Adresse e-mail validée de yonsei.ac.kr - Page d'accueil
Titre
Citée par
Citée par
Année
Tgif-qa: Toward spatio-temporal reasoning in visual question answering
Y Jang, Y Song, Y Yu, Y Kim, G Kim
Proceedings of the IEEE conference on computer vision and pattern …, 2017
5072017
A joint sequence fusion model for video question answering and retrieval
Y Yu, J Kim, G Kim
Proceedings of the European conference on computer vision (ECCV), 471-487, 2018
3462018
Merlot: Multimodal neural script knowledge models
R Zellers, X Lu, J Hessel, Y Yu, JS Park, J Cao, A Farhadi, Y Choi
Advances in Neural Information Processing Systems 34, 23634-23651, 2021
3212021
End-to-end concept word detection for video captioning, retrieval, and question answering
Y Yu, H Ko, J Choi, G Kim
Proceedings of the IEEE conference on computer vision and pattern …, 2017
288*2017
Merlot reserve: Neural script knowledge through vision and language and sound
R Zellers, J Lu, X Lu, Y Yu, Y Zhao, M Salehi, A Kusupati, J Hessel, ...
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2022
1922022
Neurologic a* esque decoding: Constrained text generation with lookahead heuristics
X Lu, S Welleck, P West, L Jiang, J Kasai, D Khashabi, RL Bras, L Qin, ...
arXiv preprint arXiv:2112.08726, 2021
1172021
Supervising neural attention models for video captioning by human gaze data
Y Yu, J Choi, Y Kim, K Yoo, SH Lee, G Kim
Proceedings of the IEEE conference on computer vision and pattern …, 2017
852017
Parameter efficient multimodal transformers for video representation learning
S Lee, Y Yu, G Kim, T Breuel, J Kautz, Y Song
arXiv preprint arXiv:2012.04124, 2020
832020
Multimodal c4: An open, billion-scale corpus of images interleaved with text
W Zhu, J Hessel, A Awadalla, SY Gadre, J Dodge, A Fang, Y Yu, ...
Advances in Neural Information Processing Systems 36, 2024
712024
Soda: Million-scale dialogue distillation with social commonsense contextualization
H Kim, J Hessel, L Jiang, P West, X Lu, Y Yu, P Zhou, RL Bras, M Alikhani, ...
arXiv preprint arXiv:2212.10465, 2022
702022
A memory network approach for story-based temporal summarization of 360 videos
S Lee, J Sung, Y Yu, G Kim
Proceedings of the IEEE conference on computer vision and pattern …, 2018
682018
Dual compositional learning in interactive image retrieval
J Kim, Y Yu, H Kim, G Kim
Proceedings of the AAAI Conference on Artificial Intelligence 35 (2), 1771-1779, 2021
662021
Prosocialdialog: A prosocial backbone for conversational agents
H Kim, Y Yu, L Jiang, X Lu, D Khashabi, G Kim, Y Choi, M Sap
arXiv preprint arXiv:2205.12688, 2022
632022
Pano-avqa: Grounded audio-visual question answering on 360deg videos
H Yun, Y Yu, W Yang, K Lee, G Kim
Proceedings of the IEEE/CVF International Conference on Computer Vision …, 2021
542021
Symbolic chain-of-thought distillation: Small models can also" think" step-by-step
LH Li, J Hessel, Y Yu, X Ren, KW Chang, Y Choi
arXiv preprint arXiv:2306.14050, 2023
522023
A deep ranking model for spatio-temporal highlight detection from a 360◦ video
Y Yu, S Lee, J Na, J Kang, G Kim
Proceedings of the AAAI Conference on Artificial Intelligence 32 (1), 2018
472018
Video question answering with spatio-temporal reasoning
Y Jang, Y Song, CD Kim, Y Yu, Y Kim, G Kim
International Journal of Computer Vision 127, 1385-1412, 2019
442019
TimesVector: a vectorized clustering approach to the analysis of time series transcriptome data from multiple phenotypes
I Jung, K Jo, H Kang, H Ahn, Y Yu, S Kim
Bioinformatics 33 (23), 3827-3835, 2017
372017
Augmenting data for sarcasm detection with unlabeled conversation context
H Lee, Y Yu, G Kim
arXiv preprint arXiv:2006.06259, 2020
322020
Curlingnet: Compositional learning between images and text for fashion iq data
Y Yu, S Lee, Y Choi, G Kim
arXiv preprint arXiv:2003.12299, 2020
292020
Le système ne peut pas réaliser cette opération maintenant. Veuillez réessayer plus tard.
Articles 1–20