Video summarization using deep semantic features M Otani, Y Nakashima, E Rahtu, J Heikkilä, N Yokoya Computer Vision–ACCV 2016: 13th Asian Conference on Computer Vision, Taipei …, 2017 | 131 | 2017 |
Rethinking the Evaluation of Video Summaries M Otani, Y Nakashima, E Rahtu, J Heikkilä IEEE Computer Society Conference on Computer Vision and Pattern Recognition …, 2019 | 117 | 2019 |
Learning joint representations of videos and sentences with web image search M Otani, Y Nakashima, E Rahtu, J Heikkilä, N Yokoya European Conference on Computer Vision Workshop, 651-667, 2016 | 97 | 2016 |
Bert representations for video question answering Z Yang, N Garcia, C Chu, M Otani, Y Nakashima, H Takemura Proceedings of the IEEE/CVF Winter Conference on Applications of Computer …, 2020 | 90 | 2020 |
KnowIT VQA: Answering knowledge-based questions about videos N Garcia, M Otani, C Chu, Y Nakashima Proceedings of the AAAI conference on artificial intelligence 34 (07), 10826 …, 2020 | 64 | 2020 |
Uncovering Hidden Challenges in Query-Based Video Moment Retrieval M Otani, Y Nakashima, E Rahtu, J Heikkilä British Machine Vision Conference, 2020 | 41 | 2020 |
Constrained graphic layout generation via latent optimization K Kikuchi, E Simo-Serra, M Otani, K Yamaguchi Proceedings of the 29th ACM International Conference on Multimedia, 88-96, 2021 | 34 | 2021 |
A dataset and baselines for visual question answering on art N Garcia, C Ye, Z Liu, Q Hu, M Otani, C Chu, Y Nakashima, T Mitamura Computer Vision–ECCV 2020 Workshops: Glasgow, UK, August 23–28, 2020 …, 2020 | 27 | 2020 |
Alleviating cold-start problems in recommendation through pseudo-labelling over knowledge graph R Togashi, M Otani, S Satoh Proceedings of the 14th ACM international conference on web search and data …, 2021 | 25 | 2021 |
Video summarization using textual descriptions for authoring video blogs M Otani, Y Nakashima, T Sato, N Yokoya Multimedia Tools and Applications 76, 12097-12115, 2017 | 14 | 2017 |
LayoutDM: Discrete Diffusion Model for Controllable Layout Generation N Inoue, K Kikuchi, E Simo-Serra, M Otani, K Yamaguchi Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2023 | 11 | 2023 |
A comparative study of language transformers for video question answering Z Yang, N Garcia, C Chu, M Otani, Y Nakashima, H Takemura Neurocomputing 445, 121-133, 2021 | 11 | 2021 |
iParaphrasing: Extracting Visually Grounded Paraphrases via an Image C Chu, M Otani, Y Nakashima International Conference on Computational Linguistics, 3479–3492, 2018 | 11 | 2018 |
Does robustness on imagenet transfer to downstream tasks? Y Yamada, M Otani Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2022 | 10 | 2022 |
Textual description-based video summarization for video blogs M Otani, Y Nakashima, T Sato, N Yokoya 2015 IEEE International Conference on Multimedia and Expo (ICME), 1-6, 2015 | 10 | 2015 |
Modeling visual containment for web page layout optimization K Kikuchi, M Otani, K Yamaguchi, E Simo‐Serra Computer Graphics Forum 40 (7), 33-44, 2021 | 9 | 2021 |
The laughing machine: predicting humor in video Y Kayatani, Z Yang, M Otani, N Garcia, C Chu, Y Nakashima, H Takemura Proceedings of the IEEE/CVF Winter Conference on Applications of Computer …, 2021 | 8 | 2021 |
Transferring domain-agnostic knowledge in video question answering T Wu, N Garcia, M Otani, C Chu, Y Nakashima, H Takemura arXiv preprint arXiv:2110.13395, 2021 | 6 | 2021 |
Visual question answering with textual representations for images Y Hirota, N Garcia, M Otani, C Chu, Y Nakashima, I Taniguchi, T Onoye Proceedings of the IEEE/CVF International Conference on Computer Vision …, 2021 | 5 | 2021 |
Video colorization based on optical flow and edge-oriented color propagation M Otani, H Hioki Computational Imaging XII 9020, 902002, 2014 | 5 | 2014 |