Generating natural language adversarial examples through probability weighted word saliency
S Ren, Y Deng, K He, W Che
ACL 2019, 2019
Cited by: 655

M³IT: A Large-Scale Dataset towards Multi-Modal Multilingual Instruction Tuning
L Li, Y Yin, S Li, L Chen, P Wang, S Ren, M Li, Y Yang, J Xu, X Sun, ...
arXiv preprint arXiv:2306.04387, 2023
Cited by: 59*

CascadeBERT: Accelerating Inference of Pre-trained Language Models via Calibrated Complete Models Cascade
L Li, Y Lin, D Chen, S Ren, P Li, J Zhou, X Sun
Findings of EMNLP 2021, 2021
Cited by: 41*

Dynamic Knowledge Distillation for Pre-trained Language Models
L Li, Y Lin, S Ren, P Li, J Zhou, X Sun
EMNLP 2021, 2021
Cited by: 33

Text AutoAugment: Learning Compositional Augmentation Policy for Text Classification
S Ren, J Zhang, L Li, X Sun, J Zhou
EMNLP 2021, 2021
Cited by: 27

Learning Relation Alignment for Calibrated Cross-modal Retrieval
S Ren, J Lin, G Zhao, R Men, A Yang, J Zhou, X Sun, H Yang
ACL 2021, 2021
Cited by: 25

Delving into the Openness of CLIP
S Ren, L Li, X Ren, G Zhao, X Sun
Findings of ACL 2023, 2022
Cited by: 15*

DCA: Diversified Co-Attention towards Informative Live Video Commenting
Z Zhang, Z Yin, S Ren, X Li, S Li
NLPCC 2020, 2020
Cited by: 14

Prompt Pre-Training with Twenty-Thousand Classes for Open-Vocabulary Visual Recognition
S Ren, A Zhang, Y Zhu, S Zhang, S Zheng, M Li, A Smola, X Sun
NeurIPS 2023, 2023
Cited by: 13

PCA-Bench: Evaluating Multimodal Large Language Models in Perception-Cognition-Action Chain
L Chen, Y Zhang, S Ren, H Zhao, Z Cai, Y Wang, P Wang, X Meng, T Liu, ...
arXiv preprint arXiv:2402.15527, 2024
Cited by: 12*

CUGE: A Chinese Language Understanding and Generation Evaluation Benchmark
Y Yao, Q Dong, J Guan, B Cao, Z Zhang, C Xiao, X Wang, F Qi, J Bao, ...
arXiv preprint arXiv:2112.13610, 2021
Cited by: 11

FETV: A Benchmark for Fine-Grained Evaluation of Open-Domain Text-to-Video Generation
Y Liu, L Li, S Ren, R Gao, S Li, S Chen, X Sun, L Hou
NeurIPS 2023 (Datasets and Benchmarks Track), 2023
Cited by: 7

TimeChat: A Time-sensitive Multimodal Large Language Model for Long Video Understanding
S Ren, L Yao, S Li, X Sun, L Hou
CVPR 2024, 2023
Cited by: 5

TESTA: Temporal-Spatial Token Aggregation for Long-form Video-Language Understanding
S Ren, S Chen, S Li, X Sun, L Hou
Findings of EMNLP 2023, 2023
Cited by: 2

VITATECS: A Diagnostic Dataset for Temporal Concept Understanding of Video-Language Models
S Li, L Li, S Ren, Y Liu, Y Liu, R Gao, X Sun, L Hou
arXiv preprint arXiv:2311.17404, 2023
Cited by: 1

LaDiC: Are Diffusion Models Really Inferior to Autoregressive Counterparts for Image-to-Text Generation?
Y Wang, S Ren, R Gao, L Yao, Q Guo, K An, J Bai, X Sun
arXiv preprint arXiv:2404.10763, 2024

Towards Multimodal Video Paragraph Captioning Models Robust to Missing Modality
S Chen, L Li, S Ren, R Gao, Y Liu, X Bi, X Sun, L Hou
arXiv preprint arXiv:2403.19221, 2024

TempCompass: Do Video LLMs Really Understand Videos?
Y Liu, S Li, Y Liu, Y Wang, S Ren, L Li, S Chen, X Sun, L Hou
arXiv preprint arXiv:2403.00476, 2024