Follow
Shitao Xiao
Shitao Xiao
Verified email at bupt.edu.cn
Title
Cited by
Cited by
Year
Graphformers: Gnn-nested transformers for representation learning on textual graph
J Yang, Z Liu, S Xiao, C Li, D Lian, S Agrawal, A Singh, G Sun, X Xie
Advances in Neural Information Processing Systems 34, 28798-28810, 2021
822021
RetroMAE: Pre-training Retrieval-oriented Transformers via Masked Auto-Encoder
S Xiao, Z Liu, Y Shao, Z Cao
arXiv preprint arXiv:2205.12035, 2022
80*2022
C-pack: Packaged resources to advance general chinese embedding
S Xiao, Z Liu, P Zhang, N Muennighof
arXiv preprint arXiv:2309.07597, 2023
772023
LECF: recommendation via learnable edge collaborative filtering
S Xiao, Y Shao, Y Li, H Yin, Y Shen, B Cui
Science China Information Sciences 65 (1), 112101, 2022
312022
Retrieve anything to augment large language models
P Zhang, S Xiao, Z Liu, Z Dou, JY Nie
arXiv preprint arXiv:2310.07554, 2023
262023
Training large-scale news recommenders with pretrained language models in the loop
S Xiao, Z Liu, Y Shao, T Di, B Middha, F Wu, X Xie
Proceedings of the 28th ACM SIGKDD Conference on Knowledge Discovery and …, 2022
252022
Matching-oriented Product Quantization For Ad-hoc Retrieval
S Xiao, Z Liu, Y Shao, D Lian, X Xie
EMNLP, 2021
182021
Uni-retriever: Towards learning the unified embedding based retriever in bing sponsored search
J Zhang, Z Liu, W Han, S Xiao, R Zheng, Y Shao, H Sun, H Zhu, ...
Proceedings of the 28th ACM SIGKDD Conference on Knowledge Discovery and …, 2022
152022
Distill-vq: Learning retrieval oriented vector quantization by distilling knowledge from dense embeddings
S Xiao, Z Liu, W Han, J Zhang, D Lian, Y Gong, Q Chen, F Yang, H Sun, ...
Proceedings of the 45th International ACM SIGIR Conference on Research and …, 2022
152022
Retromae-2: Duplex masked auto-encoder for pre-training retrieval-oriented language models
Z Liu, S Xiao, Y Shao, Z Cao
Proceedings of the 61st Annual Meeting of the Association for Computational …, 2023
102023
Progressively optimized bi-granular document representation for scalable embedding based retrieval
S Xiao, Z Liu, W Han, J Zhang, Y Shao, D Lian, C Li, H Sun, D Deng, ...
Proceedings of the ACM Web Conference 2022, 286-296, 2022
102022
Soaring from 4K to 400K: Extending LLM's Context with Activation Beacon
P Zhang, Z Liu, S Xiao, N Shao, Q Ye, Z Dou
arXiv preprint arXiv:2401.03462, 2024
92024
Bge m3-embedding: Multi-lingual, multi-functionality, multi-granularity text embeddings through self-knowledge distillation
J Chen, S Xiao, P Zhang, K Luo, D Lian, Z Liu
arXiv preprint arXiv:2402.03216, 2024
72024
Mindsim: user simulator for news recommenders
X Luo, Z Liu, S Xiao, X Xie, D Li
Proceedings of the ACM Web Conference 2022, 2067-2077, 2022
52022
Making large language models a better foundation for dense retrieval
C Li, Z Liu, S Xiao, Y Shao
arXiv preprint arXiv:2312.15503, 2023
22023
Lm-cocktail: Resilient tuning of language models via model merging
S Xiao, Z Liu, P Zhang, X Xing
arXiv preprint arXiv:2311.13534, 2023
22023
A Mutually Reinforced Framework for Pretrained Sentence Embeddings
J Yang, Z Liu, S Xiao, J Lian, L Wu, D Lian, G Sun, X Xie
arXiv preprint arXiv:2202.13802, 2022
22022
LibVQ: A Toolkit for Optimizing Vector Quantization and Efficient Neural Retrieval
C Li, Z Liu, S Xiao, Y Shao, D Lian, Z Cao
Proceedings of the 46th International ACM SIGIR Conference on Research and …, 2023
12023
Extensible Embedding: A Flexible Multipler For LLM's Context Length
N Shao, S Xiao, Z Liu, P Zhang
arXiv preprint arXiv:2402.11577, 2024
2024
BGE Landmark Embedding: A Chunking-Free Embedding Method For Retrieval Augmented Long-Context Large Language Models
K Luo, Z Liu, S Xiao, K Liu
arXiv preprint arXiv:2402.11573, 2024
2024
The system can't perform the operation now. Try again later.
Articles 1–20