CleanML: A study for evaluating the impact of data cleaning on ML classification tasks P Li, X Rao, J Blase, Y Zhang, X Chu, C Zhang 2021 IEEE 37th International Conference on Data Engineering (ICDE), 13-24, 2021 | 99* | 2021 |
Nearest neighbor classifiers over incomplete information: From certain answers to certain predictions B Karlaš, P Li, R Wu, NM Gürel, X Chu, W Wu, C Zhang arXiv preprint arXiv:2005.05117, 2020 | 33 | 2020 |
Auto-FuzzyJoin: auto-program fuzzy similarity joins without labeled examples P Li, X Cheng, X Chu, Y He, S Chaudhuri Proceedings of the 2021 International Conference on Management of Data, 1064 …, 2021 | 22 | 2021 |
Demonstration of Panda: a weakly supervised entity matching system R Wu, P Sakala, P Li, X Chu, Y He arXiv preprint arXiv:2106.10821, 2021 | 8 | 2021 |
A model-agnostic approach for learning with noisy labels of arbitrary distributions S Hao, P Li, R Wu, X Chu 2022 IEEE 38th International Conference on Data Engineering (ICDE), 1219-1231, 2022 | 2 | 2022 |
Auto-Tables: Synthesizing Multi-Step Transformations to Relationalize Tables without Using Examples P Li, Y He, C Yan, Y Wang, S Chaudhuri arXiv preprint arXiv:2307.14565, 2023 | 1 | 2023 |
DiffPrep: Differentiable Data Preprocessing Pipeline Search for Learning over Tabular Data P Li, Z Chen, X Chu, K Rong Proceedings of the ACM on Management of Data 1 (2), 1-26, 2023 | 1 | 2023 |
Experiences and Lessons Learned from the SIGMOD Entity Resolution Programming Contests A De Angelis, M Mazzei, F Piai, P Merialdo, G Simonini, L Zecchini, ... ACM SIGMOD Record 52 (2), 43-47, 2023 | | 2023 |