


Chengliang Chai
Person information
- affiliation: Tsinghua University, China
2020 – today
- 2024
- [j34]Jintao Zhang, Chao Zhang, Guoliang Li, Chengliang Chai:
PACE: Poisoning Attacks on Learned Cardinality Estimation. Proc. ACM Manag. Data 2(1): 37:1-37:27 (2024) - [j33]Yuhao Deng, Chengliang Chai, Lei Cao, Nan Tang, Jiayi Wang, Ju Fan, Ye Yuan, Guoren Wang:
MisDetect: Iterative Mislabel Detection using Early Loss. Proc. VLDB Endow. 17(6): 1159-1172 (2024) - [j32]Yuhao Deng, Chengliang Chai, Lei Cao, Qin Yuan, Siyuan Chen, Yanrui Yu, Zhaoze Sun, Junyi Wang, Jiajun Li, Ziqi Cao, Kaisen Jin, Chi Zhang, Yuqing Jiang, Yuanfang Zhang, Yuping Wang, Ye Yuan, Guoren Wang, Nan Tang:
LakeBench: A Benchmark for Discovering Joinable and Unionable Tables in Data Lakes. Proc. VLDB Endow. 17(8): 1925-1938 (2024) - [j31]Boyan Li, Yuyu Luo, Chengliang Chai, Guoliang Li, Nan Tang:
The Dawn of Natural Language to SQL: Are We Fully Ready? [Experiment, Analysis & Benchmark ]. Proc. VLDB Endow. 17(11): 3318-3331 (2024) - [j30]Chengliang Chai, Yuhao Deng, Yutong Zhan, Ziqi Cao, Yuanfang Zhang, Lei Cao, Yu-Ping Wang, Zhiwei Zhang, Ye Yuan, Guoren Wang, Nan Tang:
LakeCompass: An End-to-End System for Table Maintenance, Search and Analysis in Data Lakes. Proc. VLDB Endow. 17(12): 4381-4384 (2024) - [j29]Jiayi Wang, Chengliang Chai, Jiabin Liu, Guoliang Li:
Cardinality estimation using normalizing flow. VLDB J. 33(2): 323-348 (2024) - [c31]Jiyun Shi, Yuqiao Wang, Chi Zhang, Zhaojing Luo, Chengliang Chai, Meihui Zhang:
DMRNet: Effective Network for Accurate Discharge Medication Recommendation. ICDE 2024: 3393-3406 - [c30]Peng Huang, Meihui Zhang, Ziyue Zhong, Chengliang Chai, Ju Fan:
Representation Learning for Entity Alignment in Knowledge Graph: A Design Space Exploration. ICDE 2024: 3462-3475 - [c29]Chengliang Chai, Kaisen Jin, Nan Tang, Ju Fan, Lianpeng Qiao, Yuping Wang, Yuyu Luo, Ye Yuan, Guoren Wang:
Mitigating Data Scarcity in Supervised Machine Learning Through Reinforcement Learning Guided Data Generation. ICDE 2024: 3613-3626 - [c28]Meihao Fan, Xiaoyue Han, Ju Fan, Chengliang Chai, Nan Tang, Guoliang Li, Xiaoyong Du:
Cost-Effective In-Context Learning for Entity Resolution: A Design Space Exploration. ICDE 2024: 3696-3709 - [c27]Xin Tang, Chengliang Chai, Dawei Zhao, Haohai Ma, Yong Zheng, Zhenyong Fan, Xin Wu, Jiaquan Zhang, Rui Zhang, Duanshun Li, Yi He, Keji Huang, Guangbin Meng, Yidong Wang, Yuefeng Zhou, Tao Tao, Lirong Jian, Jiwu Shu, Yuping Wang, Ye Yuan, Guoren Wang, Guoliang Li:
Separation Is for Better Reunion: Data Lake Storage at Huawei. ICDE 2024: 5142-5155 - [c26]Meihui Zhang, Zhaoxuan Ji, Zhaojing Luo, Yuncheng Wu, Chengliang Chai:
Applications and Challenges for Large Language Models: From Data Management Perspective. ICDE 2024: 5530-5541 - [c25]Yuhao Deng, Deng Qiyan, Chengliang Chai, Lei Cao, Nan Tang, Ju Fan, Jiayi Wang, Ye Yuan, Guoren Wang:
IDE: A System for Iterative Mislabel Detection. SIGMOD Conference Companion 2024: 500-503 - [i6]Boyan Li, Yuyu Luo, Chengliang Chai, Guoliang Li, Nan Tang:
The Dawn of Natural Language to SQL: Are We Fully Ready? CoRR abs/2406.01265 (2024) - [i5]Jintao Zhang, Chao Zhang, Guoliang Li, Chengliang Chai:
PACE: Poisoning Attacks on Learned Cardinality Estimation. CoRR abs/2409.15990 (2024) - [i4]Jintao Zhang, Chao Zhang, Guoliang Li, Chengliang Chai:
AutoCE: An Accurate and Efficient Model Advisor for Learned Cardinality Estimation. CoRR abs/2409.16027 (2024) - [i3]Chi Zhang, Huaping Zhong, Kuan Zhang, Chengliang Chai, Rui Wang, Xinlin Zhuang, Tianyi Bai, Jiantao Qiu, Lei Cao, Ye Yuan, Guoren Wang, Conghui He:
Harnessing Diversity for Important Data Selection in Pretraining Large Language Models. CoRR abs/2409.16986 (2024) - 2023
- [j28]Bingxue Zhang, Yang Shi, Yuxing Li, Chengliang Chai, Longfeng Hou:
An enhanced Elo-based student model for polychotomously scored items in adaptive educational system. Interact. Learn. Environ. 31(9): 5477-5494 (2023) - [j27]Yuyu Luo, Yihui Zhou, Nan Tang, Guoliang Li, Chengliang Chai, Leixian Shen:
Learned Data-aware Image Representations of Line Charts for Similarity Search. Proc. ACM Manag. Data 1(1): 88:1-88:29 (2023) - [j26]Sibei Chen, Nan Tang, Ju Fan, Xuemi Yan, Chengliang Chai, Guoliang Li, Xiaoyong Du:
HAIPipe: Combining Human-generated and Machine-generated Pipelines for Data Preparation. Proc. ACM Manag. Data 1(1): 91:1-91:26 (2023) - [j25]Chengliang Chai, Jiabin Liu, Nan Tang, Ju Fan, Dongjing Miao, Jiayi Wang, Yuyu Luo, Guoliang Li:
GoodCore: Data-effective and Data-efficient Machine Learning through Coreset Selection over Incomplete Data. Proc. ACM Manag. Data 1(2): 157:1-157:27 (2023) - [j24]Chengliang Chai, Jiayi Wang, Yuyu Luo, Zeping Niu, Guoliang Li:
Data Management for Machine Learning: A Survey. IEEE Trans. Knowl. Data Eng. 35(5): 4646-4667 (2023) - [j23]Shuang Hao, Chengliang Chai, Guoliang Li, Nan Tang, Ning Wang, Xiang Yu:
HOFD: An Outdated Fact Detector for Knowledge Bases. IEEE Trans. Knowl. Data Eng. 35(10): 10775-10789 (2023) - [c24]Tianyu Zhao, Chengliang Chai, Jiabin Liu, Guoliang Li, Jianhua Feng, Zitao Liu:
A Topic-Aware Data Generation Framework for Math Word Problems. DASFAA (4) 2023: 286-302 - [c23]Jintao Zhang, Chao Zhang, Guoliang Li, Chengliang Chai:
AutoCE: An Accurate and Efficient Model Advisor for Learned Cardinality Estimation. ICDE 2023: 2621-2633 - [c22]Xuanhe Zhou, Chengliang Chai, Guoliang Li, Ji Sun:
Database Meets Artificial Intelligence: A Survey (Extended Abstract). ICDE 2023: 3901-3902 - [c21]Chengliang Chai, Jiayi Wang, Nan Tang, Ye Yuan, Jiabin Liu, Yuhao Deng, Guoren Wang:
Efficient Coreset Selection with Cluster-based Methods. KDD 2023: 167-178 - [c20]Chengliang Chai, Nan Tang, Ju Fan, Yuyu Luo:
Demystifying Artificial Intelligence for Data Preparation. SIGMOD Conference Companion 2023: 13-20 - [i2]Meihao Fan, Xiaoyue Han, Ju Fan, Chengliang Chai, Nan Tang, Guoliang Li, Xiaoyong Du:
Cost-Effective In-Context Learning for Entity Resolution: A Design Space Exploration. CoRR abs/2312.03987 (2023) - 2022
- [j22]Xiang Yu, Chengliang Chai, Xinning Zhang, Nan Tang, Ji Sun, Guoliang Li:
AlphaQO: Robust Learned Query Optimizer. Int. J. Softw. Informatics 12(1): 7-29 (2022) - [j21]Guoliang Li, Nan Tang, Chengliang Chai:
Preface. J. Comput. Sci. Technol. 37(5): 1003-1004 (2022) - [j20]Chengliang Chai, Jiabin Liu, Nan Tang, Guoliang Li, Yuyu Luo:
Selective Data Acquisition in the Wild for Model Charging. Proc. VLDB Endow. 15(7): 1466-1478 (2022) - [j19]Jianhong Tu, Xiaoyue Han, Ju Fan, Nan Tang, Chengliang Chai, Guoliang Li, Xiaoyong Du:
DADER: Hands-Off Entity Resolution with Domain Adaptation. Proc. VLDB Endow. 15(12): 3666-3669 (2022) - [j18]Xiang Yu, Chengliang Chai, Guoliang Li, Jiabin Liu:
Cost-based or Learning-based? A Hybrid Query Optimizer for Query Plan Selection. Proc. VLDB Endow. 15(13): 3924-3936 (2022) - [j17]Jiayi Wang, Chengliang Chai, Nan Tang, Jiabin Liu, Guoliang Li:
Coresets over Multiple Tables for Feature-rich and Data-efficient Machine Learning. Proc. VLDB Endow. 16(1): 64-76 (2022) - [j16]Yuyu Luo, Xuedi Qin, Chengliang Chai, Nan Tang, Guoliang Li, Wenbo Li:
Steerable Self-Driving Data Visualization. IEEE Trans. Knowl. Data Eng. 34(1): 475-490 (2022) - [j15]Xuanhe Zhou, Chengliang Chai, Guoliang Li, Ji Sun:
Database Meets Artificial Intelligence: A Survey. IEEE Trans. Knowl. Data Eng. 34(3): 1096-1116 (2022) - [j14]Yuyu Luo, Nan Tang, Guoliang Li, Jiawei Tang, Chengliang Chai, Xuedi Qin:
Natural Language to Visualization by Neural Machine Translation. IEEE Trans. Vis. Comput. Graph. 28(1): 217-226 (2022) - [j13]Tianyu Zhao, Shuai Huang, Yong Wang, Chengliang Chai, Guoliang Li:
RNE: computing shortest paths using road network embedding. VLDB J. 31(3): 507-528 (2022) - [j12]Xuedi Qin, Chengliang Chai, Yuyu Luo, Tianyu Zhao, Nan Tang, Guoliang Li, Jianhua Feng, Xiang Yu, Mourad Ouzzani:
Interactively discovering and ranking desired tuples by data exploration. VLDB J. 31(4): 753-777 (2022) - [c19]Rong Zhu, Ziniu Wu, Chengliang Chai, Andreas Pfadler, Bolin Ding, Guoliang Li, Jingren Zhou:
Learned Query Optimizer: At the Forefront of AI-Driven Databases. EDBT 2022: 1-4 - [c18]Haowen Dong, Chengliang Chai, Yuyu Luo, Jiabin Liu, Jianhua Feng, Chaoqun Zhan:
RW-Tree: A Learned Workload-aware Framework for R-tree Construction. ICDE 2022: 2073-2085 - [c17]Xuedi Qin, Chengliang Chai, Nan Tang, Jian Li, Yuyu Luo, Guoliang Li, Yaoyu Zhu:
Synthesizing Privacy Preserving Entity Resolution Datasets. ICDE 2022: 2359-2371 - [c16]Jiabin Liu, Chengliang Chai, Yuyu Luo, Yin Lou, Jianhua Feng, Nan Tang:
Feature Augmentation with Reinforcement Learning. ICDE 2022: 3360-3372 - [c15]Jianhong Tu, Ju Fan, Nan Tang, Peng Wang, Chengliang Chai, Guoliang Li, Ruixue Fan, Xiaoyong Du:
Domain Adaptation for Deep Entity Resolution. SIGMOD Conference 2022: 443-457 - [c14]Lixi Zhang, Chengliang Chai, Xuanhe Zhou, Guoliang Li:
LearnedSQLGen: Constraint-aware SQL Generation using Reinforcement Learning. SIGMOD Conference 2022: 945-958 - 2021
- [j11]Minghe Yu, Chengliang Chai, Ge Yu:
A Tree-Based Indexing Approach for Diverse Textual Similarity Search. IEEE Access 9: 8866-8876 (2021) - [j10]Jiabin Liu, Fu Zhu, Chengliang Chai, Yuyu Luo, Nan Tang:
Automatic Data Acquisition for Deep Learning. Proc. VLDB Endow. 14(12): 2739-2742 (2021) - [j9]Xuanhe Zhou, Guoliang Li, Chengliang Chai, Jianhua Feng:
A Learned Query Rewrite System using Monte Carlo Tree Search. Proc. VLDB Endow. 15(1): 46-58 (2021) - [j8]Jiayi Wang, Chengliang Chai, Jiabin Liu, Guoliang Li:
FACE: A Normalizing Flow based Cardinality Estimator. Proc. VLDB Endow. 15(1): 72-84 (2021) - [j7]Chengliang Chai, Guoliang Li, Ju Fan, Yuyu Luo:
CrowdChart: Crowdsourced Data Extraction From Visualization Charts. IEEE Trans. Knowl. Data Eng. 33(11): 3537-3549 (2021) - [c13]Xuedi Qin, Chengliang Chai, Yuyu Luo, Tianyu Zhao, Nan Tang, Guoliang Li, Jianhua Feng, Xiang Yu, Mourad Ouzzani:
Ranking Desired Tuples by Database Exploration. ICDE 2021: 1973-1978 - [c12]Yuyu Luo, Nan Tang, Guoliang Li, Chengliang Chai, Wenbo Li, Xuedi Qin:
Synthesizing Natural Language to Visualization (NL2VIS) Benchmarks from NL2SQL Benchmarks. SIGMOD Conference 2021: 1235-1247 - 2020
- [j6]Chengliang Chai, Guoliang Li:
Human-in-the-loop Techniques in Machine Learning. IEEE Data Eng. Bull. 43(3): 37-52 (2020) - [j5]Yuyu Luo, Chengliang Chai, Xuedi Qin, Nan Tang, Guoliang Li:
VisClean: Interactive Cleaning for Progressive Visualization. Proc. VLDB Endow. 13(12): 2821-2824 (2020) - [c11]Haojun Zhang, Chengliang Chai, AnHai Doan, Paris Koutris, Esteban Arcaute:
Manually Detecting Errors for Data Cleaning Using Adaptive Crowdsourcing Strategies. EDBT 2020: 311-322 - [c10]Yuyu Luo, Chengliang Chai, Xuedi Qin, Nan Tang, Guoliang Li:
Interactive Cleaning for Progressive Visualization through Composite Questions. ICDE 2020: 733-744 - [c9]Xiang Yu, Guoliang Li, Chengliang Chai, Nan Tang:
Reinforcement Learning with Tree-LSTM for Join Order Selection. ICDE 2020: 1297-1308 - [c8]Chengliang Chai, Guoliang Li, Ju Fan, Yuyu Luo:
Crowdsourcing-based Data Extraction from Visualization Charts. ICDE 2020: 1814-1817 - [c7]Shuang Hao, Chengliang Chai, Guoliang Li, Nan Tang, Ning Wang, Xiang Yu:
Outdated Fact Detection in Knowledge Bases. ICDE 2020: 1890-1893 - [c6]Chengliang Chai, Lei Cao, Guoliang Li, Jian Li, Yuyu Luo, Samuel Madden:
Human-in-the-loop Outlier Detection. SIGMOD Conference 2020: 19-33 - [c5]Xuedi Qin, Chengliang Chai, Yuyu Luo, Nan Tang, Guoliang Li:
Interactively Discovering and Ranking Desired Tuples without Writing SQL Queries. SIGMOD Conference 2020: 2745-2748
2010 – 2019
- 2019
- [j4]Tianyu Zhao, Chengliang Chai, Yuyu Luo, Jianhua Feng, Yan Huang, Songfan Yang, Haitao Yuan, Haoda Li, Kaiyu Li, Fu Zhu, Kang Pan:
Towards Automatic Mathematical Exercise Solving. Data Sci. Eng. 4(3): 179-192 (2019) - [j3]Chaoqun Zhan, Maomeng Su, Chuangxian Wei, Xiaoqiang Peng, Liang Lin, Sheng Wang, Zhe Chen, Feifei Li, Yue Pan, Fang Zheng, Chengliang Chai:
AnalyticDB: Real-time OLAP Database System at Alibaba Cloud. Proc. VLDB Endow. 12(12): 2059-2070 (2019) - [c4]Chengliang Chai, Ju Fan, Guoliang Li, Jiannan Wang, Yudian Zheng:
Crowdsourcing Database Systems: Overview and Challenges. ICDE 2019: 2052-2055 - 2018
- [j2]Guoliang Li, Chengliang Chai, Ju Fan, Xueping Weng, Jian Li, Yudian Zheng, Yuanbing Li, Xiang Yu, Xiaohang Zhang, Haitao Yuan:
CDB: A Crowd-Powered Database System. Proc. VLDB Endow. 11(12): 1926-1929 (2018) - [j1]Chengliang Chai, Guoliang Li, Jian Li, Dong Deng, Jianhua Feng:
A partial-order-based framework for cost-effective crowdsourced entity resolution. VLDB J. 27(6): 745-770 (2018) - [c3]Chengliang Chai, Ju Fan, Guoliang Li:
Incentive-Based Entity Collection Using Crowdsourcing. ICDE 2018: 341-352 - [i1]Chengliang Chai, Ju Fan, Guoliang Li, Jiannan Wang, Yudian Zheng:
Crowd-Powered Data Mining. CoRR abs/1806.04968 (2018) - 2017
- [c2]Guoliang Li, Chengliang Chai, Ju Fan, Xueping Weng, Jian Li, Yudian Zheng, Yuanbing Li, Xiang Yu, Xiaohang Zhang, Haitao Yuan:
CDB: Optimizing Queries with Crowd-Based Selections and Joins. SIGMOD Conference 2017: 1463-1478 - 2016
- [c1]Chengliang Chai, Guoliang Li, Jian Li, Dong Deng, Jianhua Feng:
Cost-Effective Crowdsourced Entity Resolution: A Partial-Order Approach. SIGMOD Conference 2016: 969-984
last updated on 2025-03-21 01:11 CET by the dblp team
all metadata released as open data under CC0 1.0 license