default search action
Qingpeng Cai 0001
Person information
- affiliation: Kuaishou Technology, Beijing, China
- affiliation (former): Alibaba Group
- affiliation (former): Tsinghua University, China
Other persons with the same name
- Qingpeng Cai
- Qingpeng Cai 0002 — National University of Singapore, Singapore
Refine list
refinements active!
zoomed in on ?? of ?? records
view refined list in
export refined list as
2020 – today
- 2024
- [c27]Ziru Liu, Shuchang Liu, Bin Yang, Zhenghai Xue, Qingpeng Cai, Xiangyu Zhao, Zijian Zhang, Lantao Hu, Han Li, Peng Jiang:
Modeling User Retention through Generative Flow Networks. KDD 2024: 5497-5508 - [c26]Xiaobei Wang, Shuchang Liu, Xueliang Wang, Qingpeng Cai, Lantao Hu, Han Li, Peng Jiang, Kun Gai, Guangming Xie:
Future Impact Decomposition in Request-level Recommendations. KDD 2024: 5905-5916 - [c25]Zijian Zhang, Shuchang Liu, Jiaao Yu, Qingpeng Cai, Xiangyu Zhao, Chunxu Zhang, Ziru Liu, Qidong Liu, Hongwei Zhao, Lantao Hu, Peng Jiang, Kun Gai:
M3oE: Multi-Domain Multi-Task Mixture-of Experts Recommendation Framework. SIGIR 2024: 893-902 - [c24]Ziru Liu, Shuchang Liu, Zijian Zhang, Qingpeng Cai, Xiangyu Zhao, Kesen Zhao, Lantao Hu, Peng Jiang, Kun Gai:
Sequential Recommendation for Optimizing Both Immediate Feedback and Long-term Retention. SIGIR 2024: 1872-1882 - [c23]Qingpeng Cai, Xiangyu Zhao, Ling Pan, Xin Xin, Jin Huang, Weinan Zhang, Li Zhao, Dawei Yin, Grace Hui Yang:
AgentIR: 1st Workshop on Agent-based Information Retrieval. SIGIR 2024: 3025-3028 - [i30]Xiaobei Wang, Shuchang Liu, Xueliang Wang, Qingpeng Cai, Lantao Hu, Han Li, Peng Jiang, Guangming Xie:
Future Impact Decomposition in Request-level Recommendations. CoRR abs/2401.16108 (2024) - [i29]Ziru Liu, Shuchang Liu, Zijian Zhang, Qingpeng Cai, Xiangyu Zhao, Kesen Zhao, Lantao Hu, Peng Jiang, Kun Gai:
Sequential Recommendation for Optimizing Both Immediate Feedback and Long-term Retention. CoRR abs/2404.03637 (2024) - [i28]Zijian Zhang, Shuchang Liu, Jiaao Yu, Qingpeng Cai, Xiangyu Zhao, Chunxu Zhang, Ziru Liu, Qidong Liu, Hongwei Zhao, Lantao Hu, Peng Jiang, Kun Gai:
M3oE: Multi-Domain Multi-Task Mixture-of Experts Recommendation Framework. CoRR abs/2404.18465 (2024) - [i27]Chunhui Li, Cheng-Hao Liu, Dianbo Liu, Qingpeng Cai, Ling Pan:
Bifurcated Generative Flow Networks. CoRR abs/2406.01901 (2024) - [i26]Haoran He, Emmanuel Bengio, Qingpeng Cai, Ling Pan:
Rectifying Reinforcement Learning for Reward Matching. CoRR abs/2406.02213 (2024) - [i25]Ziru Liu, Shuchang Liu, Bin Yang, Zhenghai Xue, Qingpeng Cai, Xiangyu Zhao, Zijian Zhang, Lantao Hu, Han Li, Peng Jiang:
Modeling User Retention through Generative Flow Networks. CoRR abs/2406.06043 (2024) - [i24]Jiaju Chen, Chongming Gao, Shuai Yuan, Shuchang Liu, Qingpeng Cai, Peng Jiang:
DLCRec: A Novel Approach for Managing Diversity in LLM-Based Recommender Systems. CoRR abs/2408.12470 (2024) - 2023
- [c22]Wanqi Xue, Qingpeng Cai, Ruohan Zhan, Dong Zheng, Peng Jiang, Kun Gai, Bo An:
ResAct: Reinforcing Long-term Engagement in Sequential Recommendation with Residual Actor. ICLR 2023 - [c21]Shuchang Liu, Qingpeng Cai, Zhankui He, Bowen Sun, Julian J. McAuley, Dong Zheng, Peng Jiang, Kun Gai:
Generative Flow Network for Listwise Recommendation. KDD 2023: 1524-1534 - [c20]Wanqi Xue, Qingpeng Cai, Zhenghai Xue, Shuo Sun, Shuchang Liu, Dong Zheng, Peng Jiang, Kun Gai, Bo An:
PrefRec: Recommender Systems with Human Preferences for Reinforcing Long-term User Engagement. KDD 2023: 2874-2884 - [c19]Zhenghai Xue, Qingpeng Cai, Shuchang Liu, Dong Zheng, Peng Jiang, Kun Gai, Bo An:
State Regularized Policy Optimization on Data with Dynamics Shift. NeurIPS 2023 - [c18]Kesen Zhao, Shuchang Liu, Qingpeng Cai, Xiangyu Zhao, Ziru Liu, Dong Zheng, Peng Jiang, Kun Gai:
KuaiSim: A Comprehensive Simulator for Recommender Systems. NeurIPS 2023 - [c17]Qingpeng Cai, Shuchang Liu, Xueliang Wang, Tianyou Zuo, Wentao Xie, Bin Yang, Dong Zheng, Peng Jiang, Kun Gai:
Reinforcing User Retention in a Billion Scale Short Video Recommender System. WWW (Companion Volume) 2023: 421-426 - [c16]Shuchang Liu, Qingpeng Cai, Bowen Sun, Yuhao Wang, Ji Jiang, Dong Zheng, Peng Jiang, Kun Gai, Xiangyu Zhao, Yongfeng Zhang:
Exploration and Regularization of the Latent Action Space in Recommendation. WWW 2023: 833-844 - [c15]Qingpeng Cai, Zhenghai Xue, Chi Zhang, Wanqi Xue, Shuchang Liu, Ruohan Zhan, Xueliang Wang, Tianyou Zuo, Wentao Xie, Dong Zheng, Peng Jiang, Kun Gai:
Two-Stage Constrained Actor-Critic for Short Video Recommendation. WWW 2023: 865-875 - [c14]Ziru Liu, Jiejie Tian, Qingpeng Cai, Xiangyu Zhao, Jingtong Gao, Shuchang Liu, Dayou Chen, Tonghao He, Dong Zheng, Peng Jiang, Kun Gai:
Multi-Task Recommendations with Reinforcement Learning. WWW 2023: 1273-1282 - [i23]Qingpeng Cai, Zhenghai Xue, Chi Zhang, Wanqi Xue, Shuchang Liu, Ruohan Zhan, Xueliang Wang, Tianyou Zuo, Wentao Xie, Dong Zheng, Peng Jiang, Kun Gai:
Two-Stage Constrained Actor-Critic for Short Video Recommendation. CoRR abs/2302.01680 (2023) - [i22]Qingpeng Cai, Shuchang Liu, Xueliang Wang, Tianyou Zuo, Wentao Xie, Bin Yang, Dong Zheng, Peng Jiang, Kun Gai:
Reinforcing User Retention in a Billion Scale Short Video Recommender System. CoRR abs/2302.01724 (2023) - [i21]Ziru Liu, Jiejie Tian, Qingpeng Cai, Xiangyu Zhao, Jingtong Gao, Shuchang Liu, Dayou Chen, Tonghao He, Dong Zheng, Peng Jiang, Kun Gai:
Multi-Task Recommendations with Reinforcement Learning. CoRR abs/2302.03328 (2023) - [i20]Shuchang Liu, Qingpeng Cai, Bowen Sun, Yuhao Wang, Ji Jiang, Dong Zheng, Kun Gai, Peng Jiang, Xiangyu Zhao, Yongfeng Zhang:
Exploration and Regularization of the Latent Action Space in Recommendation. CoRR abs/2302.03431 (2023) - [i19]Shuchang Liu, Qingpeng Cai, Zhankui He, Bowen Sun, Julian J. McAuley, Dong Zheng, Peng Jiang, Kun Gai:
Generative Flow Network for Listwise Recommendation. CoRR abs/2306.02239 (2023) - [i18]Zhenghai Xue, Qingpeng Cai, Shuchang Liu, Dong Zheng, Peng Jiang, Kun Gai, Bo An:
State Regularized Policy Optimization on Data with Dynamics Shift. CoRR abs/2306.03552 (2023) - [i17]Yue Feng, Shuchang Liu, Zhenghai Xue, Qingpeng Cai, Lantao Hu, Peng Jiang, Kun Gai, Fei Sun:
A Large Language Model Enhanced Conversational Recommender System. CoRR abs/2308.06212 (2023) - [i16]Kesen Zhao, Shuchang Liu, Qingpeng Cai, Xiangyu Zhao, Ziru Liu, Dong Zheng, Peng Jiang, Kun Gai:
KuaiSim: A Comprehensive Simulator for Recommender Systems. CoRR abs/2309.12645 (2023) - [i15]Zhenghai Xue, Qingpeng Cai, Tianyou Zuo, Bin Yang, Lantao Hu, Peng Jiang, Kun Gai, Bo An:
AdaRec: Adaptive Sequential Recommendation for Reinforcing Long-term User Engagement. CoRR abs/2310.03984 (2023) - 2022
- [i14]Qingpeng Cai, Ruohan Zhan, Chi Zhang, Jie Zheng, Guangwei Ding, Pinghua Gong, Dong Zheng, Peng Jiang:
Constrained Reinforcement Learning for Short Video Recommendation. CoRR abs/2205.13248 (2022) - [i13]Wanqi Xue, Qingpeng Cai, Ruohan Zhan, Dong Zheng, Peng Jiang, Bo An:
ResAct: Reinforcing Long-term Engagement in Sequential Recommendation with Residual Actor. CoRR abs/2206.02620 (2022) - [i12]Wanqi Xue, Qingpeng Cai, Zhenghai Xue, Shuo Sun, Shuchang Liu, Dong Zheng, Peng Jiang, Bo An:
PrefRec: Preference-based Recommender Systems for Reinforcing Long-term User Engagement. CoRR abs/2212.02779 (2022) - 2021
- [j1]Ling Pan, Qingpeng Cai, Longbo Huang:
Exploration in policy optimization through multiple paths. Auton. Agents Multi Agent Syst. 35(2): 33 (2021) - 2020
- [c13]Qingpeng Cai, Ling Pan, Pingzhong Tang:
Deterministic Value-Policy Gradients. AAAI 2020: 3316-3323 - [c12]Ling Pan, Qingpeng Cai, Longbo Huang:
Multi-Path Policy Optimization. AAMAS 2020: 1001-1009 - [c11]Ling Pan, Qingpeng Cai, Qi Meng, Wei Chen, Longbo Huang:
Reinforcement Learning with Dynamic Boltzmann Softmax Updates. IJCAI 2020: 1992-1998 - [c10]Ling Pan, Qingpeng Cai, Longbo Huang:
Softmax Deep Double Deterministic Policy Gradients. NeurIPS 2020 - [i11]Jianxiong Wei, Anxiang Zeng, Yueqiu Wu, Peng Guo, Qingsong Hua, Qingpeng Cai:
Generator and Critic: A Deep Reinforcement Learning Approach for Slate Re-ranking in E-commerce. CoRR abs/2005.12206 (2020) - [i10]Ling Pan, Qingpeng Cai, Longbo Huang:
Softmax Deep Double Deterministic Policy Gradients. CoRR abs/2010.09177 (2020)
2010 – 2019
- 2019
- [c9]Ling Pan, Qingpeng Cai, Zhixuan Fang, Pingzhong Tang, Longbo Huang:
A Deep Reinforcement Learning Framework for Rebalancing Dockless Bike Sharing Systems. AAAI 2019: 1393-1400 - [c8]Feiyang Pan, Qingpeng Cai, Anxiang Zeng, Chun-Xiang Pan, Qing Da, Hua-Lin He, Qing He, Pingzhong Tang:
Policy Optimization with Model-Based Explorations. AAAI 2019: 4675-4682 - [c7]Feiyang Pan, Qingpeng Cai, Pingzhong Tang, Fuzhen Zhuang, Qing He:
Policy Gradients for Contextual Recommendations. WWW 2019: 1421-1431 - [i9]Ling Pan, Qingpeng Cai, Qi Meng, Wei Chen, Longbo Huang, Tie-Yan Liu:
Reinforcement Learning with Dynamic Boltzmann Softmax Updates. CoRR abs/1903.05926 (2019) - [i8]Qingpeng Cai, Will Hang, Azalia Mirhoseini, George Tucker, Jingtao Wang, Wei Wei:
Reinforcement Learning Driven Heuristic Optimization. CoRR abs/1906.06639 (2019) - [i7]Qingpeng Cai, Ling Pan, Pingzhong Tang:
Deterministic Value-Policy Gradients. CoRR abs/1909.03939 (2019) - [i6]Ling Pan, Qingpeng Cai, Longbo Huang:
Multi-Path Policy Optimization. CoRR abs/1911.04207 (2019) - 2018
- [c6]Qingpeng Cai, Aris Filos-Ratsikas, Pingzhong Tang, Yiwei Zhang:
Reinforcement Mechanism Design for Fraudulent Behaviour in e-Commerce. AAAI 2018: 957-964 - [c5]Qingpeng Cai, Pingzhong Tang, Yulong Zeng:
Ranking Mechanism Design for Price-setting Agents in E-commerce. AAMAS 2018: 1504-1512 - [c4]Qingpeng Cai, Aris Filos-Ratsikas, Pingzhong Tang, Yiwei Zhang:
Reinforcement Mechanism Design for e-commerce. WWW 2018: 1339-1348 - [i5]Feiyang Pan, Qingpeng Cai, Pingzhong Tang, Fuzhen Zhuang, Qing He:
Policy Gradients for Contextual Bandits. CoRR abs/1802.04162 (2018) - [i4]Ling Pan, Qingpeng Cai, Zhixuan Fang, Pingzhong Tang, Longbo Huang:
Rebalancing Dockless Bike Sharing Systems. CoRR abs/1802.04592 (2018) - [i3]Qingpeng Cai, Ling Pan, Pingzhong Tang:
Generalized deterministic policy gradient algorithms. CoRR abs/1807.03708 (2018) - [i2]Feiyang Pan, Qingpeng Cai, Anxiang Zeng, Chun-Xiang Pan, Qing Da, Hua-Lin He, Qing He, Pingzhong Tang:
Policy Optimization with Model-based Explorations. CoRR abs/1811.07350 (2018) - 2017
- [c3]Chang Liu, Qingpeng Cai, Yukui Zhang:
Multi-armed Bandit Mechanism with Private Histories. AAMAS 2017: 1607-1609 - [i1]Qingpeng Cai, Aris Filos-Ratsikas, Pingzhong Tang, Yiwei Zhang:
Reinforcement Mechanism Design for e-commerce. CoRR abs/1708.07607 (2017) - 2016
- [c2]Qingpeng Cai, Aris Filos-Ratsikas, Pingzhong Tang:
Facility Location with Minimax Envy. IJCAI 2016: 137-143 - [c1]Qingpeng Cai, Aris Filos-Ratsikas, Chang Liu, Pingzhong Tang:
Mechanism Design for Personalized Recommender Systems. RecSys 2016: 159-166
Coauthor Index
manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.
Unpaywalled article links
Add open access links from to the list of external document links (if available).
Privacy notice: By enabling the option above, your browser will contact the API of unpaywall.org to load hyperlinks to open access articles. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Unpaywall privacy policy.
Archived links via Wayback Machine
For web page which are no longer available, try to retrieve content from the of the Internet Archive (if available).
Privacy notice: By enabling the option above, your browser will contact the API of archive.org to check for archived content of web pages that are no longer available. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Internet Archive privacy policy.
Reference lists
Add a list of references from , , and to record detail pages.
load references from crossref.org and opencitations.net
Privacy notice: By enabling the option above, your browser will contact the APIs of crossref.org, opencitations.net, and semanticscholar.org to load article reference information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Crossref privacy policy and the OpenCitations privacy policy, as well as the AI2 Privacy Policy covering Semantic Scholar.
Citation data
Add a list of citing articles from and to record detail pages.
load citations from opencitations.net
Privacy notice: By enabling the option above, your browser will contact the API of opencitations.net and semanticscholar.org to load citation information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the OpenCitations privacy policy as well as the AI2 Privacy Policy covering Semantic Scholar.
OpenAlex data
Load additional information about publications from .
Privacy notice: By enabling the option above, your browser will contact the API of openalex.org to load additional information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the information given by OpenAlex.
last updated on 2025-01-03 23:26 CET by the dblp team
all metadata released as open data under CC0 1.0 license
see also: Terms of Use | Privacy Policy | Imprint