default search action

combined dblp search
author search
venue search
publication search

ask others

Qingpeng Cai 0001

> Home > Persons

Person information

affiliation: Kuaishou Technology, Beijing, China
affiliation (former): Alibaba Group
affiliation (former): Tsinghua University, China

Other persons with the same name

see FAQ

Refine list

refinements active!

zoomed in on ?? of ?? records

view refined list in

export refined list as

showing all ?? records

2020 – today

see FAQ

What is the meaning of the colors in the publication lists?

2024
[c27]
- view
  authority control:
- export record
  dblp key:
  - conf/kdd/Liu0YX0Z0HLJ24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/kdd/Liu0YX0Z0HLJ24
Ziru Liu, Shuchang Liu, Bin Yang, Zhenghai Xue, Qingpeng Cai, Xiangyu Zhao, Zijian Zhang, Lantao Hu, Han Li, Peng Jiang:
Modeling User Retention through Generative Flow Networks. KDD 2024: 5497-5508
[c26]
- view
  authority control:
- export record
  dblp key:
  - conf/kdd/Wang0W0HLJGX24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/kdd/Wang0W0HLJGX24
Xiaobei Wang, Shuchang Liu, Xueliang Wang, Qingpeng Cai, Lantao Hu, Han Li, Peng Jiang, Kun Gai, Guangming Xie:
Future Impact Decomposition in Request-level Recommendations. KDD 2024: 5905-5916
[c25]
- view
  authority control:
- export record
  dblp key:
  - conf/sigir/Zhang0Y00ZLLZH024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/sigir/Zhang0Y00ZLLZH024
Zijian Zhang, Shuchang Liu, Jiaao Yu, Qingpeng Cai, Xiangyu Zhao, Chunxu Zhang, Ziru Liu, Qidong Liu, Hongwei Zhao, Lantao Hu, Peng Jiang, Kun Gai:
M³oE: Multi-Domain Multi-Task Mixture-of Experts Recommendation Framework. SIGIR 2024: 893-902
[c24]
- view
  authority control:
- export record
  dblp key:
  - conf/sigir/Liu0Z00ZH0G24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/sigir/Liu0Z00ZH0G24
Ziru Liu, Shuchang Liu, Zijian Zhang, Qingpeng Cai, Xiangyu Zhao, Kesen Zhao, Lantao Hu, Peng Jiang, Kun Gai:
Sequential Recommendation for Optimizing Both Immediate Feedback and Long-term Retention. SIGIR 2024: 1872-1882
[c23]
- view
  authority control:
- export record
  dblp key:
  - conf/sigir/00010P0H00YY24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/sigir/00010P0H00YY24
Qingpeng Cai, Xiangyu Zhao, Ling Pan, Xin Xin, Jin Huang, Weinan Zhang, Li Zhao, Dawei Yin, Grace Hui Yang:
AgentIR: 1st Workshop on Agent-based Information Retrieval. SIGIR 2024: 3025-3028
[i30]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2401-16108
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2401-16108
Xiaobei Wang, Shuchang Liu, Xueliang Wang, Qingpeng Cai, Lantao Hu, Han Li, Peng Jiang, Guangming Xie:
Future Impact Decomposition in Request-level Recommendations. CoRR abs/2401.16108 (2024)
[i29]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2404-03637
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2404-03637
Ziru Liu, Shuchang Liu, Zijian Zhang, Qingpeng Cai, Xiangyu Zhao, Kesen Zhao, Lantao Hu, Peng Jiang, Kun Gai:
Sequential Recommendation for Optimizing Both Immediate Feedback and Long-term Retention. CoRR abs/2404.03637 (2024)
[i28]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2404-18465
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2404-18465
Zijian Zhang, Shuchang Liu, Jiaao Yu, Qingpeng Cai, Xiangyu Zhao, Chunxu Zhang, Ziru Liu, Qidong Liu, Hongwei Zhao, Lantao Hu, Peng Jiang, Kun Gai:
M3oE: Multi-Domain Multi-Task Mixture-of Experts Recommendation Framework. CoRR abs/2404.18465 (2024)
[i27]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2406-01901
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2406-01901
Chunhui Li, Cheng-Hao Liu, Dianbo Liu, Qingpeng Cai, Ling Pan:
Bifurcated Generative Flow Networks. CoRR abs/2406.01901 (2024)
[i26]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2406-02213
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2406-02213
Haoran He, Emmanuel Bengio, Qingpeng Cai, Ling Pan:
Rectifying Reinforcement Learning for Reward Matching. CoRR abs/2406.02213 (2024)
[i25]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2406-06043
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2406-06043
Ziru Liu, Shuchang Liu, Bin Yang, Zhenghai Xue, Qingpeng Cai, Xiangyu Zhao, Zijian Zhang, Lantao Hu, Han Li, Peng Jiang:
Modeling User Retention through Generative Flow Networks. CoRR abs/2406.06043 (2024)
[i24]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2408-12470
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2408-12470
Jiaju Chen, Chongming Gao, Shuai Yuan, Shuchang Liu, Qingpeng Cai, Peng Jiang:
DLCRec: A Novel Approach for Managing Diversity in LLM-Based Recommender Systems. CoRR abs/2408.12470 (2024)
2023
[c22]
- view
  - electronic edition @ openreview.net (open access)
  - details & citations
- export record
  dblp key:
  - conf/iclr/Xue0ZZ0G023
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iclr/Xue0ZZ0G023
Wanqi Xue, Qingpeng Cai, Ruohan Zhan, Dong Zheng, Peng Jiang, Kun Gai, Bo An:
ResAct: Reinforcing Long-term Engagement in Sequential Recommendation with Residual Actor. ICLR 2023
[c21]
- view
  authority control:
- export record
  dblp key:
  - conf/kdd/00060HSMZ0G23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/kdd/00060HSMZ0G23
Shuchang Liu, Qingpeng Cai, Zhankui He, Bowen Sun, Julian J. McAuley, Dong Zheng, Peng Jiang, Kun Gai:
Generative Flow Network for Listwise Recommendation. KDD 2023: 1524-1534
[c20]
- view
  authority control:
- export record
  dblp key:
  - conf/kdd/Xue0XS0ZJG023
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/kdd/Xue0XS0ZJG023
Wanqi Xue, Qingpeng Cai, Zhenghai Xue, Shuo Sun, Shuchang Liu, Dong Zheng, Peng Jiang, Kun Gai, Bo An:
PrefRec: Recommender Systems with Human Preferences for Reinforcing Long-term User Engagement. KDD 2023: 2874-2884
[c19]
- view
  - electronic edition @ nips.cc (open access)
  - details & citations
- export record
  dblp key:
  - conf/nips/Xue00Z0G023
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/nips/Xue00Z0G023
Zhenghai Xue, Qingpeng Cai, Shuchang Liu, Dong Zheng, Peng Jiang, Kun Gai, Bo An:
State Regularized Policy Optimization on Data with Dynamics Shift. NeurIPS 2023
[c18]
- view
  - electronic edition @ nips.cc (open access)
  - details & citations
- export record
  dblp key:
  - conf/nips/Zhao000LZJG23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/nips/Zhao000LZJG23
Kesen Zhao, Shuchang Liu, Qingpeng Cai, Xiangyu Zhao, Ziru Liu, Dong Zheng, Peng Jiang, Kun Gai:
KuaiSim: A Comprehensive Simulator for Recommender Systems. NeurIPS 2023
[c17]
- view
  authority control:
- export record
  dblp key:
  - conf/www/00010WZXYZJG23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/www/00010WZXYZJG23
Qingpeng Cai, Shuchang Liu, Xueliang Wang, Tianyou Zuo, Wentao Xie, Bin Yang, Dong Zheng, Peng Jiang, Kun Gai:
Reinforcing User Retention in a Billion Scale Short Video Recommender System. WWW (Companion Volume) 2023: 421-426
[c16]
- view
  authority control:
- export record
  dblp key:
  - conf/www/00060SWJZJGZZ23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/www/00060SWJZJGZZ23
Shuchang Liu, Qingpeng Cai, Bowen Sun, Yuhao Wang, Ji Jiang, Dong Zheng, Peng Jiang, Kun Gai, Xiangyu Zhao, Yongfeng Zhang:
Exploration and Regularization of the Latent Action Space in Recommendation. WWW 2023: 833-844
[c15]
- view
  authority control:
- export record
  dblp key:
  - conf/www/0001XZX0ZWZXZJG23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/www/0001XZX0ZWZXZJG23
Qingpeng Cai, Zhenghai Xue, Chi Zhang, Wanqi Xue, Shuchang Liu, Ruohan Zhan, Xueliang Wang, Tianyou Zuo, Wentao Xie, Dong Zheng, Peng Jiang, Kun Gai:
Two-Stage Constrained Actor-Critic for Short Video Recommendation. WWW 2023: 865-875
[c14]
- view
  authority control:
- export record
  dblp key:
  - conf/www/LiuT0ZGLCHZJG23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/www/LiuT0ZGLCHZJG23
Ziru Liu, Jiejie Tian, Qingpeng Cai, Xiangyu Zhao, Jingtong Gao, Shuchang Liu, Dayou Chen, Tonghao He, Dong Zheng, Peng Jiang, Kun Gai:
Multi-Task Recommendations with Reinforcement Learning. WWW 2023: 1273-1282
[i23]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2302-01680
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2302-01680
Qingpeng Cai, Zhenghai Xue, Chi Zhang, Wanqi Xue, Shuchang Liu, Ruohan Zhan, Xueliang Wang, Tianyou Zuo, Wentao Xie, Dong Zheng, Peng Jiang, Kun Gai:
Two-Stage Constrained Actor-Critic for Short Video Recommendation. CoRR abs/2302.01680 (2023)
[i22]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2302-01724
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2302-01724
Qingpeng Cai, Shuchang Liu, Xueliang Wang, Tianyou Zuo, Wentao Xie, Bin Yang, Dong Zheng, Peng Jiang, Kun Gai:
Reinforcing User Retention in a Billion Scale Short Video Recommender System. CoRR abs/2302.01724 (2023)
[i21]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2302-03328
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2302-03328
Ziru Liu, Jiejie Tian, Qingpeng Cai, Xiangyu Zhao, Jingtong Gao, Shuchang Liu, Dayou Chen, Tonghao He, Dong Zheng, Peng Jiang, Kun Gai:
Multi-Task Recommendations with Reinforcement Learning. CoRR abs/2302.03328 (2023)
[i20]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2302-03431
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2302-03431
Shuchang Liu, Qingpeng Cai, Bowen Sun, Yuhao Wang, Ji Jiang, Dong Zheng, Kun Gai, Peng Jiang, Xiangyu Zhao, Yongfeng Zhang:
Exploration and Regularization of the Latent Action Space in Recommendation. CoRR abs/2302.03431 (2023)
[i19]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2306-02239
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2306-02239
Shuchang Liu, Qingpeng Cai, Zhankui He, Bowen Sun, Julian J. McAuley, Dong Zheng, Peng Jiang, Kun Gai:
Generative Flow Network for Listwise Recommendation. CoRR abs/2306.02239 (2023)
[i18]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2306-03552
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2306-03552
Zhenghai Xue, Qingpeng Cai, Shuchang Liu, Dong Zheng, Peng Jiang, Kun Gai, Bo An:
State Regularized Policy Optimization on Data with Dynamics Shift. CoRR abs/2306.03552 (2023)
[i17]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2308-06212
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2308-06212
Yue Feng, Shuchang Liu, Zhenghai Xue, Qingpeng Cai, Lantao Hu, Peng Jiang, Kun Gai, Fei Sun:
A Large Language Model Enhanced Conversational Recommender System. CoRR abs/2308.06212 (2023)
[i16]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2309-12645
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2309-12645
Kesen Zhao, Shuchang Liu, Qingpeng Cai, Xiangyu Zhao, Ziru Liu, Dong Zheng, Peng Jiang, Kun Gai:
KuaiSim: A Comprehensive Simulator for Recommender Systems. CoRR abs/2309.12645 (2023)
[i15]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2310-03984
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2310-03984
Zhenghai Xue, Qingpeng Cai, Tianyou Zuo, Bin Yang, Lantao Hu, Peng Jiang, Kun Gai, Bo An:
AdaRec: Adaptive Sequential Recommendation for Reinforcing Long-term User Engagement. CoRR abs/2310.03984 (2023)
2022
[i14]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2205-13248
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2205-13248
Qingpeng Cai, Ruohan Zhan, Chi Zhang, Jie Zheng, Guangwei Ding, Pinghua Gong, Dong Zheng, Peng Jiang:
Constrained Reinforcement Learning for Short Video Recommendation. CoRR abs/2205.13248 (2022)
[i13]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2206-02620
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2206-02620
Wanqi Xue, Qingpeng Cai, Ruohan Zhan, Dong Zheng, Peng Jiang, Bo An:
ResAct: Reinforcing Long-term Engagement in Sequential Recommendation with Residual Actor. CoRR abs/2206.02620 (2022)
[i12]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2212-02779
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2212-02779
Wanqi Xue, Qingpeng Cai, Zhenghai Xue, Shuo Sun, Shuchang Liu, Dong Zheng, Peng Jiang, Bo An:
PrefRec: Preference-based Recommender Systems for Reinforcing Long-term User Engagement. CoRR abs/2212.02779 (2022)
2021
[j1]
- view
  authority control:
- export record
  dblp key:
  - journals/aamas/PanCH21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/aamas/PanCH21
Ling Pan, Qingpeng Cai, Longbo Huang:
Exploration in policy optimization through multiple paths. Auton. Agents Multi Agent Syst. 35(2): 33 (2021)
2020
[c13]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/aaai/CaiPT20
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/aaai/CaiPT20
Qingpeng Cai, Ling Pan, Pingzhong Tang:
Deterministic Value-Policy Gradients. AAAI 2020: 3316-3323
[c12]
- view
  authority control:
- export record
  dblp key:
  - conf/atal/PanCH20
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/atal/PanCH20
Ling Pan, Qingpeng Cai, Longbo Huang:
Multi-Path Policy Optimization. AAMAS 2020: 1001-1009
[c11]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/ijcai/PanCM0H20
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/ijcai/PanCM0H20
Ling Pan, Qingpeng Cai, Qi Meng, Wei Chen, Longbo Huang:
Reinforcement Learning with Dynamic Boltzmann Softmax Updates. IJCAI 2020: 1992-1998
[c10]
- view
  - electronic edition @ neurips.cc (open access)
  - details & citations
- export record
  dblp key:
  - conf/nips/PanCH20
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/nips/PanCH20
Ling Pan, Qingpeng Cai, Longbo Huang:
Softmax Deep Double Deterministic Policy Gradients. NeurIPS 2020
[i11]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2005-12206
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2005-12206
Jianxiong Wei, Anxiang Zeng, Yueqiu Wu, Peng Guo, Qingsong Hua, Qingpeng Cai:
Generator and Critic: A Deep Reinforcement Learning Approach for Slate Re-ranking in E-commerce. CoRR abs/2005.12206 (2020)
[i10]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2010-09177
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2010-09177
Ling Pan, Qingpeng Cai, Longbo Huang:
Softmax Deep Double Deterministic Policy Gradients. CoRR abs/2010.09177 (2020)

2010 – 2019

see FAQ

What is the meaning of the colors in the publication lists?

2019
[c9]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/aaai/PanCFTH19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/aaai/PanCFTH19
Ling Pan, Qingpeng Cai, Zhixuan Fang, Pingzhong Tang, Longbo Huang:
A Deep Reinforcement Learning Framework for Rebalancing Dockless Bike Sharing Systems. AAAI 2019: 1393-1400
[c8]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/aaai/PanCZPDHHT19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/aaai/PanCZPDHHT19
Feiyang Pan, Qingpeng Cai, Anxiang Zeng, Chun-Xiang Pan, Qing Da, Hua-Lin He, Qing He, Pingzhong Tang:
Policy Optimization with Model-Based Explorations. AAAI 2019: 4675-4682
[c7]
- view
  authority control:
- export record
  dblp key:
  - conf/www/PanCTZH19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/www/PanCTZH19
Feiyang Pan, Qingpeng Cai, Pingzhong Tang, Fuzhen Zhuang, Qing He:
Policy Gradients for Contextual Recommendations. WWW 2019: 1421-1431
[i9]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-1903-05926
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1903-05926
Ling Pan, Qingpeng Cai, Qi Meng, Wei Chen, Longbo Huang, Tie-Yan Liu:
Reinforcement Learning with Dynamic Boltzmann Softmax Updates. CoRR abs/1903.05926 (2019)
[i8]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-1906-06639
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1906-06639
Qingpeng Cai, Will Hang, Azalia Mirhoseini, George Tucker, Jingtao Wang, Wei Wei:
Reinforcement Learning Driven Heuristic Optimization. CoRR abs/1906.06639 (2019)
[i7]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-1909-03939
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1909-03939
Qingpeng Cai, Ling Pan, Pingzhong Tang:
Deterministic Value-Policy Gradients. CoRR abs/1909.03939 (2019)
[i6]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-1911-04207
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1911-04207
Ling Pan, Qingpeng Cai, Longbo Huang:
Multi-Path Policy Optimization. CoRR abs/1911.04207 (2019)
2018
[c6]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/aaai/CaiFTZ18
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/aaai/CaiFTZ18
Qingpeng Cai, Aris Filos-Ratsikas, Pingzhong Tang, Yiwei Zhang:
Reinforcement Mechanism Design for Fraudulent Behaviour in e-Commerce. AAAI 2018: 957-964
[c5]
- view
  - electronic edition @ acm.org
  - details & citations
- export record
  dblp key:
  - conf/atal/CaiTZ18
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/atal/CaiTZ18
Qingpeng Cai, Pingzhong Tang, Yulong Zeng:
Ranking Mechanism Design for Price-setting Agents in E-commerce. AAMAS 2018: 1504-1512
[c4]
- view
  authority control:
- export record
  dblp key:
  - conf/www/CaiFTZ18
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/www/CaiFTZ18
Qingpeng Cai, Aris Filos-Ratsikas, Pingzhong Tang, Yiwei Zhang:
Reinforcement Mechanism Design for e-commerce. WWW 2018: 1339-1348
[i5]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-1802-04162
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1802-04162
Feiyang Pan, Qingpeng Cai, Pingzhong Tang, Fuzhen Zhuang, Qing He:
Policy Gradients for Contextual Bandits. CoRR abs/1802.04162 (2018)
[i4]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-1802-04592
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1802-04592
Ling Pan, Qingpeng Cai, Zhixuan Fang, Pingzhong Tang, Longbo Huang:
Rebalancing Dockless Bike Sharing Systems. CoRR abs/1802.04592 (2018)
[i3]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-1807-03708
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1807-03708
Qingpeng Cai, Ling Pan, Pingzhong Tang:
Generalized deterministic policy gradient algorithms. CoRR abs/1807.03708 (2018)
[i2]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-1811-07350
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1811-07350
Feiyang Pan, Qingpeng Cai, Anxiang Zeng, Chun-Xiang Pan, Qing Da, Hua-Lin He, Qing He, Pingzhong Tang:
Policy Optimization with Model-based Explorations. CoRR abs/1811.07350 (2018)
2017
[c3]
- view
  - electronic edition @ acm.org
  - details & citations
- export record
  dblp key:
  - conf/atal/LiuCZ17
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/atal/LiuCZ17
Chang Liu, Qingpeng Cai, Yukui Zhang:
Multi-armed Bandit Mechanism with Private Histories. AAMAS 2017: 1607-1609
[i1]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-1708-07607
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1708-07607
Qingpeng Cai, Aris Filos-Ratsikas, Pingzhong Tang, Yiwei Zhang:
Reinforcement Mechanism Design for e-commerce. CoRR abs/1708.07607 (2017)
2016
[c2]
- view
  - electronic edition @ ijcai.org (open access)
  - details & citations
- export record
  dblp key:
  - conf/ijcai/CaiFT16
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/ijcai/CaiFT16
Qingpeng Cai, Aris Filos-Ratsikas, Pingzhong Tang:
Facility Location with Minimax Envy. IJCAI 2016: 137-143
[c1]
- view
  authority control:
- export record
  dblp key:
  - conf/recsys/CaiFLT16
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/recsys/CaiFLT16
Qingpeng Cai, Aris Filos-Ratsikas, Chang Liu, Pingzhong Tang:
Mechanism Design for Personalized Recommender Systems. RecSys 2016: 159-166

Coauthor Index

see FAQ

manage site settings

To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.