Wei Xiong 0015
Person information
- affiliation: University of Illinois Urbana-Champaign, Department of Computer Science, Urbana, IL, USA
- affiliation (former): Hong Kong University of Science and Technology, Hong Kong
Other persons with the same name
- Wei Xiong — disambiguation page
- Wei Xiong 0001 (aka: Xiong Wei 0001) — Institute for Infocomm Research, A*STAR, Singapore
- Wei Xiong 0002 — Xinjiang University, College of Mathematics and System Sciences, Urumqi, China
- Wei Xiong 0003 — University of Florida, Department of Agricultural and Biological Engineering, Gainesville, FL, USA
- Wei Xiong 0004 — Hubei University of Technology, Hubei Collaborative Innovation Center for High-efficient Utilization of Solar Energy, Wuhan, China (and 1 more)
- Wei Xiong 0005 — Sun Yat-Sen University, Department of Philosophy, Institute of Logic and Cognition, Guangzhou, China
- Wei Xiong 0006 — Shanghai Maritime University, College of Information Engineering, China
- Wei Xiong 0007 — Durham University, UK
- Wei Xiong 0008 — University of Rochester, Department of Computer Science, NY, USA (and 1 more)
- Wei Xiong 0009 — Hong Kong University of Science and Technology, Clear Water Bay, Kowloon, Hong Kong
- Wei Xiong 0010 — National University of Defense Technology, Changsha, China
- Wei Xiong 0011 — Case Western Reserve University School of Medicine, Cleveland, Ohio, USA
- Wei Xiong 0012 — Heidelberg University, Institute for Theoretical Physics, Heidelberg, Germany
- Wei Xiong 0013 — University at Albany - SUNY, NY, USA
- Wei Xiong 0014 — The Ohio State University, Columbus, OH, USA
- Wei Xiong 0016 — Tsinghua University, Beijing, China
- 2024
- [c17] Haoxiang Wang, Yong Lin, Wei Xiong, Rui Yang, Shizhe Diao, Shuang Qiu, Han Zhao, Tong Zhang: Arithmetic Control of LLMs for Diverse User Preferences: Directional Preference Alignment with Multi-Objective Rewards. ACL (1) 2024: 8642-8655
- [c16] Renjie Pi, Tianyang Han, Wei Xiong, Jipeng Zhang, Runtao Liu, Rui Pan, Tong Zhang: Strengthening Multimodal Large Language Model with Bootstrapped Preference Optimization. ECCV (33) 2024: 382-398
- [c15] Yong Lin, Hangyu Lin, Wei Xiong, Shizhe Diao, Jianmeng Liu, Jipeng Zhang, Rui Pan, Haoxiang Wang, Wenbin Hu, Hanning Zhang, Hanze Dong, Renjie Pi, Han Zhao, Nan Jiang, Heng Ji, Yuan Yao, Tong Zhang: Mitigating the Alignment Tax of RLHF. EMNLP 2024: 580-606
- [c14] Haoxiang Wang, Wei Xiong, Tengyang Xie, Han Zhao, Tong Zhang: Interpretable Preferences via Multi-Objective Reward Modeling and Mixture-of-Experts. EMNLP (Findings) 2024: 10582-10592
- [c13] Wei Xiong, Hanze Dong, Chenlu Ye, Ziqi Wang, Han Zhong, Heng Ji, Nan Jiang, Tong Zhang: Iterative Preference Learning from Human Feedback: Bridging Theory and Practice for RLHF under KL-constraint. ICML 2024
- [c12] Shizhe Diao, Rui Pan, Hanze Dong, Kashun Shum, Jipeng Zhang, Wei Xiong, Tong Zhang: LMFlow: An Extensible Toolkit for Finetuning and Inference of Large Foundation Models. NAACL (Demonstrations) 2024: 116-127
- [i26] Chenlu Ye, Wei Xiong, Yuheng Zhang, Nan Jiang, Tong Zhang: A Theoretical Analysis of Nash Learning from Human Feedback under General KL-Regularized Preference. CoRR abs/2402.07314 (2024)
- [i25] Haoxiang Wang, Yong Lin, Wei Xiong, Rui Yang, Shizhe Diao, Shuang Qiu, Han Zhao, Tong Zhang: Arithmetic Control of LLMs for Diverse User Preferences: Directional Preference Alignment with Multi-Objective Rewards. CoRR abs/2402.18571 (2024)
- [i24] Renjie Pi, Tianyang Han, Wei Xiong, Jipeng Zhang, Runtao Liu, Rui Pan, Tong Zhang: Strengthening Multimodal Large Language Model with Bootstrapped Preference Optimization. CoRR abs/2403.08730 (2024)
- [i23] Han Zhong, Guhao Feng, Wei Xiong, Li Zhao, Di He, Jiang Bian, Liwei Wang: DPO Meets PPO: Reinforced Token Optimization for RLHF. CoRR abs/2404.18922 (2024)
- [i22] Hanze Dong, Wei Xiong, Bo Pang, Haoxiang Wang, Han Zhao, Yingbo Zhou, Nan Jiang, Doyen Sahoo, Caiming Xiong, Tong Zhang: RLHF Workflow: From Reward Modeling to Online RLHF. CoRR abs/2405.07863 (2024)
- [i21] Haoxiang Wang, Wei Xiong, Tengyang Xie, Han Zhao, Tong Zhang: Interpretable Preferences via Multi-Objective Reward Modeling and Mixture-of-Experts. CoRR abs/2406.12845 (2024)
- [i20] Wei Xiong, Chengshuai Shi, Jiaming Shen, Aviv Rosenberg, Zhen Qin, Daniele Calandriello, Misha Khalman, Rishabh Joshi, Bilal Piot, Mohammad Saleh, Chi Jin, Tong Zhang, Tianqi Liu: Building Math Agents with Multi-Turn Iterative Preference Learning. CoRR abs/2409.02392 (2024)
- [i19] Xuanchang Zhang, Wei Xiong, Lichang Chen, Tianyi Zhou, Heng Huang, Tong Zhang: From Lists to Emojis: How Format Bias Affects Model Alignment. CoRR abs/2409.11704 (2024)
- [i18] Tianqi Liu, Wei Xiong, Jie Ren, Lichang Chen, Junru Wu, Rishabh Joshi, Yang Gao, Jiaming Shen, Zhen Qin, Tianhe Yu, Daniel Sohn, Anastasiia Makarova, Jeremiah Z. Liu, Yuan Liu, Bilal Piot, Abe Ittycheriah, Aviral Kumar, Mohammad Saleh: RRM: Robust Reward Model Training Mitigates Reward Hacking. CoRR abs/2409.13156 (2024)
- 2023
- [j2] Hanze Dong, Wei Xiong, Deepanshu Goyal, Yihan Zhang, Winnie Chow, Rui Pan, Shizhe Diao, Jipeng Zhang, Kashun Shum, Tong Zhang: RAFT: Reward rAnked FineTuning for Generative Foundation Model Alignment. Trans. Mach. Learn. Res. 2023 (2023)
- [j1] Chengshuai Shi, Wei Xiong, Cong Shen, Jing Yang: Reward Teaching for Federated Multiarmed Bandits. IEEE Trans. Signal Process. 71: 4407-4422 (2023)
- [c11] Wei Xiong, Han Zhong, Chengshuai Shi, Cong Shen, Liwei Wang, Tong Zhang: Nearly Minimax Optimal Offline Reinforcement Learning with Linear Function Approximation: Single-Agent MDP and Markov Game. ICLR 2023
- [c10] Chengshuai Shi, Wei Xiong, Cong Shen, Jing Yang: Provably Efficient Offline Reinforcement Learning with Perturbed Data Sources. ICML 2023: 31353-31388
- [c9] Chenlu Ye, Wei Xiong, Quanquan Gu, Tong Zhang: Corruption-Robust Algorithms with Uncertainty Weighting for Nonlinear Contextual Bandits and Markov Decision Processes. ICML 2023: 39834-39863
- [c8] Chengshuai Shi, Wei Xiong, Cong Shen, Jing Yang: Reward Teaching for Federated Multi-armed Bandits. ISIT 2023: 1454-1459
- [c7] Zhihan Liu, Miao Lu, Wei Xiong, Han Zhong, Hao Hu, Shenao Zhang, Sirui Zheng, Zhuoran Yang, Zhaoran Wang: Maximize to Explore: One Objective Function Fusing Estimation, Planning, and Exploration. NeurIPS 2023
- [i17] Hanze Dong, Wei Xiong, Deepanshu Goyal, Rui Pan, Shizhe Diao, Jipeng Zhang, Kashun Shum, Tong Zhang: RAFT: Reward rAnked FineTuning for Generative Foundation Model Alignment. CoRR abs/2304.06767 (2023)
- [i16] Chengshuai Shi, Wei Xiong, Cong Shen, Jing Yang: Reward Teaching for Federated Multi-armed Bandits. CoRR abs/2305.02441 (2023)
- [i15] Zhihan Liu, Miao Lu, Wei Xiong, Han Zhong, Hao Hu, Shenao Zhang, Sirui Zheng, Zhuoran Yang, Zhaoran Wang: One Objective to Rule Them All: A Maximization Objective Fusing Estimation and Planning for Exploration. CoRR abs/2305.18258 (2023)
- [i14] Chengshuai Shi, Wei Xiong, Cong Shen, Jing Yang: Provably Efficient Offline Reinforcement Learning with Perturbed Data Sources. CoRR abs/2306.08364 (2023)
- [i13] Shizhe Diao, Rui Pan, Hanze Dong, Kashun Shum, Jipeng Zhang, Wei Xiong, Tong Zhang: LMFlow: An Extensible Toolkit for Finetuning and Inference of Large Foundation Models. CoRR abs/2306.12420 (2023)
- [i12] Yong Lin, Hangyu Lin, Wei Xiong, Shizhe Diao, Jianmeng Liu, Jipeng Zhang, Rui Pan, Haoxiang Wang, Wenbin Hu, Hanning Zhang, Hanze Dong, Renjie Pi, Han Zhao, Nan Jiang, Yuan Yao, Tong Zhang: Mitigating the Alignment Tax of RLHF. CoRR abs/2309.06256 (2023)
- [i11] Wei Xiong, Hanze Dong, Chenlu Ye, Han Zhong, Nan Jiang, Tong Zhang: Gibbs Sampling from Human Feedback: A Provable KL-constrained Framework for RLHF. CoRR abs/2312.11456 (2023)
- 2022
- [c6] Wei Xiong, Han Zhong, Chengshuai Shi, Cong Shen, Tong Zhang: A Self-Play Posterior Sampling Algorithm for Zero-Sum Markov Games. ICML 2022: 24496-24523
- [c5] Han Zhong, Wei Xiong, Jiyuan Tan, Liwei Wang, Tong Zhang, Zhaoran Wang, Zhuoran Yang: Pessimistic Minimax Value Iteration: Provably Efficient Equilibrium Learning from Offline Datasets. ICML 2022: 27117-27142
- [i10] Han Zhong, Wei Xiong, Jiyuan Tan, Liwei Wang, Tong Zhang, Zhaoran Wang, Zhuoran Yang: Pessimistic Minimax Value Iteration: Provably Efficient Equilibrium Learning from Offline Datasets. CoRR abs/2202.07511 (2022)
- [i9] Wei Xiong, Han Zhong, Chengshuai Shi, Cong Shen, Liwei Wang, Tong Zhang: Nearly Minimax Optimal Offline Reinforcement Learning with Linear Function Approximation: Single-Agent MDP and Markov Game. CoRR abs/2205.15512 (2022)
- [i8] Wei Xiong, Han Zhong, Chengshuai Shi, Cong Shen, Tong Zhang: A Self-Play Posterior Sampling Algorithm for Zero-Sum Markov Games. CoRR abs/2210.01907 (2022)
- [i7] Han Zhong, Wei Xiong, Sirui Zheng, Liwei Wang, Zhaoran Wang, Zhuoran Yang, Tong Zhang: GEC: A Unified Framework for Interactive Decision Making in MDP, POMDP, and Beyond. CoRR abs/2211.01962 (2022)
- [i6] Chenlu Ye, Wei Xiong, Quanquan Gu, Tong Zhang: Corruption-Robust Algorithms with Uncertainty Weighting for Nonlinear Contextual Bandits and Markov Decision Processes. CoRR abs/2212.05949 (2022)
- 2021
- [c4] Chengshuai Shi, Haifeng Xu, Wei Xiong, Cong Shen: (Almost) Free Incentivized Exploration from Decentralized Learning Agents. NeurIPS 2021: 560-571
- [c3] Pushi Zhang, Xiaoyu Chen, Li Zhao, Wei Xiong, Tao Qin, Tie-Yan Liu: Distributional Reinforcement Learning for Multi-Dimensional Reward Functions. NeurIPS 2021: 1519-1529
- [c2] Chengshuai Shi, Wei Xiong, Cong Shen, Jing Yang: Heterogeneous Multi-player Multi-armed Bandits: Closing the Gap and Generalization. NeurIPS 2021: 22392-22404
- [i5] Pushi Zhang, Xiaoyu Chen, Li Zhao, Wei Xiong, Tao Qin, Tie-Yan Liu: Distributional Reinforcement Learning for Multi-Dimensional Reward Functions. CoRR abs/2110.13578 (2021)
- [i4] Chengshuai Shi, Wei Xiong, Cong Shen, Jing Yang: Heterogeneous Multi-player Multi-armed Bandits: Closing the Gap and Generalization. CoRR abs/2110.14622 (2021)
- [i3] Chengshuai Shi, Haifeng Xu, Wei Xiong, Cong Shen: (Almost) Free Incentivized Exploration from Decentralized Learning Agents. CoRR abs/2110.14628 (2021)
- 2020
- [c1] Chengshuai Shi, Wei Xiong, Cong Shen, Jing Yang: Decentralized Multi-player Multi-armed Bandits with No Collision Information. AISTATS 2020: 1519-1528
- [i2] Chengshuai Shi, Wei Xiong, Cong Shen, Jing Yang: Decentralized Multi-player Multi-armed Bandits with No Collision Information. CoRR abs/2003.00162 (2020)
- [i1] Haishan Ye, Wei Xiong, Tong Zhang: PMGT-VR: A decentralized proximal-gradient algorithmic framework with variance reduction. CoRR abs/2012.15010 (2020)
last updated on 2025-01-09 19:26 CET by the dblp team
all metadata released as open data under CC0 1.0 license