default search action

combined dblp search
author search
venue search
publication search

ask others

Huizhuo Yuan

> Home > Persons

Person information

Refine list

refinements active!

zoomed in on ?? of ?? records

view refined list in

export refined list as

showing all ?? records

2020 – today

see FAQ

What is the meaning of the colors in the publication lists?

2025
[c11]
- view
  - electronic edition @ openreview.net (open access)
  - details & citations
- export record
  dblp key:
  - conf/iclr/WuSYJYG25
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iclr/WuSYJYG25
Yue Wu, Zhiqing Sun, Huizhuo Yuan, Kaixuan Ji, Yiming Yang, Quanquan Gu:
Self-Play Preference Optimization for Language Model Alignment. ICLR 2025
[i15]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2501-06425
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2501-06425
Yifan Zhang, Yifeng Liu, Huizhuo Yuan, Zhen Qin, Yang Yuan, Quanquan Gu, Andrew Chi-Chih Yao:
Tensor Product Attention Is All You Need. CoRR abs/2501.06425 (2025)
[i14]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2503-00030
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2503-00030
Xiaohang Tang, Sangwoong Yoon, Seongho Son, Huizhuo Yuan, Quanquan Gu, Ilija Bogunovic:
Game-Theoretic Regularized Self-Play Alignment of Large Language Models. CoRR abs/2503.00030 (2025)
[i13]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2505-17478
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2505-17478
Yuning Shen, Lihao Wang, Huizhuo Yuan, Yan Wang, Bangji Yang, Quanquan Gu:
Simultaneous Modeling of Protein Conformation and Dynamics via Autoregression. CoRR abs/2505.17478 (2025)
[i12]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2505-17508
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2505-17508
Yifan Zhang, Yifeng Liu, Huizhuo Yuan, Yang Yuan, Quanquan Gu, Andrew C. Yao:
On the Design of KL-Regularized Policy Gradient Algorithms for LLM Reasoning. CoRR abs/2505.17508 (2025)
2024
[c10]
- view
  - electronic edition @ openreview.net (open access)
  - details & citations
- export record
  dblp key:
  - conf/icml/ChenDYJG24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icml/ChenDYJG24
Zixiang Chen, Yihe Deng, Huizhuo Yuan, Kaixuan Ji, Quanquan Gu:
Self-Play Fine-Tuning Converts Weak Language Models to Strong Language Models. ICML 2024
[c9]
- view
  - electronic edition @ openreview.net (open access)
  - details & citations
- export record
  dblp key:
  - conf/icml/WangWSWYWG24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icml/WangWSWYWG24
Yan Wang, Lihao Wang, Yuning Shen, Yiqun Wang, Huizhuo Yuan, Yue Wu, Quanquan Gu:
Protein Conformation Generation via Force-Guided SE(3) Diffusion Models. ICML 2024
[c8]
- view
  - electronic edition @ nips.cc (open access)
  - details & citations
- export record
  dblp key:
  - conf/nips/ChenYLKZG24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/nips/ChenYLKZG24
Zixiang Chen, Huizhuo Yuan, Yongqian Li, Yiwen Kou, Junkai Zhang, Quanquan Gu:
Fast Sampling via Discrete Non-Markov Diffusion Models with Predetermined Transition Time. NeurIPS 2024
[c7]
- view
  - electronic edition @ nips.cc (open access)
  - details & citations
- export record
  dblp key:
  - conf/nips/YuanCJG24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/nips/YuanCJG24
Huizhuo Yuan, Zixiang Chen, Kaixuan Ji, Quanquan Gu:
Self-Play Fine-tuning of Diffusion Models for Text-to-image Generation. NeurIPS 2024
[i11]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2401-01335
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2401-01335
Zixiang Chen, Yihe Deng, Huizhuo Yuan, Kaixuan Ji, Quanquan Gu:
Self-Play Fine-Tuning Converts Weak Language Models to Strong Language Models. CoRR abs/2401.01335 (2024)
[i10]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2402-10210
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2402-10210
Huizhuo Yuan, Zixiang Chen, Kaixuan Ji, Quanquan Gu:
Self-Play Fine-Tuning of Diffusion Models for Text-to-Image Generation. CoRR abs/2402.10210 (2024)
[i9]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2403-14088
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2403-14088
Yan Wang, Lihao Wang, Yuning Shen, Yiqun Wang, Huizhuo Yuan, Yue Wu, Quanquan Gu:
Protein Conformation Generation via Force-Guided SE(3) Diffusion Models. CoRR abs/2403.14088 (2024)
[i8]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2405-00675
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2405-00675
Yue Wu, Zhiqing Sun, Huizhuo Yuan, Kaixuan Ji, Yiming Yang, Quanquan Gu:
Self-Play Preference Optimization for Language Model Alignment. CoRR abs/2405.00675 (2024)
[i7]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2410-06293
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2410-06293
Jiafan He, Huizhuo Yuan, Quanquan Gu:
Accelerated Preference Optimization for Large Language Model Alignment. CoRR abs/2410.06293 (2024)
[i6]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2411-10438
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2411-10438
Huizhuo Yuan, Yifeng Liu, Shuang Wu, Xun Zhou, Quanquan Gu:
MARS: Unleashing the Power of Variance Reduction for Training Large Models. CoRR abs/2411.10438 (2024)
[i5]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2412-19444
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2412-19444
Yuanzhe Tao, Huizhuo Yuan, Xun Zhou, Yuan Cao, Quanquan Gu:
Towards Simple and Provable Parameter-Free Adaptive Gradient Methods. CoRR abs/2412.19444 (2024)
2023
[c6]
- view
  - electronic edition @ openreview.net (open access)
  - details & citations
- export record
  dblp key:
  - conf/iclr/ChenLYGJ23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iclr/ChenLYGJ23
Zixiang Chen, Chris Junchi Li, Huizhuo Yuan, Quanquan Gu, Michael I. Jordan:
A General Framework for Sample-Efficient Function Approximation in Reinforcement Learning. ICLR 2023
[c5]
- view
  - electronic edition @ mlr.press (open access)
  - details & citations
- export record
  dblp key:
  - conf/icml/LiYGGJ23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icml/LiYGGJ23
Chris Junchi Li, Huizhuo Yuan, Gauthier Gidel, Quanquan Gu, Michael I. Jordan:
Nesterov Meets Optimism: Rate-Optimal Separable Minimax Optimization. ICML 2023: 20351-20383
[i4]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2312-09193
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2312-09193
Zixiang Chen, Huizhuo Yuan, Yongqian Li, Yiwen Kou, Junkai Zhang, Quanquan Gu:
Fast Sampling via De-randomization for Discrete Diffusion Models. CoRR abs/2312.09193 (2023)
2020
[i3]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2003-03532
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2003-03532
Xiang Zhou, Huizhuo Yuan, Chris Junchi Li, Qingyun Sun:
Stochastic Modified Equations for Continuous Limit of Stochastic ADMM. CoRR abs/2003.03532 (2020)
[i2]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2003-04302
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2003-04302
Huizhuo Yuan, Xiangru Lian, Ji Liu, Yuren Zhou:
Stochastic Recursive Momentum for Policy Gradient Methods. CoRR abs/2003.04302 (2020)

2010 – 2019

see FAQ

What is the meaning of the colors in the publication lists?

2019
[c4]
- view
  authority control:
- export record
  dblp key:
  - conf/cig/ChenYL19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/cig/ChenYL19
Yu Chen, Huizhuo Yuan, Yujun Li:
Object-Oriented State Abstraction in Reinforcement Learning for Video Games. CoG 2019: 1-4
[c3]
- view
  - electronic edition @ mlr.press (open access)
  - details & citations
- export record
  dblp key:
  - conf/icml/YuanZLS19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icml/YuanZLS19
Huizhuo Yuan, Yuren Zhou, Chris Junchi Li, Qingyun Sun:
Differential Inclusions for Modeling Nonsmooth ADMM Variants: A Continuous Limit Theory. ICML 2019: 7232-7241
[c2]
- view
- export record
  dblp key:
  - conf/nips/YuanLLLH19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/nips/YuanLLLH19
Huizhuo Yuan, Xiangru Lian, Chris Junchi Li, Ji Liu, Wenqing Hu:
Efficient Smooth Non-Convex Stochastic Compositional Optimization via Stochastic Recursive Gradient Descent. NeurIPS 2019: 6926-6935
[i1]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-1912-13515
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1912-13515
Huizhuo Yuan, Xiangru Lian, Ji Liu:
Stochastic Recursive Variance Reduction for Efficient Smooth Non-Convex Compositional Optimization. CoRR abs/1912.13515 (2019)
2018
[c1]
- view
  authority control:
- export record
  dblp key:
  - conf/isbi/YuanJZ18
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/isbi/YuanJZ18
Huizhuo Yuan, Jinzhu Jia, Zhanxing Zhu:
SIPID: A deep learning framework for sinogram interpolation and image denoising in low-dose CT reconstruction. ISBI 2018: 1521-1524

Coauthor Index

see FAQ

manage site settings

To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.