


Ziwei Ji
This is just a disambiguation page, and is not intended to be the bibliography of an actual person. The links to all actual bibliographies of persons of the same or a similar name can be found below. Any publication listed on this page has not been assigned to an actual author yet. If you know the true author of one of the publications listed below, you are welcome to contact us.
Other persons with the same name
- Ziwei Ji 0001
— Hong Kong University of Science and Technology, Center for Artificial Intelligence Research (CAiRE), Hong Kong
2020 – today
- 2025
- [c17]Sangmin Bae, Adam Fisch, Hrayr Harutyunyan, Ziwei Ji, Seungyeon Kim, Tal Schuster:
Relaxed Recursive Transformers: Effective Parameter Sharing with Layer-wise LoRA. ICLR 2025
- [i21]Sangmin Bae, Yujin Kim, Reza Bayat, Sungnyun Kim, Jiyoun Ha, Tal Schuster, Adam Fisch, Hrayr Harutyunyan, Ziwei Ji, Aaron Courville, Se-Young Yun:
Mixture-of-Recursions: Learning Dynamic Recursive Depths for Adaptive Token-Level Computation. CoRR abs/2507.10524 (2025)
- 2024
- [c16]Sachin Goyal, Ziwei Ji, Ankit Singh Rawat, Aditya Krishna Menon, Sanjiv Kumar, Vaishnavh Nagarajan:
Think before you speak: Training Language Models With Pause Tokens. ICLR 2024
- [i20]Ziwei Ji, Himanshu Jain, Andreas Veit, Sashank J. Reddi, Sadeep Jayasumana, Ankit Singh Rawat, Aditya Krishna Menon, Felix X. Yu, Sanjiv Kumar:
Efficient Document Ranking with Learnable Late Interactions. CoRR abs/2406.17968 (2024)
- [i19]Sangmin Bae, Adam Fisch, Hrayr Harutyunyan, Ziwei Ji, Seungyeon Kim, Tal Schuster:
Relaxed Recursive Transformers: Effective Parameter Sharing with Layer-wise LoRA. CoRR abs/2410.20672 (2024)
- 2023
- [i18]Samy Jelassi, Boris Hanin, Ziwei Ji, Sashank J. Reddi, Srinadh Bhojanapalli, Sanjiv Kumar:
Depth Dependence of μP Learning Rates in ReLU MLPs. CoRR abs/2305.07810 (2023)
- [i17]Sachin Goyal, Ziwei Ji, Ankit Singh Rawat, Aditya Krishna Menon, Sanjiv Kumar, Vaishnavh Nagarajan:
Think before you speak: Training Language Models With Pause Tokens. CoRR abs/2310.02226 (2023)
- 2022
- [c15]Yuzheng Hu, Ziwei Ji, Matus Telgarsky:
Actor-critic is implicitly biased towards high entropy optimal policies. ICLR 2022
- [c14]Ziwei Ji, Kwangjun Ahn, Pranjal Awasthi, Satyen Kale, Stefani Karp:
Agnostic Learnability of Halfspaces via Logistic Loss. ICML 2022: 10068-10103
- [c13]Kwangjun Ahn, Prateek Jain, Ziwei Ji, Satyen Kale, Praneeth Netrapalli, Gil I. Shamir:
Reproducibility in Optimization: Theoretical Framework and Limits. NeurIPS 2022
- [i16]Ziwei Ji, Kwangjun Ahn, Pranjal Awasthi, Satyen Kale, Stefani Karp:
Agnostic Learnability of Halfspaces via Logistic Loss. CoRR abs/2201.13419 (2022)
- [i15]Kwangjun Ahn, Prateek Jain, Ziwei Ji, Satyen Kale, Praneeth Netrapalli, Gil I. Shamir:
Reproducibility in Optimization: Theoretical Framework and Limits. CoRR abs/2202.04598 (2022)
- [i14]Miroslav Dudík, Ziwei Ji, Robert E. Schapire, Matus Telgarsky:
Convex Analysis at Infinity: An Introduction to Astral Space. CoRR abs/2205.03260 (2022)
- 2021
- [c12]Ziwei Ji, Matus Telgarsky:
Characterizing the implicit bias via a primal-dual analysis. ALT 2021: 772-804
- [c11]Daniel Hsu, Ziwei Ji, Matus Telgarsky, Lan Wang:
Generalization bounds via distillation. ICLR 2021
- [c10]Ziwei Ji, Nathan Srebro, Matus Telgarsky:
Fast margin maximization via dual acceleration. ICML 2021: 4860-4869
- [c9]Ziwei Ji, Justin D. Li, Matus Telgarsky:
Early-stopped neural networks are consistent. NeurIPS 2021: 1805-1817
- [i13]Daniel Hsu, Ziwei Ji, Matus Telgarsky, Lan Wang:
Generalization bounds via distillation. CoRR abs/2104.05641 (2021)
- [i12]Ziwei Ji, Justin D. Li, Matus Telgarsky:
Early-stopped neural networks are consistent. CoRR abs/2106.05932 (2021)
- [i11]Ziwei Ji, Nathan Srebro, Matus Telgarsky:
Fast Margin Maximization via Dual Acceleration. CoRR abs/2107.00595 (2021)
- [i10]Yuzheng Hu, Ziwei Ji, Matus Telgarsky:
Actor-critic is implicitly biased towards high entropy optimal policies. CoRR abs/2110.11280 (2021)
- 2020
- [c8]Ziwei Ji, Miroslav Dudík, Robert E. Schapire, Matus Telgarsky:
Gradient descent follows the regularization path for general losses. COLT 2020: 2109-2136
- [c7]Ziwei Ji, Matus Telgarsky:
Polylogarithmic width suffices for gradient descent to achieve arbitrarily small test error with shallow ReLU networks. ICLR 2020
- [c6]Ziwei Ji, Matus Telgarsky, Ruicheng Xian:
Neural tangent kernels, transportation mappings, and universal approximation. ICLR 2020
- [c5]Ziwei Ji, Matus Telgarsky:
Directional convergence and alignment in deep learning. NeurIPS 2020
- [i9]Ziwei Ji, Matus Telgarsky:
Directional convergence and alignment in deep learning. CoRR abs/2006.06657 (2020)
- [i8]Ziwei Ji, Miroslav Dudík, Robert E. Schapire, Matus Telgarsky:
Gradient descent follows the regularization path for general losses. CoRR abs/2006.11226 (2020)
2010 – 2019
- 2019
- [c4]Ziwei Ji, Matus Telgarsky:
The implicit bias of gradient descent on nonseparable data. COLT 2019: 1772-1798
- [c3]Ziwei Ji, Matus Telgarsky:
Gradient descent aligns the layers of deep linear networks. ICLR (Poster) 2019
- [i7]Ziwei Ji, Matus Telgarsky:
A refined primal-dual analysis of the implicit bias. CoRR abs/1906.04540 (2019)
- [i6]Ziwei Ji, Matus Telgarsky:
Polylogarithmic width suffices for gradient descent to achieve arbitrarily small test error with shallow ReLU networks. CoRR abs/1909.12292 (2019)
- [i5]Ziwei Ji, Matus Telgarsky, Ruicheng Xian:
Neural tangent kernels, transportation mappings, and universal approximation. CoRR abs/1910.06956 (2019)
- 2018
- [c2]Ziwei Ji, Ruta Mehta, Matus Telgarsky:
Social Welfare and Profit Maximization from Revealed Preferences. WINE 2018: 264-281
- [i4]Ziwei Ji, Matus Telgarsky:
Risk and parameter convergence of logistic regression. CoRR abs/1803.07300 (2018)
- [i3]Ziwei Ji, Matus Telgarsky:
Gradient descent aligns the layers of deep linear networks. CoRR abs/1810.02032 (2018)
- 2017
- [i2]Ziwei Ji, Ruta Mehta, Matus Telgarsky:
Social Welfare and Profit Maximization from Revealed Preferences. CoRR abs/1711.02211 (2017)
- [i1]Qi Zhu, Hongwei Ng, Liyuan Liu, Ziwei Ji, Bingjie Jiang, Jiaming Shen, Huan Gui:
Wikidata Vandalism Detection - The Loganberry Vandalism Detector at WSDM Cup 2017. CoRR abs/1712.06922 (2017)
- 2016
- [c1]Yu Chen, Xiaotie Deng, Ziwei Ji, Chao Liao:
The Beachcombers' Problem: Walking and Searching from an Inner Point of a Line. LATA 2016: 270-282
last updated on 2025-08-27 20:30 CEST by the dblp team
all metadata released as open data under CC0 1.0 license