Showing 1–50 of 717 results for author: Wang, Z

Searching in archive stat.
  1. arXiv:2408.14108  [pdf, other]

    stat.AP

    Evaluating the effectiveness of public policies on COVID-19 containment: A PSM-DID approach

    Authors: Zihan Wang

    Abstract: The implementation of public policies is crucial in controlling the spread of COVID-19. However, the effectiveness of different policies can vary across different aspects of epidemic containment. Identifying the most effective policies is essential for providing informed recommendations for pandemic control. This paper examines the relationship between various public policy responses and their imp…

    Submitted 26 August, 2024; originally announced August 2024.

  2. arXiv:2408.13146  [pdf, other]

    stat.ML cs.LG

    Reproduction of scan B-statistic for kernel change-point detection algorithm

    Authors: Zihan Wang

    Abstract: Change-point detection has garnered significant attention due to its broad range of applications, including epidemic disease outbreaks, social network evolution, image analysis, and wireless communications. In an online setting, where new data samples arrive sequentially, it is crucial to continuously test whether these samples originate from a different distribution. Ideally, the detection algori…

    Submitted 23 August, 2024; originally announced August 2024.

  3. arXiv:2408.06612  [pdf, ps, other]

    stat.ME

    Double Robust high dimensional alpha test for linear factor pricing model

    Authors: Ping Zhao, Long Feng, Hongfei Wang, Zhaojun Wang

    Abstract: In this paper, we investigate alpha testing for high-dimensional linear factor pricing models. We propose a spatial sign-based max-type test to handle sparse alternative cases. Additionally, we prove that this test is asymptotically independent of the spatial-sign-based sum-type test proposed by Liu et al. (2023). Based on this result, we introduce a Cauchy Combination test procedure that combines…

    Submitted 12 August, 2024; originally announced August 2024.
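
For orientation, the Cauchy combination idea named in the abstract above can be sketched in its standard equal-weight form: each p-value is mapped to a Cauchy variate, the variates are averaged, and the average is converted back to a p-value. This is a generic illustration and not the paper's procedure; the function name `cauchy_combination` and the two input p-values are hypothetical.

```python
import math

def cauchy_combination(p_values, weights=None):
    """Combine p-values with the standard Cauchy combination rule."""
    n = len(p_values)
    if weights is None:
        weights = [1.0 / n] * n  # equal weights summing to one
    # Under the null, tan((0.5 - p) * pi) is a standard Cauchy variate.
    stat = sum(w * math.tan((0.5 - p) * math.pi)
               for w, p in zip(weights, p_values))
    # Upper-tail probability of the standard Cauchy distribution.
    return 0.5 - math.atan(stat) / math.pi

# Hypothetical p-values from a max-type test and a sum-type test.
p_max_type = 0.01
p_sum_type = 0.20
combined = cauchy_combination([p_max_type, p_sum_type])
print(combined)
```

The combined p-value tends to be driven by the smaller input, which is why this rule is often used to merge a test that is powerful under sparse alternatives with one that is powerful under dense alternatives.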

  4. arXiv:2408.02045  [pdf, other]

    stat.ML cs.LG

    DNA-SE: Towards Deep Neural-Nets Assisted Semiparametric Estimation

    Authors: Qinshuo Liu, Zixin Wang, Xi-An Li, Xinyao Ji, Lei Zhang, Lin Liu, Zhonghua Liu

    Abstract: Semiparametric statistics play a pivotal role in a wide range of domains, including but not limited to missing data, causal inference, and transfer learning, to name a few. In many settings, semiparametric theory leads to (nearly) statistically optimal procedures that yet involve numerically solving Fredholm integral equations of the second kind. Traditional numerical methods, such as polynomial o…

    Submitted 4 August, 2024; originally announced August 2024.

    Comments: semiparametric statistics, missing data, causal inference, Fredholm integral equations of the second kind, bi-level optimization, deep learning, AI for science

  5. arXiv:2408.01062  [pdf, ps, other]

    stat.ML cs.LG math.PR math.ST

    Universality of kernel random matrices and kernel regression in the quadratic regime

    Authors: Parthe Pandit, Zhichao Wang, Yizhe Zhu

    Abstract: Kernel ridge regression (KRR) is a popular class of machine learning models that has become an important tool for understanding deep learning. Much of the focus has been on studying the proportional asymptotic regime, $n \asymp d$, where $n$ is the number of training samples and $d$ is the dimension of the dataset. In this regime, under certain conditions on the data distribution, the kernel rando…

    Submitted 2 August, 2024; originally announced August 2024.

    Comments: 75 pages

  6. arXiv:2408.01017  [pdf, ps, other]

    math.DS econ.EM stat.AP

    Application of Superconducting Technology in the Electricity Industry: A Game-Theoretic Analysis of Government Subsidy Policies and Power Company Equipment Upgrade Decisions

    Authors: Mingyang Li, Maoqin Yuan, Han Pengsihua, Yuan Yuan, Zejun Wang

    Abstract: This study investigates the potential impact of "LK-99," a novel material developed by a Korean research team, on the power equipment industry. Using evolutionary game theory, the interactions between governmental subsidies and technology adoption by power companies are modeled. A key innovation of this research is the introduction of sensitivity analyses concerning time delays and initial subsidy…

    Submitted 2 August, 2024; originally announced August 2024.

  7. arXiv:2407.15388  [pdf, ps, other]

    stat.AP q-fin.RM

    A new paradigm of mortality modeling via individual vitality dynamics

    Authors: Xiaobai Zhu, Kenneth Q. Zhou, Zijia Wang

    Abstract: The significance of mortality modeling extends across multiple research areas, including life insurance valuation, longevity risk management, life-cycle hypothesis, and retirement income planning. Despite the variety of existing approaches, such as mortality laws and factor-based models, they often lack compatibility or fail to meet specific research needs. To address these shortcomings, this stud…

    Submitted 23 July, 2024; v1 submitted 22 July, 2024; originally announced July 2024.

    Comments: 45 pages

  8. arXiv:2407.03082  [pdf, other]

    cs.LG stat.ML

    Stable Heterogeneous Treatment Effect Estimation across Out-of-Distribution Populations

    Authors: Yuling Zhang, Anpeng Wu, Kun Kuang, Liang Du, Zixun Sun, Zhi Wang

    Abstract: Heterogeneous treatment effect (HTE) estimation is vital for understanding the change of treatment effect across individuals or subgroups. Most existing HTE estimation methods focus on addressing selection bias induced by imbalanced distributions of confounders between treated and control units, but ignore distribution shifts across populations. Thereby, their applicability has been limited to the…

    Submitted 3 July, 2024; originally announced July 2024.

    Comments: Accepted by ICDE'2024

  9. arXiv:2407.02539  [pdf]

    cs.RO cs.AI cs.LG stat.ML

    Research on Autonomous Robots Navigation based on Reinforcement Learning

    Authors: Zixiang Wang, Hao Yan, Yining Wang, Zhengjia Xu, Zhuoyue Wang, Zhizhong Wu

    Abstract: Reinforcement learning continuously optimizes decision-making based on real-time feedback reward signals through continuous interaction with the environment, demonstrating strong adaptive and self-learning capabilities. In recent years, it has become one of the key methods to achieve autonomous navigation of robots. In this work, an autonomous robot navigation method based on reinforcement learnin…

    Submitted 14 August, 2024; v1 submitted 1 July, 2024; originally announced July 2024.

  10. arXiv:2407.02501  [pdf, other]

    cs.LG cs.CE eess.SY stat.AP

    Data-driven Power Flow Linearization: Theory

    Authors: Mengshuo Jia, Gabriela Hug, Ning Zhang, Zhaojian Wang, Yi Wang, Chongqing Kang

    Abstract: This two-part tutorial dives into the field of data-driven power flow linearization (DPFL), a domain gaining increased attention. DPFL stands out for its higher approximation accuracy, wide adaptability, and better ability to implicitly incorporate the latest system attributes. This renders DPFL a potentially superior option for managing the significant fluctuations from renewable energy sources,…

    Submitted 10 June, 2024; originally announced July 2024.

    Comments: 20 pages

  11. arXiv:2406.15575  [pdf, ps, other]

    cs.LG cs.AI stat.ML

    Sketch-GNN: Scalable Graph Neural Networks with Sublinear Training Complexity

    Authors: Mucong Ding, Tahseen Rabbani, Bang An, Evan Z Wang, Furong Huang

    Abstract: Graph Neural Networks (GNNs) are widely applied to graph learning problems such as node classification. When scaling up the underlying graphs of GNNs to a larger size, we are forced to either train on the complete graph and keep the full graph adjacency and node embeddings in memory (which is often infeasible) or mini-batch sample the graph (which results in exponentially growing computational com…

    Submitted 21 June, 2024; originally announced June 2024.

    Comments: NeurIPS 2022

  12. arXiv:2406.12017  [pdf, other]

    stat.ML cs.LG stat.CO

    Sparsity-Constraint Optimization via Splicing Iteration

    Authors: Zezhi Wang, Jin Zhu, Junxian Zhu, Borui Tang, Hongmei Lin, Xueqin Wang

    Abstract: Sparsity-constraint optimization has wide applicability in signal processing, statistics, and machine learning. Existing fast algorithms must burdensomely tune parameters, such as the step size or the implementation of precise stop criteria, which may be challenging to determine in practice. To address this issue, we develop an algorithm named Sparsity-Constraint Optimization via sPlicing itEratio…

    Submitted 17 June, 2024; originally announced June 2024.

    Comments: 34 pages

  13. arXiv:2406.09564  [pdf, other]

    cs.LG cs.AI cs.CE cs.CV stat.ML

    Towards Domain Adaptive Neural Contextual Bandits

    Authors: Ziyan Wang, Hao Wang

    Abstract: Contextual bandit algorithms are essential for solving real-world decision making problems. In practice, collecting a contextual bandit's feedback from different domains may involve different costs. For example, measuring drug reaction from mice (as a source domain) and humans (as a target domain). Unfortunately, adapting a contextual bandit algorithm from a source domain to a target domain with d…

    Submitted 13 June, 2024; originally announced June 2024.

  14. arXiv:2406.06893  [pdf, other]

    stat.ML cs.IT cs.LG

    Transformers Provably Learn Sparse Token Selection While Fully-Connected Nets Cannot

    Authors: Zixuan Wang, Stanley Wei, Daniel Hsu, Jason D. Lee

    Abstract: The transformer architecture has prevailed in various deep learning settings due to its exceptional capabilities to select and compose structural information. Motivated by these capabilities, Sanford et al. proposed the sparse token selection task, in which transformers excel while fully-connected networks (FCNs) fail in the worst case. Building upon that, we strengthen the FCN lower bound to an a…

    Submitted 10 June, 2024; originally announced June 2024.

  15. arXiv:2406.06833  [pdf, other]

    eess.SY stat.AP

    Data-driven Power Flow Linearization: Simulation

    Authors: Mengshuo Jia, Gabriela Hug, Ning Zhang, Zhaojian Wang, Yi Wang, Chongqing Kang

    Abstract: Building on the theoretical insights of Part I, this paper, as the second part of the tutorial, dives deeper into data-driven power flow linearization (DPFL), focusing on comprehensive numerical testing. The necessity of these simulations stems from the theoretical analysis's inherent limitations, particularly the challenge of identifying the differences in real-world performance among DPFL method…

    Submitted 10 June, 2024; originally announced June 2024.

    Comments: 26 pages

  16. arXiv:2406.05260  [pdf, other]

    stat.ML cs.LG

    Generative modeling of density regression through tree flows

    Authors: Zhuoqun Wang, Naoki Awaya, Li Ma

    Abstract: A common objective in the analysis of tabular data is estimating the conditional distribution (in contrast to only producing predictions) of a set of "outcome" variables given a set of "covariates", which is sometimes referred to as the "density regression" problem. Beyond estimation on the conditional distribution, the generative ability of drawing synthetic samples from the learned conditional d…

    Submitted 7 June, 2024; originally announced June 2024.

    Comments: 24 pages, 9 figures

  17. arXiv:2406.05225  [pdf, other]

    cs.LG stat.ML

    A Manifold Perspective on the Statistical Generalization of Graph Neural Networks

    Authors: Zhiyang Wang, Juan Cervino, Alejandro Ribeiro

    Abstract: Convolutional neural networks have been successfully extended to operate on graphs, giving rise to Graph Neural Networks (GNNs). GNNs combine information from adjacent nodes by successive applications of graph convolutions. GNNs have been implemented successfully in various learning tasks while the theoretical understanding of their generalization capability is still in progress. In this paper, we…

    Submitted 20 August, 2024; v1 submitted 7 June, 2024; originally announced June 2024.

    Comments: 34 pages, 22 figures

  18. arXiv:2406.05213  [pdf, other]

    cs.CL cs.AI cs.LG stat.ML

    On Subjective Uncertainty Quantification and Calibration in Natural Language Generation

    Authors: Ziyu Wang, Chris Holmes

    Abstract: Applications of large language models often involve the generation of free-form responses, in which case uncertainty quantification becomes challenging. This is due to the need to identify task-specific uncertainties (e.g., about the semantics) which appears difficult to define in general cases. This work addresses these challenges from a perspective of Bayesian decision theory, starting from the…

    Submitted 7 June, 2024; originally announced June 2024.

  19. arXiv:2406.05193  [pdf, ps, other]

    stat.ME stat.CO

    Probabilistic Clustering using Shared Latent Variable Model for Assessing Alzheimers Disease Biomarkers

    Authors: Yizhen Xu, Scott Zeger, Zheyu Wang

    Abstract: The preclinical stage of many neurodegenerative diseases can span decades before symptoms become apparent. Understanding the sequence of preclinical biomarker changes provides a critical opportunity for early diagnosis and effective intervention prior to significant loss of patients' brain functions. The main challenge to early detection lies in the absence of direct observation of the disease sta…

    Submitted 7 June, 2024; originally announced June 2024.

  20. arXiv:2406.04575  [pdf, other]

    cs.LG cs.AI stat.AP stat.ML

    Optimization of geological carbon storage operations with multimodal latent dynamic model and deep reinforcement learning

    Authors: Zhongzheng Wang, Yuntian Chen, Guodong Chen, Dongxiao Zhang

    Abstract: Maximizing storage performance in geological carbon storage (GCS) is crucial for commercial deployment, but traditional optimization demands resource-intensive simulations, posing computational challenges. This study introduces the multimodal latent dynamic (MLD) model, a deep learning framework for fast flow prediction and well control optimization in GCS. The MLD model includes a representation…

    Submitted 6 June, 2024; originally announced June 2024.

  21. arXiv:2406.04329  [pdf, other]

    cs.LG stat.ML

    Simplified and Generalized Masked Diffusion for Discrete Data

    Authors: Jiaxin Shi, Kehang Han, Zhe Wang, Arnaud Doucet, Michalis K. Titsias

    Abstract: Masked (or absorbing) diffusion is actively explored as an alternative to autoregressive models for generative modeling of discrete data. However, existing work in this area has been hindered by unnecessarily complex model formulations and unclear relationships between different perspectives, leading to suboptimal parameterization, training objectives, and ad hoc adjustments to counteract these is…

    Submitted 6 June, 2024; originally announced June 2024.

  22. arXiv:2406.01561  [pdf, other]

    cs.CV cs.AI cs.CL cs.LG stat.ML

    Long and Short Guidance in Score identity Distillation for One-Step Text-to-Image Generation

    Authors: Mingyuan Zhou, Zhendong Wang, Huangjie Zheng, Hai Huang

    Abstract: Diffusion-based text-to-image generation models trained on extensive text-image pairs have shown the capacity to generate photorealistic images consistent with textual descriptions. However, a significant limitation of these models is their slow sample generation, which requires iterative refinement through the same network. In this paper, we enhance Score identity Distillation (SiD) by developing…

    Submitted 8 August, 2024; v1 submitted 3 June, 2024; originally announced June 2024.

    Comments: Code and model checkpoints available at https://github.com/mingyuanzhou/SiD-LSG

  23. arXiv:2406.00793  [pdf, other]

    stat.ML cs.LG

    Is In-Context Learning in Large Language Models Bayesian? A Martingale Perspective

    Authors: Fabian Falck, Ziyu Wang, Chris Holmes

    Abstract: In-context learning (ICL) has emerged as a particularly remarkable characteristic of Large Language Models (LLM): given a pretrained LLM and an observed dataset, LLMs can make predictions for new data points from the same distribution without fine-tuning. Numerous works have postulated ICL as approximately Bayesian inference, rendering this a natural hypothesis. In this work, we analyse this hypot…

    Submitted 2 June, 2024; originally announced June 2024.

    Comments: Accepted at International Conference on Machine Learning (ICML) 2024

  24. arXiv:2405.20763  [pdf, other]

    cs.LG math.OC stat.ML

    Improving Generalization and Convergence by Enhancing Implicit Regularization

    Authors: Mingze Wang, Jinbo Wang, Haotian He, Zilin Wang, Guanhua Huang, Feiyu Xiong, Zhiyu Li, Weinan E, Lei Wu

    Abstract: In this work, we propose an Implicit Regularization Enhancement (IRE) framework to accelerate the discovery of flat solutions in deep learning, thereby improving generalization and convergence. Specifically, IRE decouples the dynamics of flat and sharp directions, which boosts the sharpness reduction along flat directions while maintaining the training stability in sharp directions. We show that I…

    Submitted 20 August, 2024; v1 submitted 31 May, 2024; originally announced May 2024.

    Comments: 35 pages

  25. arXiv:2405.18459  [pdf, other]

    cs.IT cs.AI cs.LG stat.ME

    Probing the Information Theoretical Roots of Spatial Dependence Measures

    Authors: Zhangyu Wang, Krzysztof Janowicz, Gengchen Mai, Ivan Majic

    Abstract: Intuitively, there is a relation between measures of spatial dependence and information theoretical measures of entropy. For instance, we can provide an intuition of why spatial data is special by stating that, on average, spatial data samples contain less than expected information. Similarly, spatial data, e.g., remotely sensed imagery, that is easy to compress is also likely to show significant…

    Submitted 23 July, 2024; v1 submitted 28 May, 2024; originally announced May 2024.

    Comments: COSIT-2024 Conference Proceedings

  26. arXiv:2405.18395  [pdf, other]

    cs.LG cs.AI stat.AP

    MC-GTA: Metric-Constrained Model-Based Clustering using Goodness-of-fit Tests with Autocorrelations

    Authors: Zhangyu Wang, Gengchen Mai, Krzysztof Janowicz, Ni Lao

    Abstract: A wide range of (multivariate) temporal (1D) and spatial (2D) data analysis tasks, such as grouping vehicle sensor trajectories, can be formulated as clustering with given metric constraints. Existing metric-constrained clustering algorithms overlook the rich correlation between feature similarity and metric distance, i.e., metric autocorrelation. The model-based variations of these clustering alg…

    Submitted 2 June, 2024; v1 submitted 28 May, 2024; originally announced May 2024.

    Comments: ICML-2024 Proceedings

  27. arXiv:2405.16436  [pdf, other]

    cs.LG cs.AI stat.ML

    Provably Mitigating Overoptimization in RLHF: Your SFT Loss is Implicitly an Adversarial Regularizer

    Authors: Zhihan Liu, Miao Lu, Shenao Zhang, Boyi Liu, Hongyi Guo, Yingxiang Yang, Jose Blanchet, Zhaoran Wang

    Abstract: Aligning generative models with human preference via RLHF typically suffers from overoptimization, where an imperfectly learned reward model can misguide the generative model to output undesired responses. We investigate this problem in a principled manner by identifying the source of the misalignment as a form of distributional shift and uncertainty in learning human preferences. To mitigate over…

    Submitted 26 May, 2024; originally announced May 2024.

    Comments: 27 pages, 7 figures

  28. arXiv:2405.07761  [pdf, other]

    cs.LG cs.AI cs.SC math-ph stat.AP

    LLM4ED: Large Language Models for Automatic Equation Discovery

    Authors: Mengge Du, Yuntian Chen, Zhongzheng Wang, Longfeng Nie, Dongxiao Zhang

    Abstract: Equation discovery is aimed at directly extracting physical laws from data and has emerged as a pivotal research domain. Previous methods based on symbolic mathematics have achieved substantial advancements, but often require the design of implementation of complex algorithms. In this paper, we introduce a new framework that utilizes natural language-based prompts to guide large language models (L…

    Submitted 22 July, 2024; v1 submitted 13 May, 2024; originally announced May 2024.

  29. arXiv:2405.04393  [pdf, other]

    stat.ML cs.LG

    Efficient Online Set-valued Classification with Bandit Feedback

    Authors: Zhou Wang, Xingye Qiao

    Abstract: Conformal prediction is a distribution-free method that wraps a given machine learning model and returns a set of plausible labels that contain the true label with a prescribed coverage rate. In practice, the empirical coverage achieved highly relies on fully observed label information from data both in the training phase for model fitting and the calibration phase for quantile estimation. This de…

    Submitted 7 May, 2024; originally announced May 2024.
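
To make the opening sentence of the abstract above concrete, here is a minimal sketch of ordinary split conformal prediction for classification, assuming fully observed calibration labels. It does not implement the bandit-feedback setting the paper addresses. The helper names, the nonconformity score (one minus the predicted probability of the true label), and all numbers are illustrative assumptions.

```python
import math

def conformal_quantile(scores, alpha):
    """Return the calibration threshold for target coverage 1 - alpha."""
    n = len(scores)
    k = math.ceil((n + 1) * (1 - alpha))  # finite-sample corrected rank
    if k > n:
        return float("inf")  # too few calibration points: keep every label
    return sorted(scores)[k - 1]

def prediction_set(probs, qhat):
    """Keep every label whose score 1 - p(label) is within the threshold."""
    return {label for label, p in probs.items() if 1.0 - p <= qhat}

# Hypothetical calibration scores: 1 - model probability of the true label.
cal_scores = [0.05, 0.10, 0.20, 0.30, 0.40, 0.55, 0.70]
qhat = conformal_quantile(cal_scores, alpha=0.25)

# Hypothetical predicted class probabilities for one new input.
new_probs = {"cat": 0.50, "dog": 0.46, "bird": 0.04}
new_set = prediction_set(new_probs, qhat)
print(qhat, sorted(new_set))
```

The threshold depends on the true labels of the calibration points, which is the reliance on fully observed label information that the abstract describes.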

  30. arXiv:2405.03329  [pdf, other]

    cs.LG stat.ML

    Policy Learning for Balancing Short-Term and Long-Term Rewards

    Authors: Peng Wu, Ziyu Shen, Feng Xie, Zhongyao Wang, Chunchen Liu, Yan Zeng

    Abstract: Empirical researchers and decision-makers spanning various domains frequently seek profound insights into the long-term impacts of interventions. While the significance of long-term outcomes is undeniable, an overemphasis on them may inadvertently overshadow short-term gains. Motivated by this, this paper formalizes a new framework for learning the optimal policy that effectively balances both lon…

    Submitted 6 May, 2024; originally announced May 2024.

  31. arXiv:2404.19292  [pdf, other]

    cs.IT cs.LG cs.MA stat.ML

    Provably Efficient Information-Directed Sampling Algorithms for Multi-Agent Reinforcement Learning

    Authors: Qiaosheng Zhang, Chenjia Bai, Shuyue Hu, Zhen Wang, Xuelong Li

    Abstract: This work designs and analyzes a novel set of algorithms for multi-agent reinforcement learning (MARL) based on the principle of information-directed sampling (IDS). These algorithms draw inspiration from foundational concepts in information theory, and are proven to be sample efficient in MARL settings such as two-player zero-sum Markov games (MGs) and multi-player general-sum MGs. For episodic t…

    Submitted 30 April, 2024; originally announced April 2024.

  32. arXiv:2404.12312  [pdf, ps, other]

    cs.LG math.OC stat.ML

    A Mean-Field Analysis of Neural Stochastic Gradient Descent-Ascent for Functional Minimax Optimization

    Authors: Yuchen Zhu, Yufeng Zhang, Zhaoran Wang, Zhuoran Yang, Xiaohong Chen

    Abstract: This paper studies minimax optimization problems defined over infinite-dimensional function classes of overparameterized two-layer neural networks. In particular, we consider the minimax optimization problem stemming from estimating linear functional equations defined by conditional expectations, where the objective functions are quadratic in the functional spaces. We address (i) the convergence o…

    Submitted 25 May, 2024; v1 submitted 18 April, 2024; originally announced April 2024.

    Comments: Submitted

  33. arXiv:2404.08667  [pdf, other]

    eess.SY stat.AP

    Traffic State Estimation and Uncertainty Quantification at Signalized Intersections with Low Penetration Rate Vehicle Trajectory Data

    Authors: Xingmin Wang, Zihao Wang, Zachary Jerome, Henry X. Liu

    Abstract: This paper studies the traffic state estimation problem at signalized intersections with low penetration rate vehicle trajectory data. While many existing studies have proposed different methods to estimate unknown traffic states and parameters (e.g., penetration rate, queue length) with this data, most of them only provide a point estimation without knowing the uncertainty of these estimated valu…

    Submitted 1 April, 2024; originally announced April 2024.

  34. arXiv:2404.07323  [pdf, other]

    stat.ME math.ST

    Surrogate modeling for probability distribution estimation: uniform or adaptive design?

    Authors: Maijia Su, Ziqi Wang, Oreste Salvatore Bursi, Marco Broccardo

    Abstract: The active learning (AL) technique, one of the state-of-the-art methods for constructing surrogate models, has shown high accuracy and efficiency in forward uncertainty quantification (UQ) analysis. This paper provides a comprehensive study on AL-based global surrogates for computing the full distribution function, i.e., the cumulative distribution function (CDF) and the complementary CDF (CCDF).…

    Submitted 10 April, 2024; originally announced April 2024.

  35. arXiv:2404.06984  [pdf, other]

    stat.ME

    Adaptive Strategy of Testing Alphas in High Dimensional Linear Factor Pricing Models

    Authors: Chenxi Zhao, Ping Zhao, Long Feng, Zhaojun Wang

    Abstract: In recent years, there has been considerable research on testing alphas in high-dimensional linear factor pricing models. In our study, we introduce a novel max-type test procedure that performs well under sparse alternatives. Furthermore, we demonstrate that this new max-type test procedure is asymptotically independent from the sum-type test procedure proposed by Pesaran and Yamagata (2017). Bui…

    Submitted 10 April, 2024; originally announced April 2024.

  36. arXiv:2404.04057  [pdf, other]

    cs.LG cs.AI cs.CV stat.ML

    Score identity Distillation: Exponentially Fast Distillation of Pretrained Diffusion Models for One-Step Generation

    Authors: Mingyuan Zhou, Huangjie Zheng, Zhendong Wang, Mingzhang Yin, Hai Huang

    Abstract: We introduce Score identity Distillation (SiD), an innovative data-free method that distills the generative capabilities of pretrained diffusion models into a single-step generator. SiD not only facilitates an exponentially fast reduction in Fréchet inception distance (FID) during distillation but also approaches or even exceeds the FID performance of the original teacher diffusion models. By refo…

    Submitted 24 May, 2024; v1 submitted 5 April, 2024; originally announced April 2024.

    Comments: ICML 2024, PyTorch implementation: https://github.com/mingyuanzhou/SiD

  37. arXiv:2403.19629  [pdf, other]

    cs.LG stat.ML

    Metric Learning from Limited Pairwise Preference Comparisons

    Authors: Zhi Wang, Geelon So, Ramya Korlakai Vinayak

    Abstract: We study metric learning from preference comparisons under the ideal point model, in which a user prefers an item over another if it is closer to their latent ideal item. These items are embedded into $\mathbb{R}^d$ equipped with an unknown Mahalanobis distance shared across users. While recent work shows that it is possible to simultaneously recover the metric and ideal items given…

    Submitted 12 July, 2024; v1 submitted 28 March, 2024; originally announced March 2024.

    Comments: The 40th Conference on Uncertainty in Artificial Intelligence (UAI-2024)
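
The ideal point model described in the abstract above can be illustrated with a small sketch: a user prefers whichever item is closer to their ideal point under a Mahalanobis distance. Nothing here is taken from the paper beyond that definition. The function names, the two metric matrices, and the item coordinates are made up for illustration.

```python
def mahalanobis_sq(x, u, M):
    """Squared Mahalanobis distance (x - u)^T M (x - u) for a PSD matrix M."""
    diff = [a - b for a, b in zip(x, u)]
    d = len(diff)
    return sum(diff[i] * M[i][j] * diff[j] for i in range(d) for j in range(d))

def prefers(item_a, item_b, ideal, M):
    """True if the user with this ideal point prefers item_a over item_b."""
    return mahalanobis_sq(item_a, ideal, M) < mahalanobis_sq(item_b, ideal, M)

ideal = [0.0, 0.0]    # hypothetical latent ideal item
item_a = [2.0, 0.0]
item_b = [0.0, 1.5]

M_identity = [[1.0, 0.0], [0.0, 1.0]]  # plain Euclidean distance
M_weighted = [[1.0, 0.0], [0.0, 4.0]]  # second coordinate matters more

print(prefers(item_a, item_b, ideal, M_identity))
print(prefers(item_a, item_b, ideal, M_weighted))
```

The same pair of items is ranked differently under the two metrics, which is why observed comparisons carry information about the unknown metric.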

  38. arXiv:2403.19381  [pdf, other]

    stat.ML cs.LG

    On Uncertainty Quantification for Near-Bayes Optimal Algorithms

    Authors: Ziyu Wang, Chris Holmes

    Abstract: Bayesian modelling allows for the quantification of predictive uncertainty which is crucial in safety-critical applications. Yet for many machine learning (ML) algorithms, it is difficult to construct or implement their Bayesian counterpart. In this work we present a promising approach to address this challenge, based on the hypothesis that commonly used ML algorithms are efficient across a wide v…

    Submitted 28 March, 2024; originally announced March 2024.

  39. arXiv:2403.18540  [pdf, other]

    stat.ML cs.LG stat.CO

    skscope: Fast Sparsity-Constrained Optimization in Python

    Authors: Zezhi Wang, Jin Zhu, Peng Chen, Huiyang Peng, Xiaoke Zhang, Anran Wang, Junxian Zhu, Xueqin Wang

    Abstract: Applying iterative solvers on sparsity-constrained optimization (SCO) requires tedious mathematical deduction and careful programming/debugging that hinders these solvers' broad impact. In the paper, the library skscope is introduced to overcome such an obstacle. With skscope, users can solve the SCO by just programming the objective function. The convenience of skscope is demonstrated through two…

    Submitted 22 August, 2024; v1 submitted 27 March, 2024; originally announced March 2024.

    Comments: 4 pages; add experiment

  40. arXiv:2403.16825  [pdf, ps, other]

    cs.LG math.OC math.PR stat.ML

    Weak Convergence Analysis of Online Neural Actor-Critic Algorithms

    Authors: Samuel Chun-Hei Lam, Justin Sirignano, Ziheng Wang

    Abstract: We prove that a single-layer neural network trained with the online actor critic algorithm converges in distribution to a random ordinary differential equation (ODE) as the number of hidden units and the number of training steps $\rightarrow \infty$. In the online actor-critic algorithm, the distribution of the data samples dynamically changes as the model is updated, which is a key challenge for…

    Submitted 25 March, 2024; originally announced March 2024.

  41. arXiv:2403.14830  [pdf, other]

    stat.ML cs.LG

    Deep Clustering Evaluation: How to Validate Internal Clustering Validation Measures

    Authors: Zeya Wang, Chenglong Ye

    Abstract: Deep clustering, a method for partitioning complex, high-dimensional data using deep neural networks, presents unique evaluation challenges. Traditional clustering validation measures, designed for low-dimensional spaces, are problematic for deep clustering, which involves projecting data into lower-dimensional embeddings before partitioning. Two key issues are identified: 1) the curse of dimensio…

    Submitted 21 March, 2024; originally announced March 2024.

  42. arXiv:2403.13196  [pdf, other]

    cs.LG cs.AI cs.CV stat.ML

    ADAPT to Robustify Prompt Tuning Vision Transformers

    Authors: Masih Eskandar, Tooba Imtiaz, Zifeng Wang, Jennifer Dy

    Abstract: The performance of deep models, including Vision Transformers, is known to be vulnerable to adversarial attacks. Many existing defenses against these attacks, such as adversarial training, rely on full-model fine-tuning to induce robustness in the models. These defenses require storing a copy of the entire model, that can have billions of parameters, for each task. At the same time, parameter-effi…

    Submitted 19 March, 2024; originally announced March 2024.

  43. arXiv:2403.13081  [pdf, other]

    stat.AP math.PR q-bio.PE

    Parameter Estimation from Single Patient, Single Time-Point Sequencing Data of Recurrent Tumors

    Authors: Kevin Leder, Ruping Sun, Zicheng Wang, Xuanming Zhang

    Abstract: In this study, we develop consistent estimators for key parameters that govern the dynamics of tumor cell populations when subjected to pharmacological treatments. While these treatments often lead to an initial reduction in the abundance of drug-sensitive cells, a population of drug-resistant cells frequently emerges over time, resulting in cancer recurrence. Samples from recurrent tumors present…

    Submitted 19 March, 2024; originally announced March 2024.

  44. arXiv:2403.11429  [pdf, other]

    stat.AP

    Long-range Ising model for regional-scale seismic risk analysis

    Authors: Sebin Oh, Sang-ri Yi, Ziqi Wang

    Abstract: This study introduces the long-range Ising model from statistical mechanics to the Performance-Based Earthquake Engineering (PBEE) framework for regional seismic damage analysis. The application of the PBEE framework at a regional scale involves estimating the damage states of numerous structures, typically performed using fragility function-based stochastic simulations. However, these simulations…

    Submitted 23 May, 2024; v1 submitted 17 March, 2024; originally announced March 2024.

  45. arXiv:2403.00283  [pdf, other]

    stat.AP

    Risk Twin: Real-time Risk Visualization and Control for Structural Systems

    Authors: Zeyu Wang, Ziqi Wang

    Abstract: Digital twinning in structural engineering is a rapidly evolving technology that aims to eliminate the gap between physical systems and their digital models through real-time sensing, visualization, and control techniques. Although Digital Twins can offer dynamic insights into physical systems, their accuracy is inevitably compromised by uncertainties in sensing, modeling, simulation, and control.…

    Submitted 27 August, 2024; v1 submitted 29 February, 2024; originally announced March 2024.

  46. arXiv:2402.10810  [pdf, ps, other]

    cs.LG math.OC stat.ML

    Double Duality: Variational Primal-Dual Policy Optimization for Constrained Reinforcement Learning

    Authors: Zihao Li, Boyi Liu, Zhuoran Yang, Zhaoran Wang, Mengdi Wang

    Abstract: We study the Constrained Convex Markov Decision Process (MDP), where the goal is to minimize a convex functional of the visitation measure, subject to a convex constraint. Designing algorithms for a constrained convex MDP faces several challenges, including (1) handling the large state space, (2) managing the exploration/exploitation tradeoff, and (3) solving the constrained optimization where the…

    Submitted 16 February, 2024; originally announced February 2024.

  47. arXiv:2402.10127  [pdf, other]

    stat.ML cs.LG math.PR math.ST

    Nonlinear spiked covariance matrices and signal propagation in deep neural networks

    Authors: Zhichao Wang, Denny Wu, Zhou Fan

    Abstract: Many recent works have studied the eigenvalue spectrum of the Conjugate Kernel (CK) defined by the nonlinear feature map of a feedforward neural network. However, existing results only establish weak convergence of the empirical eigenvalue distribution, and fall short of providing precise quantitative characterizations of the "spike" eigenvalues and eigenvectors that often capture the low-dimens…

    Submitted 15 February, 2024; originally announced February 2024.

    Comments: 55 pages

  48. arXiv:2402.08539  [pdf]

    cs.LG stat.AP

    Intelligent Diagnosis of Alzheimer's Disease Based on Machine Learning

    Authors: Mingyang Li, Hongyu Liu, Yixuan Li, Zejun Wang, Yuan Yuan, Honglin Dai

    Abstract: This study is based on the Alzheimer's Disease Neuroimaging Initiative (ADNI) dataset and aims to explore early detection and disease progression in Alzheimer's disease (AD). We employ innovative data preprocessing strategies, including the use of the random forest algorithm to fill missing data and the handling of outliers and invalid data, thereby fully mining and utilizing these limited data re…

    Submitted 13 February, 2024; originally announced February 2024.

  49. arXiv:2402.07227  [pdf, other]

    math.DS econ.GN stat.AP

    Time-Delayed Game Strategy Analysis Among Japan, Other Nations, and the International Atomic Energy Agency in the Context of Fukushima Nuclear Wastewater Discharge Decision

    Authors: Mingyang Li, Han Pengsihua, Fujiao Meng, Zejun Wang, Weian Liu

    Abstract: This academic paper examines the strategic interactions between Japan, other nations, and the International Atomic Energy Agency (IAEA) regarding Japan's decision to release treated nuclear wastewater from the Fukushima Daiichi Nuclear Power Plant into the sea. It introduces a payoff matrix and time-delay elements in replicator dynamic equations to mirror real-world decision-making delays. The pap…

    Submitted 11 February, 2024; originally announced February 2024.

  50. arXiv:2402.07210  [pdf, other]

    math.DS econ.GN physics.soc-ph stat.AP

    Fukushima Nuclear Wastewater Discharge: An Evolutionary Game Theory Approach to International and Domestic Interaction and Strategic Decision-Making

    Authors: Mingyang Li, Han Pengsihua, Songqing Zhao, Zejun Wang, Limin Yang, Weian Liu

    Abstract: On August 24, 2023, Japan controversially decided to discharge nuclear wastewater from the Fukushima Daiichi Nuclear Power Plant into the ocean, sparking intense domestic and global debates. This study uses evolutionary game theory to analyze the strategic dynamics between Japan, other countries, and the Japan Fisheries Association. By incorporating economic, legal, international aid, and environm…

    Submitted 11 February, 2024; originally announced February 2024.