
Showing 1–50 of 528 results for author: Li, X

Searching in archive stat.
  1. arXiv:2408.08252  [pdf, other]

    cs.LG cs.AI q-bio.GN stat.ML

    Derivative-Free Guidance in Continuous and Discrete Diffusion Models with Soft Value-Based Decoding

    Authors: Xiner Li, Yulai Zhao, Chenyu Wang, Gabriele Scalia, Gokcen Eraslan, Surag Nair, Tommaso Biancalani, Aviv Regev, Sergey Levine, Masatoshi Uehara

    Abstract: Diffusion models excel at capturing the natural design spaces of images, molecules, DNA, RNA, and protein sequences. However, rather than merely generating designs that are natural, we often aim to optimize downstream reward functions while preserving the naturalness of these design spaces. Existing methods for achieving this goal often require "differentiable" proxy models (e.g., class…

    Submitted 15 August, 2024; originally announced August 2024.

    Comments: The code is available at https://github.com/masa-ue/SVDD

  2. arXiv:2408.03415  [pdf, other]

    stat.ME stat.CO

    A Novel Approximate Bayesian Inference Method for Compartmental Models in Epidemiology using Stan

    Authors: Xiahui Li, Ben Swallow, Fergus J. Chadwick

    Abstract: Mechanistic compartmental models are widely used in epidemiology to study the dynamics of infectious disease transmission. These models have significantly contributed to designing and evaluating effective control strategies during pandemics. However, the increasing complexity and the number of parameters needed to describe rapidly evolving transmission scenarios present significant challenges for…

    Submitted 6 August, 2024; originally announced August 2024.

  3. arXiv:2408.02045  [pdf, other]

    stat.ML cs.LG

    DNA-SE: Towards Deep Neural-Nets Assisted Semiparametric Estimation

    Authors: Qinshuo Liu, Zixin Wang, Xi-An Li, Xinyao Ji, Lei Zhang, Lin Liu, Zhonghua Liu

    Abstract: Semiparametric statistics play a pivotal role in a wide range of domains, including but not limited to missing data, causal inference, and transfer learning, to name a few. In many settings, semiparametric theory leads to (nearly) statistically optimal procedures that yet involve numerically solving Fredholm integral equations of the second kind. Traditional numerical methods, such as polynomial o…

    Submitted 4 August, 2024; originally announced August 2024.

    Comments: semiparametric statistics, missing data, causal inference, Fredholm integral equations of the second kind, bi-level optimization, deep learning, AI for science

  4. arXiv:2407.19373  [pdf, other]

    stat.ML cs.LG

    Uncertainty Quantification of Data Shapley via Statistical Inference

    Authors: Mengmeng Wu, Zhihong Liu, Xiang Li, Ruoxi Jia, Xiangyu Chang

    Abstract: As data plays an increasingly pivotal role in decision-making, the emergence of data markets underscores the growing importance of data valuation. Within the machine learning landscape, Data Shapley stands out as a widely embraced method for data valuation. However, a limitation of Data Shapley is its assumption of a fixed dataset, contrasting with the dynamic nature of real-world applications whe…

    Submitted 27 July, 2024; originally announced July 2024.

  5. arXiv:2407.17719  [pdf]

    stat.AP

    A new moment-independent uncertainty importance measure based on cumulative residual entropy for developing uncertainty reduction strategies

    Authors: Shi-Shun Chen, Xiao-Yang Li

    Abstract: Uncertainty reduction is vital for improving system reliability and reducing risks. To identify the best target for uncertainty reduction, an uncertainty importance measure is commonly used to prioritize the significance of input variable uncertainties. Then, designers will take steps to reduce the uncertainties of variables with high importance. However, for variables with minimal uncertainty, the c…

    Submitted 24 July, 2024; originally announced July 2024.

  6. arXiv:2407.17718  [pdf]

    stat.AP

    Comparison of global sensitivity analysis methods for a fire spread model with a segmented characteristic

    Authors: Shi-Shun Chen, Xiao-Yang Li

    Abstract: Global sensitivity analysis (GSA) can provide rich information for controlling output uncertainty. In practical applications, segmented models are commonly used to describe an abrupt model change. For segmented models, the complicated uncertainty propagation during the transition region may lead to different importance rankings of different GSA methods. If an unsuitable GSA method is applied, misl…

    Submitted 24 July, 2024; originally announced July 2024.

  7. arXiv:2407.16739  [pdf, other]

    stat.ML cs.LG

    Forecasting Automotive Supply Chain Shortfalls with Heterogeneous Time Series

    Authors: Bach Viet Do, Xingyu Li, Chaoye Pan, Oleg Gusikhin

    Abstract: Operational disruptions can significantly impact companies' performance. Ford, with its 37 plants globally, uses 17 billion parts annually to manufacture six million cars and trucks. With up to ten tiers of suppliers between the company and raw materials, any extended disruption in this supply chain can cause substantial financial losses. Therefore, the ability to forecast and identify such disrupt…

    Submitted 26 July, 2024; v1 submitted 23 July, 2024; originally announced July 2024.

  8. arXiv:2407.13261  [pdf, other]

    stat.ME

    Enhanced inference for distributions and quantiles of individual treatment effects in various experiments

    Authors: Zhe Chen, Xinran Li

    Abstract: Understanding treatment effect heterogeneity has become increasingly important in many fields. In this paper we study distributions and quantiles of individual treatment effects to provide a more comprehensive and robust understanding of treatment effects beyond usual averages, although they are more challenging to infer due to nonidentifiability from observed data. Recent randomization-based appro…

    Submitted 18 July, 2024; originally announced July 2024.

  9. arXiv:2406.18137  [pdf, ps, other]

    stat.ML cs.LG

    Sparse deep neural networks for nonparametric estimation in high-dimensional sparse regression

    Authors: Dongya Wu, Xin Li

    Abstract: Generalization theory has been established for sparse deep neural networks under the high-dimensional regime. Beyond generalization, parameter estimation is also important since it is crucial for variable selection and interpretability of deep neural networks. Current theoretical studies concerning parameter estimation mainly focus on two-layer neural networks, which is due to the fact that the conver…

    Submitted 26 June, 2024; originally announced June 2024.

  10. arXiv:2406.06980  [pdf, other]

    stat.ME

    Sensitivity Analysis for the Test-Negative Design

    Authors: Soumyabrata Kundu, Peng Ding, Xinran Li, Jingshu Wang

    Abstract: The test-negative design has become popular for evaluating the effectiveness of post-licensure vaccines using observational data. In addition to its logistical convenience in data collection, the design is also believed to control for the differential health-care-seeking behavior between vaccinated and unvaccinated individuals, which is an important yet often unmeasured confounder between the va…

    Submitted 11 June, 2024; originally announced June 2024.

  11. arXiv:2406.05855  [pdf, other]

    cs.LG cs.AI stat.ML

    Self-Distilled Disentangled Learning for Counterfactual Prediction

    Authors: Xinshu Li, Mingming Gong, Lina Yao

    Abstract: The advancements in disentangled representation learning significantly enhance the accuracy of counterfactual predictions by granting precise control over instrumental variables, confounders, and adjustable variables. An appealing method for achieving the independent separation of these factors is mutual information minimization, a task that presents challenges in numerous machine learning scenari…

    Submitted 14 June, 2024; v1 submitted 9 June, 2024; originally announced June 2024.

  12. arXiv:2406.05637  [pdf, ps, other]

    math.OC cs.LG math.PR stat.ML

    A Generalized Version of Chung's Lemma and its Applications

    Authors: Li Jiang, Xiao Li, Andre Milzarek, Junwen Qiu

    Abstract: Chung's lemma is a classical tool for establishing asymptotic convergence rates of (stochastic) optimization methods under strong convexity-type assumptions and appropriate polynomial diminishing step sizes. In this work, we develop a generalized version of Chung's lemma, which provides a simple non-asymptotic convergence framework for a more general family of step size rules. We demonstrate broad…

    Submitted 9 June, 2024; originally announced June 2024.

    Comments: 43 pages, 5 figures

    MSC Class: 90C15; 90C30; 90C26

  13. arXiv:2406.05340  [pdf, other]

    stat.ME stat.ML

    Selecting the Number of Communities for Weighted Degree-Corrected Stochastic Block Models

    Authors: Yucheng Liu, Xiaodong Li

    Abstract: We investigate how to select the number of communities for weighted networks without a full likelihood modeling. First, we propose a novel weighted degree-corrected stochastic block model (DCSBM), in which the mean adjacency matrix is modeled as the same as in standard DCSBM, while the variance profile matrix is assumed to be related to the mean adjacency matrix through a given variance function.…

    Submitted 7 June, 2024; originally announced June 2024.

    Comments: 3 figures, 2 tables

  14. arXiv:2406.01653  [pdf, other]

    stat.ML cs.LG math.PR stat.AP stat.ME

    An efficient Wasserstein-distance approach for reconstructing jump-diffusion processes using parameterized neural networks

    Authors: Mingtao Xia, Xiangting Li, Qijing Shen, Tom Chou

    Abstract: We analyze the Wasserstein distance ($W$-distance) between two probability distributions associated with two multidimensional jump-diffusion processes. Specifically, we analyze a temporally decoupled squared $W_2$-distance, which provides both upper and lower bounds associated with the discrepancies in the drift, diffusion, and jump amplitude functions between the two jump-diffusion processes. The…

    Submitted 3 June, 2024; originally announced June 2024.

    MSC Class: 60G07; 60J76

  15. arXiv:2405.18373  [pdf, other]

    stat.ML cs.LG math.OC

    A Hessian-Aware Stochastic Differential Equation for Modelling SGD

    Authors: Xiang Li, Zebang Shen, Liang Zhang, Niao He

    Abstract: Continuous-time approximation of Stochastic Gradient Descent (SGD) is a crucial tool to study its escaping behaviors from stationary points. However, existing stochastic differential equation (SDE) models fail to fully capture these behaviors, even for simple quadratic objectives. Built on a novel stochastic backward error analysis framework, we derive the Hessian-Aware Stochastic Modified Equatio…

    Submitted 5 August, 2024; v1 submitted 28 May, 2024; originally announced May 2024.

  16. arXiv:2405.16859  [pdf, other]

    stat.ME

    Gaussian Mixture Model with Rare Events

    Authors: Xuetong Li, Jing Zhou, Hansheng Wang

    Abstract: We study here a Gaussian Mixture Model (GMM) with rare events data. In this case, the commonly used Expectation-Maximization (EM) algorithm exhibits an extremely slow numerical convergence rate. To theoretically understand this phenomenon, we formulate the numerical convergence problem of the EM algorithm with rare events data as a problem about a contraction operator. Theoretical analysis reveals th…

    Submitted 27 May, 2024; originally announced May 2024.

  17. arXiv:2405.15115  [pdf, other]

    cs.LG cs.CL stat.ML

    Towards Better Understanding of In-Context Learning Ability from In-Context Uncertainty Quantification

    Authors: Shang Liu, Zhongze Cai, Guanting Chen, Xiaocheng Li

    Abstract: Predicting simple function classes has been widely used as a testbed for developing theory and understanding of the trained Transformer's in-context learning (ICL) ability. In this paper, we revisit the training of Transformers on linear regression tasks, and different from all the existing literature, we consider a bi-objective prediction task of predicting both the conditional expectation…

    Submitted 23 May, 2024; originally announced May 2024.

  18. arXiv:2405.06779  [pdf, other]

    econ.EM stat.AP

    Generalization Problems in Experiments Involving Multidimensional Decisions

    Authors: Jiawei Fu, Xiaojun Li

    Abstract: Can the causal effects estimated in experiments be generalized to real-world scenarios? This question lies at the heart of social science studies. External validity primarily assesses whether experimental effects persist across different settings, implicitly presuming the experiment's ecological validity, that is, the consistency of experimental effects with their real-life counterparts. However, we…

    Submitted 10 May, 2024; originally announced May 2024.

  19. arXiv:2404.19292  [pdf, other]

    cs.IT cs.LG cs.MA stat.ML

    Provably Efficient Information-Directed Sampling Algorithms for Multi-Agent Reinforcement Learning

    Authors: Qiaosheng Zhang, Chenjia Bai, Shuyue Hu, Zhen Wang, Xuelong Li

    Abstract: This work designs and analyzes a novel set of algorithms for multi-agent reinforcement learning (MARL) based on the principle of information-directed sampling (IDS). These algorithms draw inspiration from foundational concepts in information theory, and are proven to be sample efficient in MARL settings such as two-player zero-sum Markov games (MGs) and multi-player general-sum MGs. For episodic t…

    Submitted 30 April, 2024; originally announced April 2024.

  20. arXiv:2404.19242  [pdf, other]

    cs.CV eess.IV stat.ME

    A Minimal Set of Parameters Based Depth-Dependent Distortion Model and Its Calibration Method for Stereo Vision Systems

    Authors: Xin Ma, Puchen Zhu, Xiao Li, Xiaoyin Zheng, Jianshu Zhou, Xuchen Wang, Kwok Wai Samuel Au

    Abstract: Depth position highly affects lens distortion, especially in close-range photography, which limits the measurement accuracy of existing stereo vision systems. Moreover, traditional depth-dependent distortion models and their calibration methods have remained complicated. In this work, we propose a minimal set of parameters based depth-dependent distortion model (MDM), which considers the radial an…

    Submitted 1 May, 2024; v1 submitted 29 April, 2024; originally announced April 2024.

    Comments: This paper has been accepted for publication in IEEE Transactions on Instrumentation and Measurement

  21. arXiv:2404.17615  [pdf]

    stat.ME cs.LG stat.CO stat.ML

    DeepVARMA: A Hybrid Deep Learning and VARMA Model for Chemical Industry Index Forecasting

    Authors: Xiang Li, Hu Yang

    Abstract: Since the chemical industry index is one of the important indicators to measure the development of the chemical industry, forecasting it is critical for understanding the economic situation and trends of the industry. Taking the multivariable nonstationary series-synthetic material index as the main research object, this paper proposes a new prediction model: DeepVARMA, and its variants Deep-VARMA…

    Submitted 26 April, 2024; originally announced April 2024.

  22. arXiv:2404.08472  [pdf, other]

    cs.LG stat.ML

    TSLANet: Rethinking Transformers for Time Series Representation Learning

    Authors: Emadeldeen Eldele, Mohamed Ragab, Zhenghua Chen, Min Wu, Xiaoli Li

    Abstract: Time series data, characterized by its intrinsic long and short-range dependencies, poses a unique challenge across analytical applications. While Transformer-based models excel at capturing long-range dependencies, they face limitations in noise sensitivity, computational efficiency, and overfitting with smaller datasets. In response, we introduce a novel Time Series Lightweight Adaptive Network…

    Submitted 6 May, 2024; v1 submitted 12 April, 2024; originally announced April 2024.

    Comments: Accepted in ICML 2024

  23. arXiv:2404.06013  [pdf, other]

    cs.LG math.OC stat.ML

    Feel-Good Thompson Sampling for Contextual Dueling Bandits

    Authors: Xuheng Li, Heyang Zhao, Quanquan Gu

    Abstract: Contextual dueling bandits, where a learner compares two options based on context and receives feedback indicating which was preferred, extend classic dueling bandits by incorporating contextual information for decision-making and preference learning. Several algorithms based on the upper confidence bound (UCB) have been proposed for linear contextual dueling bandits. However, no algorithm based…

    Submitted 9 April, 2024; originally announced April 2024.

    Comments: 30 pages, 6 figures

  24. arXiv:2404.05933  [pdf, other]

    stat.ME stat.CO

    fastcpd: Fast Change Point Detection in R

    Authors: Xingchi Li, Xianyang Zhang

    Abstract: Change point analysis is concerned with detecting and locating structure breaks in the underlying model of a sequence of observations ordered by time, space or other variables. A widely adopted approach for change point analysis is to minimize an objective function with a penalty term on the number of change points. This framework includes several well-established procedures, such as the penalized…

    Submitted 8 April, 2024; originally announced April 2024.

    Comments: 53 pages, 16 figures

  25. arXiv:2404.05484  [pdf, other]

    cs.LG stat.ML

    On Computational Modeling of Sleep-Wake Cycle

    Authors: Xin Li

    Abstract: Why do mammals need to sleep? Neuroscience treats sleep and wake as default and perturbation modes of the brain. It is hypothesized that the brain self-organizes neural activities without environmental inputs. This paper presents a new computational model of the sleep-wake cycle (SWC) for learning and memory. During the sleep mode, the memory consolidation by the thalamocortical system is abstract…

    Submitted 17 May, 2024; v1 submitted 8 April, 2024; originally announced April 2024.

  26. arXiv:2404.01245  [pdf, other]

    math.ST cs.CL cs.CR cs.LG stat.ML

    A Statistical Framework of Watermarks for Large Language Models: Pivot, Detection Efficiency and Optimal Rules

    Authors: Xiang Li, Feng Ruan, Huiyuan Wang, Qi Long, Weijie J. Su

    Abstract: Since ChatGPT was introduced in November 2022, embedding (nearly) unnoticeable statistical signals into text generated by large language models (LLMs), also known as watermarking, has been used as a principled approach to provable detection of LLM-generated text from its human-written counterpart. In this paper, we introduce a general and flexible framework for reasoning about the statistical effi…

    Submitted 28 August, 2024; v1 submitted 1 April, 2024; originally announced April 2024.

  27. arXiv:2404.00474  [pdf, other]

    cs.LG cs.AI cs.CL stat.ML

    Linguistic Calibration of Long-Form Generations

    Authors: Neil Band, Xuechen Li, Tengyu Ma, Tatsunori Hashimoto

    Abstract: Language models (LMs) may lead their users to make suboptimal downstream decisions when they confidently hallucinate. This issue can be mitigated by having the LM verbally convey the probability that its claims are correct, but existing models cannot produce long-form text with calibrated confidence statements. Through the lens of decision-making, we define linguistic calibration for long-form gen…

    Submitted 4 June, 2024; v1 submitted 30 March, 2024; originally announced April 2024.

    Comments: ICML 2024. Code available at https://github.com/tatsu-lab/linguistic_calibration

  28. arXiv:2403.13027  [pdf, other]

    cs.LG cs.CR cs.IT stat.ML

    Towards Better Statistical Understanding of Watermarking LLMs

    Authors: Zhongze Cai, Shang Liu, Hanzhao Wang, Huaiyang Zhong, Xiaocheng Li

    Abstract: In this paper, we study the problem of watermarking large language models (LLMs). We consider the trade-off between model distortion and detection ability and formulate it as a constrained optimization problem based on the green-red algorithm of Kirchenbauer et al. (2023a). We show that the optimal solution to the optimization problem enjoys a nice analytical property which provides a better under…

    Submitted 18 March, 2024; originally announced March 2024.

  29. arXiv:2403.11163  [pdf, ps, other]

    stat.ME cs.LG math.ST stat.CO

    A Selective Review on Statistical Methods for Massive Data Computation: Distributed Computing, Subsampling, and Minibatch Techniques

    Authors: Xuetong Li, Yuan Gao, Hong Chang, Danyang Huang, Yingying Ma, Rui Pan, Haobo Qi, Feifei Wang, Shuyuan Wu, Ke Xu, Jing Zhou, Xuening Zhu, Yingqiu Zhu, Hansheng Wang

    Abstract: This paper presents a selective review of statistical computation methods for massive data analysis. A large number of statistical methods for massive data computation have been rapidly developed in the past decades. In this work, we focus on three categories of statistical computation methods: (1) distributed computing, (2) subsampling methods, and (3) minibatch gradient techniques. The first clas…

    Submitted 17 March, 2024; originally announced March 2024.

  30. arXiv:2403.02696  [pdf, ps, other]

    math.ST stat.ME

    Low-rank matrix estimation via nonconvex spectral regularized methods in errors-in-variables matrix regression

    Authors: Xin Li, Dongya Wu

    Abstract: High-dimensional matrix regression has been studied in various aspects, such as statistical properties, computational efficiency and application to specific instances including multivariate regression, system identification and matrix compressed sensing. Current studies mainly consider the idealized case that the covariate matrix is obtained without noise, while the more realistic scenario that th…

    Submitted 5 March, 2024; originally announced March 2024.

  31. arXiv:2402.11858  [pdf, ps, other]

    stat.ML cs.LG math.OC

    Stochastic Hessian Fittings with Lie Groups

    Authors: Xi-Lin Li

    Abstract: This paper studies the fitting of Hessian or its inverse for stochastic optimizations using a Hessian fitting criterion from the preconditioned stochastic gradient descent (PSGD) method, which is intimately related to many commonly used second order and adaptive gradient optimizers, e.g., BFGS, Gaussian-Newton and natural gradient descent, AdaGrad, etc. Our analyses reveal the efficiency and relia…

    Submitted 14 April, 2024; v1 submitted 19 February, 2024; originally announced February 2024.

    Comments: 13 pages, 6 figures, 3 tables

  32. arXiv:2402.08602  [pdf, other]

    math.ST stat.ME stat.ML

    Globally-Optimal Greedy Experiment Selection for Active Sequential Estimation

    Authors: Xiaoou Li, Hongru Zhao

    Abstract: Motivated by modern applications such as computerized adaptive testing, sequential rank aggregation, and heterogeneous data source selection, we study the problem of active sequential estimation, which involves adaptively selecting experiments for sequentially collected data. The goal is to design experiment selection rules for more accurate model estimation. Greedy information-based experiment se…

    Submitted 13 February, 2024; originally announced February 2024.

  33. arXiv:2402.08151  [pdf, other]

    stat.ME cs.AI cs.LG math.SP math.ST

    Gradient-flow adaptive importance sampling for Bayesian leave one out cross-validation for sigmoidal classification models

    Authors: Joshua C Chang, Xiangting Li, Shixin Xu, Hao-Ren Yao, Julia Porcino, Carson Chow

    Abstract: We introduce a set of gradient-flow-guided adaptive importance sampling (IS) transformations to stabilize Monte-Carlo approximations of point-wise leave one out cross-validated (LOO) predictions for Bayesian classification models. One can leverage this methodology for assessing model generalizability by, for instance, computing a LOO analogue to the AIC or computing LOO ROC/PRC curves and derived me…

    Submitted 12 February, 2024; originally announced February 2024.

    Comments: Submitted

  34. arXiv:2402.05009  [pdf, other]

    stat.AP

    A Review on Trajectory Datasets on Advanced Driver Assistance System

    Authors: Hang Zhou, Ke Ma, Xiaopeng Li

    Abstract: This paper presents a comprehensive review of trajectory data from vehicles equipped with Advanced Driver Assistance Systems, with the aim of precisely modeling Autonomous Vehicle (AV) behavior. This study emphasizes the importance of trajectory data in the development of AV models, especially in car-following scenarios. We introduce and evaluate several datasets: the OpenACC Dataset, the Connected & Aut…

    Submitted 7 February, 2024; originally announced February 2024.

    Comments: 6 pages, 2 figures

  35. arXiv:2402.02701  [pdf, other]

    cs.LG cs.AI stat.ML

    Understanding What Affects Generalization Gap in Visual Reinforcement Learning: Theory and Empirical Evidence

    Authors: Jiafei Lyu, Le Wan, Xiu Li, Zongqing Lu

    Abstract: Recently, there have been many efforts attempting to learn useful policies for continuous control in visual reinforcement learning (RL). In this scenario, it is important to learn a generalizable policy, as the testing environment may differ from the training environment, e.g., there exist distractors during deployment. Many practical algorithms are proposed to handle this problem. However, to the best…

    Submitted 4 February, 2024; originally announced February 2024.

    Comments: Part of this work is accepted as AAMAS 2024 extended abstract

  36. arXiv:2401.16308  [pdf, ps, other]

    stat.AP q-bio.PE

    A Comprehensive Study of Covid 19 in Florida

    Authors: Julian Bennett, Lauren Eriksen, Xingjie Helen Li

    Abstract: Within the likes of any highly contagious and unpredictable disease, lies a predictable and attainable growth rate that researchers can find in order to make logistical conclusions about that particular disease and its affected regions' counterparts. The foundation that researchers pull from when studying a particular disease and looking for its growth rate is the Susceptible-Infected-Removed (SIR…

    Submitted 1 February, 2024; v1 submitted 29 January, 2024; originally announced January 2024.

    MSC Class: 93A30

  37. arXiv:2401.14142  [pdf, other]

    cs.CV cs.AI cs.LG stat.ML

    Energy-Based Concept Bottleneck Models: Unifying Prediction, Concept Intervention, and Probabilistic Interpretations

    Authors: Xinyue Xu, Yi Qin, Lu Mi, Hao Wang, Xiaomeng Li

    Abstract: Existing methods, such as concept bottleneck models (CBMs), have been successful in providing concept-based interpretations for black-box deep learning models. They typically work by predicting concepts given the input and then predicting the final class label given the predicted concepts. However, (1) they often fail to capture the high-order, nonlinear interaction between concepts, e.g., correct…

    Submitted 26 February, 2024; v1 submitted 25 January, 2024; originally announced January 2024.

    Comments: Accepted by ICLR 2024

  38. arXiv:2401.11354  [pdf, other]

    math.PR cs.LG stat.ME

    Squared Wasserstein-2 Distance for Efficient Reconstruction of Stochastic Differential Equations

    Authors: Mingtao Xia, Xiangting Li, Qijing Shen, Tom Chou

    Abstract: We provide an analysis of the squared Wasserstein-2 ($W_2$) distance between two probability distributions associated with two stochastic differential equations (SDEs). Based on this analysis, we propose the use of squared $W_2$ distance-based loss functions in the reconstruction of SDEs from noisy data. To demonstrate the practicality of our Wasserstein distance-based loss functions, w…

    Submitted 20 January, 2024; originally announced January 2024.

    Comments: 37 pages, 5 figures

    MSC Class: 60H10; 49Q22

  39. arXiv:2401.07267  [pdf, other]

    stat.ME

    Inference for high-dimensional linear expectile regression with de-biased method

    Authors: Xiang Li, Yu-Ning Li, Li-Xin Zhang, Jun Zhao

    Abstract: In this paper, we address the inference problem in high-dimensional linear expectile regression. We transform the expectile loss into a weighted-least-squares form and apply a de-biased strategy to establish Wald-type tests for multiple constraints within a regularized framework. Simultaneously, we construct an estimator for the pseudo-inverse of the generalized Hessian matrix in high dimension wi…

    Submitted 14 January, 2024; originally announced January 2024.

    Comments: 34 pages

    MSC Class: 62F05; 62F12; 62J12

  40. arXiv:2401.06348  [pdf, other]

    stat.ME stat.AP

    A Fully Bayesian Approach for Comprehensive Mapping of Magnitude and Phase Brain Activation in Complex-Valued fMRI Data

    Authors: Zhengxin Wang, Daniel B. Rowe, Xinyi Li, D. Andrew Brown

    Abstract: Functional magnetic resonance imaging (fMRI) plays a crucial role in neuroimaging, enabling the exploration of brain activity through complex-valued signals. These signals, composed of magnitude and phase, offer a rich source of information for understanding brain functions. Traditional fMRI analyses have largely focused on magnitude information, often overlooking the potential insights offered by…

    Submitted 11 January, 2024; originally announced January 2024.

  41. arXiv:2401.04857  [pdf, other]

    cs.LG stat.AP

    Transportation Marketplace Rate Forecast Using Signature Transform

    Authors: Haotian Gu, Xin Guo, Timothy L. Jacobs, Philip Kaminsky, Xinyu Li

    Abstract: Freight transportation marketplace rates are typically challenging to forecast accurately. In this work, we have developed a novel statistical technique based on signature transforms and have built a predictive and adaptive model to forecast these marketplace rates. Our technique is based on two key elements of the signature transform: one being its universal nonlinearity property, which linearize…

    Submitted 14 February, 2024; v1 submitted 9 January, 2024; originally announced January 2024.

  42. arXiv:2401.03893  [pdf, other]

    math.OC stat.ML

    Finite-Time Decoupled Convergence in Nonlinear Two-Time-Scale Stochastic Approximation

    Authors: Yuze Han, Xiang Li, Zhihua Zhang

    Abstract: In two-time-scale stochastic approximation (SA), two iterates are updated at varying speeds using different step sizes, with each update influencing the other. Previous studies in linear two-time-scale SA have found that the convergence rates of the mean-square errors for these updates are dependent solely on their respective step sizes, leading to what is referred to as decoupled convergence. How…

    Submitted 8 January, 2024; originally announced January 2024.

  43. arXiv:2401.01064  [pdf, other]

    stat.ME econ.EM

    Robust Inference for Multiple Predictive Regressions with an Application on Bond Risk Premia

    Authors: Xiaosai Liao, Xinjue Li, Qingliang Fan

    Abstract: We propose a robust hypothesis testing procedure for the predictability of multiple predictors that could be highly persistent. Our method improves the popular extended instrumental variable (IVX) testing (Phillips and Lee, 2013; Kostakis et al., 2015) in that, besides addressing the two bias effects found in Hosseinkouchack and Demetrescu (2021), we find and deal with the variance-enlargement eff…

    Submitted 2 January, 2024; originally announced January 2024.

  44. arXiv:2312.10920  [pdf, other]

    cs.LG stat.ME

    Domain adaption and physical constrains transfer learning for shale gas production

    Authors: Zhaozhong Yang, Liangjie Gou, Chao Min, Duo Yi, Xiaogang Li, Guoquan Wen

    Abstract: Effective prediction of shale gas production is crucial for strategic reservoir development. However, in new shale gas blocks, two main challenges are encountered: (1) the occurrence of negative transfer due to insufficient data, and (2) the limited interpretability of deep learning (DL) models. To tackle these problems, we propose a novel transfer learning methodology that utilizes domain adaptat…

    Submitted 17 December, 2023; originally announced December 2023.

  45. arXiv:2312.09393  [pdf]

    stat.AP

    Bi-scale Car-following Model Calibration for Corridor Based on Trajectory

    Authors: Keke Long, Haotian Shi, Zhiwei Chen, Zhaohui Liang, Xiaopeng Li, Felipe de Souza

    Abstract: The precise estimation of macroscopic traffic parameters, such as travel time and fuel consumption, is essential for the optimization of traffic management systems. Despite its importance, the comprehensive acquisition of vehicle trajectory data for the calculation of these macroscopic measures presents a challenge. To bridge this gap, this study aims to calibrate car-following models capable of p…

    Submitted 14 December, 2023; originally announced December 2023.

  46. arXiv:2312.05757  [pdf, ps, other]

    cs.LG cs.AI cs.DL cs.SI stat.ME

    Towards Human-like Perception: Learning Structural Causal Model in Heterogeneous Graph

    Authors: Tianqianjin Lin, Kaisong Song, Zhuoren Jiang, Yangyang Kang, Weikang Yuan, Xurui Li, Changlong Sun, Cui Huang, Xiaozhong Liu

    Abstract: Heterogeneous graph neural networks have become popular in various domains. However, their generalizability and interpretability are limited due to the discrepancy between their inherent inference flows and human reasoning logic or underlying causal relationships for the learning problem. This study introduces a novel solution, HG-SCM (Heterogeneous Graph as Structural Causal Model). It can mimic…

    Submitted 9 December, 2023; originally announced December 2023.

    Comments: 28 pages, 10 figures, 6 tables, accepted by Information Processing & Management

    Journal ref: Information Processing & Management, 60 (2024) 1-21

  47. arXiv:2312.02513  [pdf, other]

    stat.ME math.ST

    Asymptotic Theory of the Best-Choice Rerandomization using the Mahalanobis Distance

    Authors: Yuhao Wang, Xinran Li

    Abstract: Rerandomization, a design that utilizes pretreatment covariates and improves their balance between different treatment groups, has received attention recently in both theory and practice. There are at least two types of rerandomization that are used in practice: the first rerandomizes the treatment assignment until covariate imbalance is below a prespecified threshold; the second randomizes the tr…

    Submitted 5 December, 2023; originally announced December 2023.

  48. arXiv:2311.15539  [pdf]

    stat.CO

    A Novel Human-Based Meta-Heuristic Algorithm: Dragon Boat Optimization

    Authors: Xiang Li, Long Lan, Husam Lahza, Shaowu Yang, Shuihua Wang, Wenjing Yang, Hengzhu Liu, Yudong Zhang

    Abstract: (Aim) Dragon Boat Racing, a popular aquatic folklore team sport, is traditionally held during the Dragon Boat Festival. Inspired by this event, we propose a novel human-based meta-heuristic algorithm called dragon boat optimization (DBO) in this paper. (Method) It models the unique behaviors of each crew member on the dragon boat during the race by introducing social psychology mechanisms (social…

    Submitted 27 November, 2023; originally announced November 2023.

  49. arXiv:2311.14222  [pdf, other]

    cs.LG math.OC stat.ML

    Risk Bounds of Accelerated SGD for Overparameterized Linear Regression

    Authors: Xuheng Li, Yihe Deng, Jingfeng Wu, Dongruo Zhou, Quanquan Gu

    Abstract: Accelerated stochastic gradient descent (ASGD) is a workhorse in deep learning and often achieves better generalization performance than SGD. However, existing optimization theory can only explain the faster convergence of ASGD, but cannot explain its better generalization. In this paper, we study the generalization of ASGD for overparameterized linear regression, which is possibly the simplest se…

    Submitted 23 November, 2023; originally announced November 2023.

    Comments: 85 pages, 5 figures

  50. arXiv:2311.14142  [pdf]

    stat.AP

    Retention in STEM: Factors Influencing Student Persistence and Employment

    Authors: Linli Zhou, Damji Heo Stratton, Xin Li

    Abstract: This study utilizes data from the Baccalaureate and Beyond Longitudinal Study to explore factors associated with the likelihood of students' employment in STEM fields one year after graduation. We examined various factors related to students' individual characteristics (e.g., gender, race, and financial situation), institutional experiences (e.g., major, academic standing, research involvement, in…

    Submitted 24 June, 2024; v1 submitted 23 November, 2023; originally announced November 2023.

    Journal ref: The Proceedings of the 19th Annual National Symposium on Student Retention, 2023