Skip to main content

Showing 1–50 of 397 results for author: Zhang, L

Searching in archive stat. Search in all archives.
.
  1. arXiv:2407.15020  [pdf

    cs.CY cs.LG stat.ML

    Integrating Attentional Factors and Spacing in Logistic Knowledge Tracing Models to Explore the Impact of Training Sequences on Category Learning

    Authors: Meng Cao, Philip I. Pavlik Jr., Wei Chu, Liang Zhang

    Abstract: In category learning, a growing body of literature has increasingly focused on exploring the impacts of interleaving in contrast to blocking. The sequential attention hypothesis posits that interleaving draws attention to the differences between categories while blocking directs attention toward similarities within categories. Although a recent study underscores the joint influence of memory and a… ▽ More

    Submitted 22 June, 2024; originally announced July 2024.

    Comments: 7 pages, 3 figures, Educational Data Mining 2024

  2. arXiv:2407.14335  [pdf, other

    econ.GN cs.CE cs.CR q-fin.CP stat.CO

    Quantifying the Blockchain Trilemma: A Comparative Analysis of Algorand, Ethereum 2.0, and Beyond

    Authors: Yihang Fu, Mingwei Jing, Jiaolun Zhou, Peilin Wu, Ye Wang, Luyao Zhang, Chuang Hu

    Abstract: Blockchain technology is essential for the digital economy and metaverse, supporting applications from decentralized finance to virtual assets. However, its potential is constrained by the "Blockchain Trilemma," which necessitates balancing decentralization, security, and scalability. This study evaluates and compares two leading proof-of-stake (PoS) systems, Algorand and Ethereum 2.0, against the… ▽ More

    Submitted 19 July, 2024; originally announced July 2024.

  3. arXiv:2407.04967  [pdf, other

    stat.CO

    posteriordb: Testing, Benchmarking and Developing Bayesian Inference Algorithms

    Authors: Måns Magnusson, Jakob Torgander, Paul-Christian Bürkner, Lu Zhang, Bob Carpenter, Aki Vehtari

    Abstract: The generality and robustness of inference algorithms is critical to the success of widely used probabilistic programming languages such as Stan, PyMC, Pyro, and Turing.jl. When designing a new general-purpose inference algorithm, whether it involves Monte Carlo sampling or variational approximation, the fundamental problem arises in evaluating its accuracy and efficiency across a range of represe… ▽ More

    Submitted 6 July, 2024; originally announced July 2024.

  4. arXiv:2406.18603  [pdf, other

    stat.AP cs.LG

    Confidence interval estimation of mixed oil length with conditional diffusion model

    Authors: Yanfeng Yang, Lihong Zhang, Ziqi Chen, Miaomiao Yu, Lei Chen

    Abstract: Accurately estimating the mixed oil length plays a big role in the economic benefit for oil pipeline network. While various proposed methods have tried to predict the mixed oil length, they often exhibit an extremely high probability (around 50\%) of underestimating it. This is attributed to their failure to consider the statistical variability inherent in the estimated length of mixed oil. To add… ▽ More

    Submitted 19 June, 2024; originally announced June 2024.

  5. arXiv:2406.18035  [pdf, other

    cs.LG stat.ML

    Local Linear Recovery Guarantee of Deep Neural Networks at Overparameterization

    Authors: Yaoyu Zhang, Leyang Zhang, Zhongwang Zhang, Zhiwei Bai

    Abstract: Determining whether deep neural network (DNN) models can reliably recover target functions at overparameterization is a critical yet complex issue in the theory of deep learning. To advance understanding in this area, we introduce a concept we term "local linear recovery" (LLR), a weaker form of target function recovery that renders the problem more amenable to theoretical analysis. In the sense o… ▽ More

    Submitted 25 June, 2024; originally announced June 2024.

    Comments: arXiv admin note: text overlap with arXiv:2211.11623

  6. arXiv:2406.16221  [pdf, other

    cs.LG cs.AI cs.GR econ.EM stat.ME

    F-FOMAML: GNN-Enhanced Meta-Learning for Peak Period Demand Forecasting with Proxy Data

    Authors: Zexing Xu, Linjun Zhang, Sitan Yang, Rasoul Etesami, Hanghang Tong, Huan Zhang, Jiawei Han

    Abstract: Demand prediction is a crucial task for e-commerce and physical retail businesses, especially during high-stake sales events. However, the limited availability of historical data from these peak periods poses a significant challenge for traditional forecasting methods. In this paper, we propose a novel approach that leverages strategically chosen proxy data reflective of potential sales patterns f… ▽ More

    Submitted 23 June, 2024; originally announced June 2024.

    MSC Class: 68T07; 68T05; 62M10; 62M20; 90C90; 91B84

  7. arXiv:2406.15514  [pdf, other

    physics.soc-ph q-bio.PE stat.ME

    How big does a population need to be before demographers can ignore individual-level randomness in demographic events?

    Authors: John Bryant, Tahu Kukutai, Junni L. Zhang

    Abstract: When studying a national-level population, demographers can safely ignore the effect of individual-level randomness on age-sex structure. When studying a single community, or group of communities, however, the potential importance of individual-level randomness is less clear. We seek to measure the effect of individual-level randomness in births and deaths on standard summary indicators of age-sex… ▽ More

    Submitted 20 June, 2024; originally announced June 2024.

    Comments: 28 pages, 8 figures, 3 tables

    MSC Class: 91-XX

  8. arXiv:2406.05304  [pdf, other

    stat.ME

    Polytomous Explanatory Item Response Models for Item Discrimination: Assessing Negative-Framing Effects in Social-Emotional Learning Surveys

    Authors: Joshua B. Gilbert, Lijin Zhang, Esther Ulitzsch, Benjamin W. Domingue

    Abstract: Modeling item parameters as a function of item characteristics has a long history but has generally focused on models for item location. Explanatory item response models for item discrimination are available but rarely used. In this study, we extend existing approaches for modeling item discrimination from dichotomous to polytomous item responses. We illustrate our proposed approach with an applic… ▽ More

    Submitted 7 June, 2024; originally announced June 2024.

  9. arXiv:2406.04655  [pdf, other

    stat.ME stat.CO

    Bayesian Inference for Spatial-temporal Non-Gaussian Data Using Predictive Stacking

    Authors: Soumyakanti Pan, Lu Zhang, Jonathan R. Bradley, Sudipto Banerjee

    Abstract: Analysing non-Gaussian spatial-temporal data typically requires introducing spatial dependence in generalised linear models through the link function of an exponential family distribution. However, unlike in Gaussian likelihoods, inference is considerably encumbered by the inability to analytically integrate out the random effects and reduce the dimension of the parameter space. Iterative estimati… ▽ More

    Submitted 7 June, 2024; originally announced June 2024.

    Comments: 31 pages, 8 figures

  10. arXiv:2406.03707  [pdf, other

    cs.LG cs.AI cs.CL stat.ML

    What Should Embeddings Embed? Autoregressive Models Represent Latent Generating Distributions

    Authors: Liyi Zhang, Michael Y. Li, Thomas L. Griffiths

    Abstract: Autoregressive language models have demonstrated a remarkable ability to extract latent structure from text. The embeddings from large language models have been shown to capture aspects of the syntax and semantics of language. But what {\em should} embeddings represent? We connect the autoregressive prediction objective to the idea of constructing predictive sufficient statistics to summarize the… ▽ More

    Submitted 5 June, 2024; originally announced June 2024.

    Comments: 15 pages, 8 figures

    ACM Class: I.2; I.5

  11. arXiv:2406.03628  [pdf, other

    stat.ML cs.LG

    Synthetic Oversampling: Theory and A Practical Approach Using LLMs to Address Data Imbalance

    Authors: Ryumei Nakada, Yichen Xu, Lexin Li, Linjun Zhang

    Abstract: Imbalanced data and spurious correlations are common challenges in machine learning and data science. Oversampling, which artificially increases the number of instances in the underrepresented classes, has been widely adopted to tackle these challenges. In this article, we introduce OPAL (\textbf{O}versam\textbf{P}ling with \textbf{A}rtificial \textbf{L}LM-generated data), a systematic oversamplin… ▽ More

    Submitted 5 June, 2024; originally announced June 2024.

    Comments: 59 pages, 7 figures

  12. arXiv:2406.02948  [pdf, other

    stat.ME stat.AP

    Copula-based semiparametric nonnormal transformed linear model for survival data with dependent censoring

    Authors: Huazhen Yu, Lixin Zhang

    Abstract: Although the independent censoring assumption is commonly used in survival analysis, it can be violated when the censoring time is related to the survival time, which often happens in many practical applications. To address this issue, we propose a flexible semiparametric method for dependent censored data. Our approach involves fitting the survival time and the censoring time with a joint transfo… ▽ More

    Submitted 5 June, 2024; originally announced June 2024.

  13. arXiv:2406.01557  [pdf, other

    stat.ME stat.AP

    Bayesian compositional regression with flexible microbiome feature aggregation and selection

    Authors: Satabdi Saha, Liangliang Zhang, Kim-Anh Do, Christine B. Peterson

    Abstract: Ongoing advances in microbiome profiling have allowed unprecedented insights into the molecular activities of microbial communities. This has fueled a strong scientific interest in understanding the critical role the microbiome plays in governing human health, by identifying microbial features associated with clinical outcomes of interest. Several aspects of microbiome data limit the applicability… ▽ More

    Submitted 3 June, 2024; originally announced June 2024.

  14. arXiv:2405.18373  [pdf, other

    stat.ML cs.LG math.OC

    A Hessian-Aware Stochastic Differential Equation for Modelling SGD

    Authors: Xiang Li, Zebang Shen, Liang Zhang, Niao He

    Abstract: Continuous-time approximation of Stochastic Gradient Descent (SGD) is a crucial tool to study its escaping behaviors from stationary points. However, existing stochastic differential equation (SDE) models fail to fully capture these behaviors, even for simple quadratic objectives. Built on a novel stochastic backward error analysis framework, we derive the Hessian-Aware Stochastic Modified Equatio… ▽ More

    Submitted 28 May, 2024; originally announced May 2024.

  15. arXiv:2405.14780  [pdf, other

    cs.LG stat.ML

    Metric Flow Matching for Smooth Interpolations on the Data Manifold

    Authors: Kacper Kapusniak, Peter Potaptchik, Teodora Reu, Leo Zhang, Alexander Tong, Michael Bronstein, Avishek Joey Bose, Francesco Di Giovanni

    Abstract: Matching objectives underpin the success of modern generative models and rely on constructing conditional paths that transform a source distribution into a target distribution. Despite being a fundamental building block, conditional paths have been designed principally under the assumption of Euclidean geometry, resulting in straight interpolations. However, this can be particularly restrictive fo… ▽ More

    Submitted 23 May, 2024; originally announced May 2024.

  16. arXiv:2405.04026  [pdf, other

    stat.ML cs.LG

    Federated Control in Markov Decision Processes

    Authors: Hao Jin, Yang Peng, Liangyu Zhang, Zhihua Zhang

    Abstract: We study problems of federated control in Markov Decision Processes. To solve an MDP with large state space, multiple learning agents are introduced to collaboratively learn its optimal policy without communication of locally collected experience. In our settings, these agents have limited capabilities, which means they are restricted within different regions of the overall state space during the… ▽ More

    Submitted 7 May, 2024; originally announced May 2024.

  17. arXiv:2405.03236  [pdf, other

    cs.LG stat.ML

    Federated Reinforcement Learning with Constraint Heterogeneity

    Authors: Hao Jin, Liangyu Zhang, Zhihua Zhang

    Abstract: We study a Federated Reinforcement Learning (FedRL) problem with constraint heterogeneity. In our setting, we aim to solve a reinforcement learning problem with multiple constraints while $N$ training agents are located in $N$ different environments with limited access to the constraint signals and they are expected to collaboratively learn a policy satisfying all constraint signals. Such learning… ▽ More

    Submitted 6 May, 2024; originally announced May 2024.

  18. arXiv:2405.02225  [pdf, other

    stat.ML cs.AI cs.CY cs.LG stat.ME

    Fair Risk Control: A Generalized Framework for Calibrating Multi-group Fairness Risks

    Authors: Lujing Zhang, Aaron Roth, Linjun Zhang

    Abstract: This paper introduces a framework for post-processing machine learning models so that their predictions satisfy multi-group fairness guarantees. Based on the celebrated notion of multicalibration, we introduce $(\mathbf{s},\mathcal{G}, α)-$GMC (Generalized Multi-Dimensional Multicalibration) for multi-dimensional mappings $\mathbf{s}$, constraint set $\mathcal{G}$, and a pre-specified threshold le… ▽ More

    Submitted 3 May, 2024; originally announced May 2024.

    Comments: 28 pages, 8 figures, accepted by ICML2024

  19. arXiv:2404.16287  [pdf, other

    stat.ML cs.CR cs.LG math.ST stat.ME

    Differentially Private Federated Learning: Servers Trustworthiness, Estimation, and Statistical Inference

    Authors: Zhe Zhang, Ryumei Nakada, Linjun Zhang

    Abstract: Differentially private federated learning is crucial for maintaining privacy in distributed environments. This paper investigates the challenges of high-dimensional estimation and inference under the constraints of differential privacy. First, we study scenarios involving an untrusted central server, demonstrating the inherent difficulties of accurate estimation in high-dimensional problems. Our f… ▽ More

    Submitted 24 April, 2024; originally announced April 2024.

    Comments: 56 pages, 3 figures

  20. arXiv:2404.09353  [pdf, other

    stat.ME stat.AP stat.ML

    A Unified Combination Framework for Dependent Tests with Applications to Microbiome Association Studies

    Authors: Xiufan Yu, Linjun Zhang, Arun Srinivasan, Min-ge Xie, Lingzhou Xue

    Abstract: We introduce a novel meta-analysis framework to combine dependent tests under a general setting, and utilize it to synthesize various microbiome association tests that are calculated from the same dataset. Our development builds upon the classical meta-analysis methods of aggregating $p$-values and also a more recent general method of combining confidence distributions, but makes generalizations t… ▽ More

    Submitted 14 April, 2024; originally announced April 2024.

  21. arXiv:2404.01608  [pdf, ps, other

    stat.ML cs.LG stat.ME

    FAIRM: Learning invariant representations for algorithmic fairness and domain generalization with minimax optimality

    Authors: Sai Li, Linjun Zhang

    Abstract: Machine learning methods often assume that the test data have the same distribution as the training data. However, this assumption may not hold due to multiple levels of heterogeneity in applications, raising issues in algorithmic fairness and domain generalization. In this work, we address the problem of fair and generalizable machine learning by invariant principles. We propose a training enviro… ▽ More

    Submitted 1 April, 2024; originally announced April 2024.

  22. arXiv:2403.14926  [pdf, other

    stat.ML cs.LG

    Contrastive Learning on Multimodal Analysis of Electronic Health Records

    Authors: Tianxi Cai, Feiqing Huang, Ryumei Nakada, Linjun Zhang, Doudou Zhou

    Abstract: Electronic health record (EHR) systems contain a wealth of multimodal clinical data including structured data like clinical codes and unstructured data such as clinical notes. However, many existing EHR-focused studies has traditionally either concentrated on an individual modality or merged different modalities in a rather rudimentary fashion. This approach often results in the perception of stru… ▽ More

    Submitted 21 March, 2024; originally announced March 2024.

    Comments: 34 pages

  23. arXiv:2403.12859  [pdf, other

    math.OC cs.LG stat.ML

    Primal Methods for Variational Inequality Problems with Functional Constraints

    Authors: Liang Zhang, Niao He, Michael Muehlebach

    Abstract: Constrained variational inequality problems are recognized for their broad applications across various fields including machine learning and operations research. First-order methods have emerged as the standard approach for solving these problems due to their simplicity and scalability. However, they typically rely on projection or linear minimization oracles to navigate the feasible set, which be… ▽ More

    Submitted 19 March, 2024; originally announced March 2024.

  24. arXiv:2403.09984  [pdf, ps, other

    stat.ME

    Repro Samples Method for High-dimensional Logistic Model

    Authors: Xiaotian Hou, Linjun Zhang, Peng Wang, Min-ge Xie

    Abstract: This paper presents a novel method to make statistical inferences for both the model support and regression coefficients in a high-dimensional logistic regression model. Our method is based on the repro samples framework, in which we conduct statistical inference by generating artificial samples mimicking the actual data-generating process. The proposed method has two major advantages. Firstly, fo… ▽ More

    Submitted 14 March, 2024; originally announced March 2024.

  25. arXiv:2403.05811  [pdf, ps, other

    stat.ML cs.LG

    Near Minimax-Optimal Distributional Temporal Difference Algorithms and The Freedman Inequality in Hilbert Spaces

    Authors: Yang Peng, Liangyu Zhang, Zhihua Zhang

    Abstract: Distributional reinforcement learning (DRL) has achieved empirical success in various domains. One of the core tasks in the field of DRL is distributional policy evaluation, which involves estimating the return distribution $η^π$ for a given policy $π$. The distributional temporal difference (TD) algorithm has been accordingly proposed, which is an extension of the temporal difference algorithm in… ▽ More

    Submitted 14 March, 2024; v1 submitted 9 March, 2024; originally announced March 2024.

  26. arXiv:2403.05006  [pdf, ps, other

    cs.LG cs.AI stat.ME stat.ML

    Provable Multi-Party Reinforcement Learning with Diverse Human Feedback

    Authors: Huiying Zhong, Zhun Deng, Weijie J. Su, Zhiwei Steven Wu, Linjun Zhang

    Abstract: Reinforcement learning with human feedback (RLHF) is an emerging paradigm to align models with human preferences. Typically, RLHF aggregates preferences from multiple individuals who have diverse viewpoints that may conflict with each other. Our work \textit{initiates} the theoretical study of multi-party RLHF that explicitly models the diverse preferences of multiple individuals. We show how trad… ▽ More

    Submitted 7 March, 2024; originally announced March 2024.

  27. arXiv:2403.03562  [pdf, other

    cs.LG stat.ML

    Efficient Algorithms for Empirical Group Distributional Robust Optimization and Beyond

    Authors: Dingzhi Yu, Yunuo Cai, Wei Jiang, Lijun Zhang

    Abstract: We investigate the empirical counterpart of group distributionally robust optimization (GDRO), which aims to minimize the maximal empirical risk across $m$ distinct groups. We formulate empirical GDRO as a $\textit{two-level}$ finite-sum convex-concave minimax optimization problem and develop a stochastic variance reduced mirror prox algorithm. Unlike existing methods, we construct the stochastic… ▽ More

    Submitted 6 March, 2024; originally announced March 2024.

    Comments: 30 pages, 1 figure

  28. arXiv:2402.16158  [pdf, other

    stat.ML cs.CY cs.LG

    Distribution-Free Fair Federated Learning with Small Samples

    Authors: Qichuan Yin, Junzhou Huang, Huaxiu Yao, Linjun Zhang

    Abstract: As federated learning gains increasing importance in real-world applications due to its capacity for decentralized data training, addressing fairness concerns across demographic groups becomes critically important. However, most existing machine learning algorithms for ensuring fairness are designed for centralized data environments and generally require large-sample and distributional assumptions… ▽ More

    Submitted 25 February, 2024; originally announced February 2024.

  29. arXiv:2401.08150  [pdf, other

    stat.ML cs.CR cs.LG math.ST

    Differentially Private Sliced Inverse Regression: Minimax Optimality and Algorithm

    Authors: Xintao Xia, Linjun Zhang, Zhanrui Cai

    Abstract: Privacy preservation has become a critical concern in high-dimensional data analysis due to the growing prevalence of data-driven applications. Proposed by Li (1991), sliced inverse regression has emerged as a widely utilized statistical technique for reducing covariate dimensionality while maintaining sufficient statistical information. In this paper, we propose optimally differentially private a… ▽ More

    Submitted 16 January, 2024; originally announced January 2024.

  30. arXiv:2401.07267  [pdf, other

    stat.ME

    Inference for high-dimensional linear expectile regression with de-biased method

    Authors: Xiang Li, Yu-Ning Li, Li-Xin Zhang, Jun Zhao

    Abstract: In this paper, we address the inference problem in high-dimensional linear expectile regression. We transform the expectile loss into a weighted-least-squares form and apply a de-biased strategy to establish Wald-type tests for multiple constraints within a regularized framework. Simultaneously, we construct an estimator for the pseudo-inverse of the generalized Hessian matrix in high dimension wi… ▽ More

    Submitted 14 January, 2024; originally announced January 2024.

    Comments: 34 pages

    MSC Class: 62F05; 62F12; 62J12

  31. arXiv:2401.02708  [pdf, other

    cs.LG cs.AI stat.ML

    TripleSurv: Triplet Time-adaptive Coordinate Loss for Survival Analysis

    Authors: Liwen Zhang, Lianzhen Zhong, Fan Yang, Di Dong, Hui Hui, Jie Tian

    Abstract: A core challenge in survival analysis is to model the distribution of censored time-to-event data, where the event of interest may be a death, failure, or occurrence of a specific event. Previous studies have showed that ranking and maximum likelihood estimation (MLE)loss functions are widely-used for survival analysis. However, ranking loss only focus on the ranking of survival time and does not… ▽ More

    Submitted 5 January, 2024; originally announced January 2024.

    Comments: 9 pages,6 figures

  32. arXiv:2312.16004  [pdf, other

    stat.AP math.NA

    Computing Gerber-Shiu function in the classical risk model with interest using collocation method

    Authors: Zan Yu, Lianzeng Zhang

    Abstract: The Gerber-Shiu function is a classical research topic in actuarial science.However, exact solutions are only available in the literature for very specific cases where the claim amounts follow distributions such as the exponential distribution. This presents a longstanding challenge, particularly from a computational perspective. For the classical risk process in continuous time, the Gerber-Shiu d… ▽ More

    Submitted 26 December, 2023; originally announced December 2023.

    Comments: 24 pages

  33. arXiv:2312.14226  [pdf, other

    cs.CL cs.AI cs.LG stat.ML

    Deep de Finetti: Recovering Topic Distributions from Large Language Models

    Authors: Liyi Zhang, R. Thomas McCoy, Theodore R. Sumers, Jian-Qiao Zhu, Thomas L. Griffiths

    Abstract: Large language models (LLMs) can produce long, coherent passages of text, suggesting that LLMs, although trained on next-word prediction, must represent the latent structure that characterizes a document. Prior work has found that internal representations of LLMs encode one aspect of latent structure, namely syntax; here we investigate a complementary aspect, namely the document's topic structure.… ▽ More

    Submitted 21 December, 2023; originally announced December 2023.

    Comments: 13 pages, 4 figures

    ACM Class: I.2.6; I.2.7

  34. arXiv:2312.10706  [pdf, other

    stat.ME

    Margin-closed regime-switching multivariate time series models

    Authors: Lin Zhang, Harry Joe, Natalia Nolde

    Abstract: A regime-switching multivariate time series model which is closed under margins is built. The model imposes a restriction on all lower-dimensional sub-processes to follow a regime-switching process sharing the same latent regime sequence and having the same Markov order as the original process. The margin-closed regime-switching model is constructed by considering the multivariate margin-closed Ga… ▽ More

    Submitted 17 December, 2023; originally announced December 2023.

  35. arXiv:2312.04610  [pdf

    cs.LG cs.AI eess.SP stat.OT

    Data-driven Semi-supervised Machine Learning with Surrogate Safety Measures for Abnormal Driving Behavior Detection

    Authors: Yongqi Dong, Lanxin Zhang, Haneen Farah, Arkady Zgonnikov, Bart van Arem

    Abstract: Detecting abnormal driving behavior is critical for road traffic safety and the evaluation of drivers' behavior. With the advancement of machine learning (ML) algorithms and the accumulation of naturalistic driving data, many ML models have been adopted for abnormal driving behavior detection. Most existing ML-based detectors rely on (fully) supervised ML methods, which require substantial labeled… ▽ More

    Submitted 24 May, 2024; v1 submitted 7 December, 2023; originally announced December 2023.

    Comments: 22 pages, 10 figures, accepted by the 103rd Transportation Research Board (TRB) Annual Meeting, under third round review by Transportation Research Record: Journal of the Transportation Research Board

  36. arXiv:2312.02660  [pdf, other

    econ.GN cs.CE cs.CR cs.CY stat.AP

    Uniswap Daily Transaction Indices by Network

    Authors: Nir Chemaya, Lin William Cong, Emma Jorgensen, Dingyue Liu, Luyao Zhang

    Abstract: DeFi is transforming financial services by removing intermediaries and producing a wealth of open-source data. This transformation is propelled by Layer 2 (L2) solutions, aimed at boosting network efficiency and scalability beyond current Layer 1 (L1) capabilities. This study addresses the lack of detailed L2 impact analysis by examining over 50 million transactions from Uniswap. Our dataset, feat… ▽ More

    Submitted 5 December, 2023; originally announced December 2023.

  37. arXiv:2312.00219  [pdf, other

    math.ST stat.ME

    The Functional Average Treatment Effect

    Authors: Shane Sparkes, Erika Garcia, Lu Zhang

    Abstract: This paper establishes the functional average as an important estimand for causal inference. The significance of the estimand lies in its robustness against traditional issues of confounding. We prove that this robustness holds even when the probability distribution of the outcome, conditional on treatment or some other vector of adjusting variables, differs almost arbitrarily from its counterfact… ▽ More

    Submitted 30 November, 2023; originally announced December 2023.

    Comments: 52 pages (40 main document; 12 supplementary), 1 figure

    MSC Class: 60E05; 62J99 (Primary); 62G30; 62G32 (Secondary)

  38. arXiv:2311.17476  [pdf, other

    stat.ME math.ST

    Inference of Sample Complier Average Causal Effects in Completely Randomized Experiments

    Authors: Zhen Zhong, Per Johansson, Junni L. Zhang

    Abstract: In randomized experiments with non-compliance scholars have argued that the complier average causal effect (CACE) ought to be the main causal estimand. The literature on inference of the complier average treatment effect (CACE) has focused on inference about the population CACE. However, in general individuals in the experiments are volunteers. This means that there is a risk that individuals part… ▽ More

    Submitted 29 November, 2023; originally announced November 2023.

  39. arXiv:2311.17445  [pdf, ps, other

    stat.ME math.ST

    Interaction tests with covariate-adaptive randomization

    Authors: Likun Zhang, Wei Ma

    Abstract: Treatment-covariate interaction tests are commonly applied by researchers to examine whether the treatment effect varies across patient subgroups defined by baseline characteristics. The objective of this study is to explore treatment-covariate interaction tests involving covariate-adaptive randomization. Without assuming a parametric data generating model, we investigate usual interaction tests a… ▽ More

    Submitted 10 March, 2024; v1 submitted 29 November, 2023; originally announced November 2023.

  40. arXiv:2311.14676  [pdf, other

    cs.CY cs.CR cs.HC econ.GN stat.AP

    Decoding Social Sentiment in DAO: A Comparative Analysis of Blockchain Governance Communities

    Authors: Yutong Quan, Xintong Wu, Wanlin Deng, Luyao Zhang

    Abstract: Blockchain technology is leading a revolutionary transformation across diverse industries, with effective governance being critical for the success and sustainability of blockchain projects. Community forums, pivotal in engaging decentralized autonomous organizations (DAOs), significantly impact blockchain governance decisions. Concurrently, Natural Language Processing (NLP), particularly sentimen… ▽ More

    Submitted 25 May, 2024; v1 submitted 31 October, 2023; originally announced November 2023.

  41. arXiv:2311.11256  [pdf, other

    stat.ME

    Bayesian Modeling of Incompatible Spatial Data: A Case Study Involving Post-Adrian Storm Forest Damage Assessment

    Authors: Lu Zhang, Andrew O. Finley, Arne Nothdurft, Sudipto Banerjee

    Abstract: Incompatible spatial data modeling is a pervasive challenge in remote sensing data analysis that involves field data. Typical approaches to addressing this challenge aggregate information to a coarser common scale, i.e., compatible resolutions. Such pre-processing aggregation to a common resolution simplifies analysis, but potentially causes information loss and hence compromised inference and pre… ▽ More

    Submitted 19 November, 2023; originally announced November 2023.

    Comments: 15 pages, 10 figures

  42. arXiv:2311.10638  [pdf, other

    cs.LG cs.AI stat.ME

    Concept-free Causal Disentanglement with Variational Graph Auto-Encoder

    Authors: Jingyun Feng, Lin Zhang, Lili Yang

    Abstract: In disentangled representation learning, the goal is to achieve a compact representation that consists of all interpretable generative factors in the observational data. Learning disentangled representations for graphs becomes increasingly important as graph data rapidly grows. Existing approaches often rely on Variational Auto-Encoder (VAE) or its causal structure learning-based refinement, which… ▽ More

    Submitted 17 November, 2023; originally announced November 2023.

  43. arXiv:2311.08434  [pdf, other

    cs.LG cs.AI stat.ML

    Uplift Modeling based on Graph Neural Network Combined with Causal Knowledge

    Authors: Haowen Wang, Xinyan Ye, Yangze Zhou, Zhiyi Zhang, Longhan Zhang, Jing Jiang

    Abstract: Uplift modeling is a fundamental component of marketing effect modeling, which is commonly employed to evaluate the effects of treatments on outcomes. Through uplift modeling, we can identify the treatment with the greatest benefit. On the other side, we can identify clients who are likely to make favorable decisions in response to a certain treatment. In the past, uplift modeling approaches relie… ▽ More

    Submitted 14 November, 2023; originally announced November 2023.

    Comments: 6 pages, 6 figures

  44. arXiv:2310.17759  [pdf, other

    cs.LG math.OC stat.ML

    Optimal Guarantees for Algorithmic Reproducibility and Gradient Complexity in Convex Optimization

    Authors: Liang Zhang, Junchi Yang, Amin Karbasi, Niao He

    Abstract: Algorithmic reproducibility measures the deviation in outputs of machine learning algorithms upon minor changes in the training process. Previous work suggests that first-order methods would need to trade-off convergence rate (gradient complexity) for better reproducibility. In this work, we challenge this perception and demonstrate that both optimal reproducibility and near-optimal convergence gu… ▽ More

    Submitted 9 January, 2024; v1 submitted 26 October, 2023; originally announced October 2023.

    Comments: NeurIPS 2023 Spotlight

  45. arXiv:2310.16260  [pdf, other

    stat.ME

    Private Estimation and Inference in High-Dimensional Regression with FDR Control

    Authors: Zhanrui Cai, Sai Li, Xintao Xia, Linjun Zhang

    Abstract: This paper presents novel methodologies for conducting practical differentially private (DP) estimation and inference in high-dimensional linear regression. We start by proposing a differentially private Bayesian Information Criterion (BIC) for selecting the unknown sparsity parameter in DP-Lasso, eliminating the need for prior knowledge of model sparsity, a requisite in the existing literature. T… ▽ More

    Submitted 24 October, 2023; originally announced October 2023.

  46. arXiv:2310.15454  [pdf, other

    cs.LG cs.CR stat.ML

    Private Learning with Public Features

    Authors: Walid Krichene, Nicolas Mayoraz, Steffen Rendle, Shuang Song, Abhradeep Thakurta, Li Zhang

    Abstract: We study a class of private learning problems in which the data is a join of private and public features. This is often the case in private personalization tasks such as recommendation or ad prediction, in which features related to individuals are sensitive, while features related to items (the movies or songs to be recommended, or the ads to be shown to users) are publicly available and do not re… ▽ More

    Submitted 23 October, 2023; originally announced October 2023.

  47. arXiv:2310.09639  [pdf, other

    cs.LG cs.CR math.OC stat.ML

    DPZero: Private Fine-Tuning of Language Models without Backpropagation

    Authors: Liang Zhang, Bingcong Li, Kiran Koshy Thekumparampil, Sewoong Oh, Niao He

    Abstract: The widespread practice of fine-tuning large language models (LLMs) on domain-specific data faces two major challenges in memory and privacy. First, as the size of LLMs continues to grow, the memory demands of gradient-based training methods via backpropagation become prohibitively high. Second, given the tendency of LLMs to memorize training data, it is important to protect potentially sensitive… ▽ More

    Submitted 6 June, 2024; v1 submitted 14 October, 2023; originally announced October 2023.

    Comments: ICML 2024

  48. arXiv:2310.02507  [pdf, other

    stat.ME math.ST

    Inference of Sample Complier Average Causal Effects under Experiments with Completely Randomized Design and Computer Assisted Balance-Improving Designs

    Authors: Zhen Zhong, Per Johansson, Junni L. Zhang

    Abstract: Non-compliance is common in real world experiments. We focus on inference about the sample complier average causal effect, that is, the average treatment effect for experimental units who are compliers. We present three types of inference strategies for the sample complier average causal effect: the Wald estimator, regression adjustment estimators and model-based Bayesian inference. Because modern… ▽ More

    Submitted 3 October, 2023; originally announced October 2023.

    Comments: 42 pages, 2 figures

  49. arXiv:2309.17262  [pdf, other

    stat.ML cs.LG

    Estimation and Inference in Distributional Reinforcement Learning

    Authors: Liangyu Zhang, Yang Peng, Jiadong Liang, Wenhao Yang, Zhihua Zhang

    Abstract: In this paper, we study distributional reinforcement learning from the perspective of statistical efficiency. We investigate distributional policy evaluation, aiming to estimate the complete distribution of the random return (denoted $η^π$) attained by a given policy $π$. We use the certainty-equivalence method to construct our estimator $\hatη^π$, given a generative model is available. We s… ▽ More

    Submitted 29 September, 2023; originally announced September 2023.

  50. arXiv:2309.09555  [pdf, other

    stat.ME stat.ML

    Multi-dimensional domain generalization with low-rank structures

    Authors: Sai Li, Linjun Zhang

    Abstract: In conventional statistical and machine learning methods, it is typically assumed that the test data are identically distributed with the training data. However, this assumption does not always hold, especially in applications where the target population are not well-represented in the training data. This is a notable issue in health-related studies, where specific ethnic populations may be underr… ▽ More

    Submitted 18 September, 2023; originally announced September 2023.