Skip to main content

Showing 1–50 of 161 results for author: Lu, J

Searching in archive stat. Search in all archives.
.
  1. arXiv:2405.14982  [pdf, other

    cs.LG cs.AI cs.CL stat.ML

    In-context Time Series Predictor

    Authors: Jiecheng Lu, Yan Sun, Shihao Yang

    Abstract: Recent Transformer-based large language models (LLMs) demonstrate in-context learning ability to perform various functions based solely on the provided context, without updating model parameters. To fully utilize the in-context capabilities in time series forecasting (TSF) problems, unlike previous Transformer-based or LLM-based time series forecasting methods, we reformulate "time series forecast… ▽ More

    Submitted 23 May, 2024; originally announced May 2024.

  2. arXiv:2405.13785  [pdf, other

    cs.LG cs.AI math.PR stat.ML

    Efficient Two-Stage Gaussian Process Regression Via Automatic Kernel Search and Subsampling

    Authors: Shifan Zhao, Jiaying Lu, Ji Yang, Edmond Chow, Yuanzhe Xi

    Abstract: Gaussian Process Regression (GPR) is widely used in statistics and machine learning for prediction tasks requiring uncertainty measures. Its efficacy depends on the appropriate specification of the mean function, covariance kernel function, and associated hyperparameters. Severe misspecifications can lead to inaccurate results and problematic consequences, especially in safety-critical application… ▽ More

    Submitted 22 May, 2024; originally announced May 2024.

    ACM Class: G.3; J.3

  3. arXiv:2405.02322  [pdf

    stat.AP

    Towards Causal Interpretation of Sexual Orientation in Regression Analysis: Applications and Challenges

    Authors: Junjie Lu, Zhongyi Guo, David H. Rehkopf

    Abstract: This study presents an approach to analyze health disparities in Sexual and Gender Minority (SGM) populations, with a focus on the role of social support levels as an example to allow causal interpretations of regression models. We advocate for precisely defining the exposure variable and incorporating mediators into analyses, to address the limitations of comparing counterfactual outcomes solely… ▽ More

    Submitted 21 April, 2024; originally announced May 2024.

  4. arXiv:2405.00859  [pdf, other

    stat.AP

    WATCH: A Workflow to Assess Treatment Effect Heterogeneity in Drug Development for Clinical Trial Sponsors

    Authors: Konstantinos Sechidis, Sophie Sun, Yao Chen, Jiarui Lu, Cong Zang, Mark Baillie, David Ohlssen, Marc Vandemeulebroecke, Rob Hemmings, Stephen Ruberg, Björn Bornkamp

    Abstract: This paper proposes a Workflow for Assessing Treatment effeCt Heterogeneity (WATCH) in clinical drug development targeted at clinical trial sponsors. The workflow is designed to address the challenges of investigating treatment effect heterogeneity (TEH) in randomized clinical trials, where sample size and multiplicity limit the reliability of findings. The proposed workflow includes four steps: A… ▽ More

    Submitted 1 May, 2024; originally announced May 2024.

  5. arXiv:2404.04865  [pdf, other

    cs.LG cs.CV stat.ML

    On the Learnability of Out-of-distribution Detection

    Authors: Zhen Fang, Yixuan Li, Feng Liu, Bo Han, Jie Lu

    Abstract: Supervised learning aims to train a classifier under the assumption that training and test data are from the same distribution. To ease the above assumption, researchers have studied a more realistic setting: out-of-distribution (OOD) detection, where test data may come from classes that are unknown during training (i.e., OOD data). Due to the unavailability and diversity of OOD data, good general… ▽ More

    Submitted 7 April, 2024; originally announced April 2024.

    Comments: Accepted by JMLR in 7th of April, 2024. This is a journal extension of the previous NeurIPS 2022 Outstanding Paper "Is Out-of-distribution Detection Learnable?" [arXiv:2210.14707]

  6. arXiv:2403.12284  [pdf, other

    math.ST q-bio.QM stat.AP stat.ME

    The Wreaths of KHAN: Uniform Graph Feature Selection with False Discovery Rate Control

    Authors: Jiajun Liang, Yue Liu, Doudou Zhou, Sinian Zhang, Junwei Lu

    Abstract: Graphical models find numerous applications in biology, chemistry, sociology, neuroscience, etc. While substantial progress has been made in graph estimation, it remains largely unexplored how to select significant graph signals with uncertainty assessment, especially those graph features related to topological structures including cycles (i.e., wreaths), cliques, hubs, etc. These features play a… ▽ More

    Submitted 18 March, 2024; originally announced March 2024.

  7. arXiv:2403.01673  [pdf, other

    stat.ML cs.AI cs.LG

    CATS: Enhancing Multivariate Time Series Forecasting by Constructing Auxiliary Time Series as Exogenous Variables

    Authors: Jiecheng Lu, Xu Han, Yan Sun, Shihao Yang

    Abstract: For Multivariate Time Series Forecasting (MTSF), recent deep learning applications show that univariate models frequently outperform multivariate ones. To address the difficiency in multivariate models, we introduce a method to Construct Auxiliary Time Series (CATS) that functions like a 2D temporal-contextual attention mechanism, which generates Auxiliary Time Series (ATS) from Original Time Seri… ▽ More

    Submitted 3 March, 2024; originally announced March 2024.

  8. arXiv:2402.11705  [pdf, other

    stat.ML cs.LG

    Learning Memory Kernels in Generalized Langevin Equations

    Authors: Quanjun Lang, Jianfeng Lu

    Abstract: We introduce a novel approach for learning memory kernels in Generalized Langevin Equations. This approach initially utilizes a regularized Prony method to estimate correlation functions from trajectory data, followed by regression over a Sobolev norm-based loss function with RKHS regularization. Our method guarantees improved performance within an exponentially weighted L^2 space, with the kernel… ▽ More

    Submitted 1 April, 2024; v1 submitted 18 February, 2024; originally announced February 2024.

  9. arXiv:2402.04432  [pdf, other

    stat.AP

    Comprehensive Forecasting of California's Energy Consumption: A Multi-Source and Sectoral Analysis Using ARIMA and ARIMAX Models

    Authors: Zahra Moslemi, Logan Clark, Sarah Kernal, Samantha Rehome, Scott Sprengel, Ahoora Tamizifar, Shawna Tuli, Vish Chokshi, Mo Nomeli, Ella Liang, Moury Bidgoli, Jeff Lu, Manish Dasaur, Marty Hodgett

    Abstract: California's significant role as the second-largest consumer of energy in the United States underscores the importance of accurate energy consumption predictions. With a thriving industrial sector, a burgeoning population, and ambitious environmental goals, the state's energy landscape is dynamic and complex. This paper presents a comprehensive analysis of California's energy consumption trends an… ▽ More

    Submitted 6 February, 2024; originally announced February 2024.

  10. arXiv:2401.14549  [pdf, other

    stat.ME

    Privacy-preserving Quantile Treatment Effect Estimation for Randomized Controlled Trials

    Authors: Leon Yao, Paul Yiming Li, Jiannan Lu

    Abstract: In accordance with the principle of "data minimization", many internet companies are opting to record less data. However, this is often at odds with A/B testing efficacy. For experiments with units with multiple observations, one popular data minimizing technique is to aggregate data for each unit. However, exact quantile estimation requires the full observation-level data. In this paper, we devel… ▽ More

    Submitted 25 January, 2024; originally announced January 2024.

    Comments: Accepted to 2023 CODE conference as a parallel presentation

  11. arXiv:2312.17230  [pdf, ps, other

    stat.ME math.OC stat.CO

    Variable Neighborhood Searching Rerandomization

    Authors: Jiuyao Lu, Daogao Liu

    Abstract: Rerandomization discards undesired treatment assignments to ensure covariate balance in randomized experiments. However, rerandomization based on acceptance-rejection sampling is computationally inefficient, especially when numerous independent assignments are required to perform randomization-based statistical inference. Existing acceleration methods are suboptimal and are not applicable in struc… ▽ More

    Submitted 28 December, 2023; originally announced December 2023.

  12. arXiv:2312.15611  [pdf, other

    stat.ME stat.ML

    Inference of Dependency Knowledge Graph for Electronic Health Records

    Authors: Zhiwei Xu, Ziming Gan, Doudou Zhou, Shuting Shen, Junwei Lu, Tianxi Cai

    Abstract: The effective analysis of high-dimensional Electronic Health Record (EHR) data, with substantial potential for healthcare research, presents notable methodological challenges. Employing predictive modeling guided by a knowledge graph (KG), which enables efficient feature selection, can enhance both statistical efficiency and interpretability. While various methods have emerged for constructing KGs… ▽ More

    Submitted 24 December, 2023; originally announced December 2023.

  13. arXiv:2312.00234  [pdf, other

    cs.LG math.NA stat.ML

    Deep Equilibrium Based Neural Operators for Steady-State PDEs

    Authors: Tanya Marwah, Ashwini Pokle, J. Zico Kolter, Zachary C. Lipton, Jianfeng Lu, Andrej Risteski

    Abstract: Data-driven machine learning approaches are being increasingly used to solve partial differential equations (PDEs). They have shown particularly striking successes when training an operator, which takes as input a PDE in some family, and outputs its solution. However, the architectural design space, especially given structural knowledge of the PDE family of interest, is still poorly understood. We… ▽ More

    Submitted 30 November, 2023; originally announced December 2023.

    Comments: NeurIPS 2023

  14. arXiv:2310.17582  [pdf, other

    stat.ML cs.LG math.OC math.ST

    Convergence of flow-based generative models via proximal gradient descent in Wasserstein space

    Authors: Xiuyuan Cheng, Jianfeng Lu, Yixin Tan, Yao Xie

    Abstract: Flow-based generative models enjoy certain advantages in computing the data generation and the likelihood, and have recently shown competitive empirical performance. Compared to the accumulating theoretical studies on related score-based diffusion models, analysis of flow-based models, which are deterministic in both forward (data-to-noise) and reverse (noise-to-data) directions, remain sparse. In… ▽ More

    Submitted 3 July, 2024; v1 submitted 26 October, 2023; originally announced October 2023.

  15. arXiv:2310.09488  [pdf, other

    stat.ML cs.LG

    ARM: Refining Multivariate Forecasting with Adaptive Temporal-Contextual Learning

    Authors: Jiecheng Lu, Xu Han, Shihao Yang

    Abstract: Long-term time series forecasting (LTSF) is important for various domains but is confronted by challenges in handling the complex temporal-contextual relationships. As multivariate input models underperforming some recent univariate counterparts, we posit that the issue lies in the inefficiency of existing multivariate LTSF Transformers to model series-wise relationships: the characteristic differ… ▽ More

    Submitted 14 October, 2023; originally announced October 2023.

  16. arXiv:2309.10276  [pdf, other

    physics.comp-ph cs.LG stat.ML

    Diffusion Methods for Generating Transition Paths

    Authors: Luke Triplett, Jianfeng Lu

    Abstract: In this work, we seek to simulate rare transitions between metastable states using score-based generative models. An efficient method for generating high-quality transition paths is valuable for the study of molecular systems since data is often difficult to obtain. We develop two novel methods for path generation in this paper: a chain-based approach and a midpoint-based approach. The first biase… ▽ More

    Submitted 18 September, 2023; originally announced September 2023.

    Comments: 14 pages, 8 figures

  17. arXiv:2309.04072  [pdf, ps, other

    math.NA cs.LG stat.ML

    Riemannian Langevin Monte Carlo schemes for sampling PSD matrices with fixed rank

    Authors: Tianmin Yu, Shixin Zheng, Jianfeng Lu, Govind Menon, Xiangxiong Zhang

    Abstract: This paper introduces two explicit schemes to sample matrices from Gibbs distributions on $\mathcal S^{n,p}_+$, the manifold of real positive semi-definite (PSD) matrices of size $n\times n$ and rank $p$. Given an energy function $\mathcal E:\mathcal S^{n,p}_+\to \mathbb{R}$ and certain Riemannian metrics $g$ on $\mathcal S^{n,p}_+$, these schemes rely on an Euler-Maruyama discretization of the Ri… ▽ More

    Submitted 7 September, 2023; originally announced September 2023.

  18. arXiv:2308.13135  [pdf, other

    stat.ML cs.LG

    Nonparametric Additive Value Functions: Interpretable Reinforcement Learning with an Application to Surgical Recovery

    Authors: Patrick Emedom-Nnamdi, Timothy R. Smith, Jukka-Pekka Onnela, Junwei Lu

    Abstract: We propose a nonparametric additive model for estimating interpretable value functions in reinforcement learning. Learning effective adaptive clinical interventions that rely on digital phenotyping features is a major for concern medical practitioners. With respect to spine surgery, different post-operative recovery recommendations concerning patient mobilization can lead to significant variation… ▽ More

    Submitted 24 August, 2023; originally announced August 2023.

    Comments: 28 pages, 13 figures

  19. arXiv:2307.06555  [pdf, other

    cs.LG stat.ML

    Deep Network Approximation: Beyond ReLU to Diverse Activation Functions

    Authors: Shijun Zhang, Jianfeng Lu, Hongkai Zhao

    Abstract: This paper explores the expressive power of deep neural networks for a diverse range of activation functions. An activation function set $\mathscr{A}$ is defined to encompass the majority of commonly used activation functions, such as $\mathtt{ReLU}$, $\mathtt{LeakyReLU}$, $\mathtt{ReLU}^2$, $\mathtt{ELU}$, $\mathtt{CELU}$, $\mathtt{SELU}$, $\mathtt{Softplus}$, $\mathtt{GELU}$, $\mathtt{SiLU}$,… ▽ More

    Submitted 31 January, 2024; v1 submitted 13 July, 2023; originally announced July 2023.

    Journal ref: Journal of Machine Learning Research, 25(35):1--39, 2024

  20. arXiv:2307.03832  [pdf, other

    stat.ME stat.AP

    A Bayesian Circadian Hidden Markov Model to Infer Rest-Activity Rhythms Using 24-hour Actigraphy Data

    Authors: Jiachen Lu, Qian Xiao, Cici Bauer

    Abstract: 24-hour actigraphy data collected by wearable devices offer valuable insights into physical activity types, intensity levels, and rest-activity rhythms (RAR). RARs, or patterns of rest and activity exhibited over a 24-hour period, are regulated by the body's circadian system, synchronizing physiological processes with external cues like the light-dark cycle. Disruptions to these rhythms, such as i… ▽ More

    Submitted 7 July, 2023; originally announced July 2023.

  21. arXiv:2306.06857  [pdf, other

    stat.ME

    FADI: Fast Distributed Principal Component Analysis With High Accuracy for Large-Scale Federated Data

    Authors: Shuting Shen, Junwei Lu, Xihong Lin

    Abstract: Principal component analysis (PCA) is one of the most popular methods for dimension reduction. In light of the rapidly growing large-scale data in federated ecosystems, the traditional PCA method is often not applicable due to privacy protection considerations and large computational burden. Algorithms were proposed to lower the computational cost, but few can handle both high dimensionality and m… ▽ More

    Submitted 12 June, 2023; originally announced June 2023.

  22. arXiv:2305.19997  [pdf, other

    stat.ML math.ST

    Knowledge Graph Embedding with Electronic Health Records Data via Latent Graphical Block Model

    Authors: Junwei Lu, Jin Yin, Tianxi Cai

    Abstract: Due to the increasing adoption of electronic health records (EHR), large scale EHRs have become another rich data source for translational clinical research. Despite its potential, deriving generalizable knowledge from EHR data remains challenging. First, EHR data are generated as part of clinical care with data elements too detailed and fragmented for research. Despite recent progress in mapping… ▽ More

    Submitted 31 May, 2023; originally announced May 2023.

  23. arXiv:2305.16459  [pdf, other

    stat.ME stat.AP

    All about sample-size calculations for A/B testing: Novel extensions and practical guide

    Authors: Jing Zhou, Jiannan Lu, Anas Shallah

    Abstract: While there exists a large amount of literature on the general challenges of and best practices for trustworthy online A/B testing, there are limited studies on sample size estimation, which plays a crucial role in trustworthy and efficient A/B testing that ensures the resulting inference has a sufficient power and type I error control. For example, when sample size is under-estimated, the statist… ▽ More

    Submitted 16 August, 2023; v1 submitted 25 May, 2023; originally announced May 2023.

    Comments: Accepted by CIKM'23

    MSC Class: 62

  24. arXiv:2305.11798  [pdf, ps, other

    cs.LG math.ST stat.ML

    The probability flow ODE is provably fast

    Authors: Sitan Chen, Sinho Chewi, Holden Lee, Yuanzhi Li, Jianfeng Lu, Adil Salim

    Abstract: We provide the first polynomial-time convergence guarantees for the probability flow ODE implementation (together with a corrector step) of score-based generative modeling. Our analysis is carried out in the wake of recent results obtaining such guarantees for the SDE-based implementation (i.e., denoising diffusion probabilistic modeling or DDPM), but requires the development of novel techniques f… ▽ More

    Submitted 19 May, 2023; originally announced May 2023.

    Comments: 23 pages, 2 figures

  25. arXiv:2305.05529  [pdf, other

    stat.CO cs.LG math.PR math.ST stat.ML

    Accelerate Langevin Sampling with Birth-Death process and Exploration Component

    Authors: Lezhi Tan, Jianfeng Lu

    Abstract: Sampling a probability distribution with known likelihood is a fundamental task in computational science and engineering. Aiming at multimodality, we propose a new sampling method that takes advantage of both birth-death process and exploration component. The main idea of this method is \textit{look before you leap}. We keep two sets of samplers, one at warmer temperature and one at original tempe… ▽ More

    Submitted 6 May, 2023; originally announced May 2023.

    Comments: 23 pages, 10 figures

  26. arXiv:2304.09221  [pdf, ps, other

    cs.LG math.OC stat.ML

    Convergence of stochastic gradient descent under a local Lojasiewicz condition for deep neural networks

    Authors: Jing An, Jianfeng Lu

    Abstract: We study the convergence of stochastic gradient descent (SGD) for non-convex objective functions. We establish the local convergence with positive probability under the local Łojasiewicz condition introduced by Chatterjee in \cite{chatterjee2022convergence} and an additional local structural assumption of the loss function landscape. A key component of our proof is to ensure that the whole traject… ▽ More

    Submitted 12 January, 2024; v1 submitted 18 April, 2023; originally announced April 2023.

    Comments: v2 fixed several mistakes. Some parts have been rewritten

  27. arXiv:2303.10516  [pdf, other

    stat.AP stat.ME

    Detect influential points of feature rankings

    Authors: Shuo Wang, Junyan Lu

    Abstract: Background Deriving feature rankings is essential in bioinformatics studies since the ordered features are important in guiding subsequent research. Feature rankings may be distorted by influential points (IP), but such effects are rarely mentioned in previous studies. This study aimed to investigate the impact of IPs on feature rankings and propose a new method to detect IPs. Method The present s… ▽ More

    Submitted 18 March, 2023; originally announced March 2023.

  28. arXiv:2303.06726  [pdf, other

    stat.ML cs.LG

    Global Optimality of Elman-type RNN in the Mean-Field Regime

    Authors: Andrea Agazzi, Jianfeng Lu, Sayan Mukherjee

    Abstract: We analyze Elman-type Recurrent Reural Networks (RNNs) and their training in the mean-field regime. Specifically, we show convergence of gradient descent training dynamics of the RNN to the corresponding mean-field formulation in the large width limit. We also show that the fixed points of the limiting infinite-width dynamics are globally optimal, under some assumptions on the initialization of th… ▽ More

    Submitted 12 March, 2023; originally announced March 2023.

    Comments: 31 pages, 2 figures

  29. arXiv:2302.13231  [pdf

    eess.SY stat.AP

    A Synthetic Texas Backbone Power System with Climate-Dependent Spatio-Temporal Correlated Profiles

    Authors: Jin Lu, Xingpeng Li, Hongyi Li, Taher Chegini, Carlos Gamarra, Y. C. Ethan Yang, Margaret Cook, Gavin Dillingham

    Abstract: Most power system test cases only have electrical parameters and can be used only for studies based on a snapshot of system profiles. To facilitate more comprehensive and practical studies, a synthetic power system including spatio-temporal correlated profiles for the entire year of 2019 at one-hour resolution has been created in this work. This system, referred to as the synthetic Texas 123-bus b… ▽ More

    Submitted 25 February, 2023; originally announced February 2023.

    Comments: 10 pages, 14 figures, 12 tables

  30. arXiv:2302.04611  [pdf, other

    cs.LG cs.AI q-bio.QM stat.ML

    A Text-guided Protein Design Framework

    Authors: Shengchao Liu, Yanjing Li, Zhuoxinran Li, Anthony Gitter, Yutao Zhu, Jiarui Lu, Zhao Xu, Weili Nie, Arvind Ramanathan, Chaowei Xiao, Jian Tang, Hongyu Guo, Anima Anandkumar

    Abstract: Current AI-assisted protein design mainly utilizes protein sequential and structural information. Meanwhile, there exists tremendous knowledge curated by humans in the text format describing proteins' high-level functionalities. Yet, whether the incorporation of such text data can help protein design tasks has not been explored. To bridge this gap, we propose ProteinDT, a multi-modal framework tha… ▽ More

    Submitted 3 December, 2023; v1 submitted 9 February, 2023; originally announced February 2023.

  31. arXiv:2301.12353  [pdf, other

    cs.LG math.DS stat.ML

    On Enhancing Expressive Power via Compositions of Single Fixed-Size ReLU Network

    Authors: Shijun Zhang, Jianfeng Lu, Hongkai Zhao

    Abstract: This paper explores the expressive power of deep neural networks through the framework of function compositions. We demonstrate that the repeated compositions of a single fixed-size ReLU network exhibit surprising expressive power, despite the limited expressive capabilities of the individual network itself. Specifically, we prove by construction that… ▽ More

    Submitted 30 May, 2023; v1 submitted 28 January, 2023; originally announced January 2023.

    Journal ref: Proceedings of the 40th International Conference on Machine Learning, PMLR 202:41452-41487, 2023

  32. arXiv:2301.12254  [pdf, other

    stat.ML cs.LG

    Combinatorial Inference on the Optimal Assortment in Multinomial Logit Models

    Authors: Shuting Shen, Xi Chen, Ethan X. Fang, Junwei Lu

    Abstract: Assortment optimization has received active explorations in the past few decades due to its practical importance. Despite the extensive literature dealing with optimization algorithms and latent score estimation, uncertainty quantification for the optimal assortment still needs to be explored and is of great practical significance. Instead of estimating and recovering the complete optimal offer se… ▽ More

    Submitted 3 May, 2023; v1 submitted 28 January, 2023; originally announced January 2023.

  33. arXiv:2212.10789  [pdf, other

    cs.LG cs.CL q-bio.QM stat.ML

    Multi-modal Molecule Structure-text Model for Text-based Retrieval and Editing

    Authors: Shengchao Liu, Weili Nie, Chengpeng Wang, Jiarui Lu, Zhuoran Qiao, Ling Liu, Jian Tang, Chaowei Xiao, Anima Anandkumar

    Abstract: There is increasing adoption of artificial intelligence in drug discovery. However, existing studies use machine learning to mainly utilize the chemical structures of molecules but ignore the vast textual knowledge available in chemistry. Incorporating textual knowledge enables us to realize new drug design objectives, adapt to text-based instructions and predict complex biological activities. Her… ▽ More

    Submitted 29 January, 2024; v1 submitted 21 December, 2022; originally announced December 2022.

  34. arXiv:2211.07861  [pdf, other

    stat.ML cs.LG math.AP math.NA math.ST stat.CO

    Regularized Stein Variational Gradient Flow

    Authors: Ye He, Krishnakumar Balasubramanian, Bharath K. Sriperumbudur, Jianfeng Lu

    Abstract: The Stein Variational Gradient Descent (SVGD) algorithm is a deterministic particle method for sampling. However, a mean-field analysis reveals that the gradient flow corresponding to the SVGD algorithm (i.e., the Stein Variational Gradient Flow) only provides a constant-order approximation to the Wasserstein Gradient Flow corresponding to the KL-divergence minimization. In this work, we propose t… ▽ More

    Submitted 8 May, 2024; v1 submitted 14 November, 2022; originally announced November 2022.

  35. arXiv:2210.14707  [pdf, other

    cs.LG stat.ML

    Is Out-of-Distribution Detection Learnable?

    Authors: Zhen Fang, Yixuan Li, Jie Lu, Jiahua Dong, Bo Han, Feng Liu

    Abstract: Supervised learning aims to train a classifier under the assumption that training and test data are from the same distribution. To ease the above assumption, researchers have studied a more realistic setting: out-of-distribution (OOD) detection, where test data may come from classes that are unknown during training (i.e., OOD data). Due to the unavailability and diversity of OOD data, good general… ▽ More

    Submitted 23 February, 2023; v1 submitted 26 October, 2022; originally announced October 2022.

    Comments: NeurIPS 2022 Outstanding Paper

  36. arXiv:2210.08486  [pdf, other

    cs.LG cs.AI stat.ML

    Streaming PAC-Bayes Gaussian process regression with a performance guarantee for online decision making

    Authors: Tianyu Liu, Jie Lu, Zheng Yan, Guangquan Zhang

    Abstract: As a powerful Bayesian non-parameterized algorithm, the Gaussian process (GP) has performed a significant role in Bayesian optimization and signal processing. GPs have also advanced online decision-making systems because their posterior distribution has a closed-form solution. However, its training and inference process requires all historic data to be stored and the GP model to be trained from sc… ▽ More

    Submitted 26 October, 2022; v1 submitted 16 October, 2022; originally announced October 2022.

  37. arXiv:2209.12381  [pdf, ps, other

    cs.LG math.PR math.ST stat.ML

    Convergence of score-based generative modeling for general data distributions

    Authors: Holden Lee, Jianfeng Lu, Yixin Tan

    Abstract: Score-based generative modeling (SGM) has grown to be a hugely successful method for learning to generate samples from complex data distributions such as that of images and audio. It is based on evolving an SDE that transforms white noise into a sample from the learned distribution, using estimates of the score function, or gradient log-pdf. Previous convergence analyses for these methods have suf… ▽ More

    Submitted 3 October, 2022; v1 submitted 25 September, 2022; originally announced September 2022.

  38. arXiv:2206.06227  [pdf, ps, other

    cs.LG math.PR math.ST stat.ML

    Convergence for score-based generative modeling with polynomial complexity

    Authors: Holden Lee, Jianfeng Lu, Yixin Tan

    Abstract: Score-based generative modeling (SGM) is a highly successful approach for learning a probability distribution from data and generating further samples. We prove the first polynomial convergence guarantees for the core mechanic behind SGM: drawing samples from a probability density $p$ given a score estimate (an estimate of $\nabla \ln p$) that is accurate in $L^2(p)$. Compared to previous works, w… ▽ More

    Submitted 3 May, 2023; v1 submitted 13 June, 2022; originally announced June 2022.

    Comments: 43 pages

    Journal ref: Advances in Neural Information Processing Systems 35 (2022), 22870--22882

  39. arXiv:2206.05581  [pdf, other

    stat.ML cs.LG stat.ME

    Federated Offline Reinforcement Learning

    Authors: Doudou Zhou, Yufeng Zhang, Aaron Sonabend-W, Zhaoran Wang, Junwei Lu, Tianxi Cai

    Abstract: Evidence-based or data-driven dynamic treatment regimes are essential for personalized medicine, which can benefit from offline reinforcement learning (RL). Although massive healthcare data are available across medical institutions, they are prohibited from sharing due to privacy constraints. Besides, heterogeneity exists in different sites. As a result, federated offline RL algorithms are necessa… ▽ More

    Submitted 27 January, 2024; v1 submitted 11 June, 2022; originally announced June 2022.

  40. arXiv:2206.02204  [pdf, other

    stat.ME

    A weighted average distributed estimator for high dimensional parameter

    Authors: Jun Lu, Mengyao Li, Chenping Hou

    Abstract: Distributed sparse learning for high dimensional parameters has attached vast attentions due to its wide application in prediction and classification in diverse fields of machine learning. Existing distributed sparse regression usually takes an average way to ensemble the local results produced by distributed machines, which enjoys low communication cost but is statistical inefficient. To address… ▽ More

    Submitted 25 October, 2022; v1 submitted 5 June, 2022; originally announced June 2022.

  41. arXiv:2206.02017  [pdf, ps, other

    stat.ME

    Feature screening for multi-response linear models by empirical likelihood

    Authors: Jun Lu, Qinqin Hu, Lu Lin

    Abstract: This paper proposes a new feature screening method for the multi-response ultrahigh dimensional linear model by empirical likelihood. Through a multivariate moment condition, the empirical likelihood induced ranking statistics can exploit the joint effect among responses, and thus result in a much better performance than the methods considering responses individually. More importantly, by the use… ▽ More

    Submitted 4 June, 2022; originally announced June 2022.

  42. arXiv:2205.11025  [pdf, other

    cs.LG cs.IT stat.ML

    Flexible and Hierarchical Prior for Bayesian Nonnegative Matrix Factorization

    Authors: Jun Lu, Xuanyu Ye

    Abstract: In this paper, we introduce a probabilistic model for learning nonnegative matrix factorization (NMF) that is commonly used for predicting missing values and finding hidden patterns in the data, in which the matrix factors are latent variables associated with each data dimension. The nonnegativity constraint for the latent factors is handled by choosing priors with support on the nonnegative subsp… ▽ More

    Submitted 19 June, 2022; v1 submitted 22 May, 2022; originally announced May 2022.

  43. arXiv:2203.14702  [pdf, other

    cs.CV cs.LG stat.ML

    Bi-level Doubly Variational Learning for Energy-based Latent Variable Models

    Authors: Ge Kan, Jinhu Lü, Tian Wang, Baochang Zhang, Aichun Zhu, Lei Huang, Guodong Guo, Hichem Snoussi

    Abstract: Energy-based latent variable models (EBLVMs) are more expressive than conventional energy-based models. However, its potential on visual tasks are limited by its training process based on maximum likelihood estimate that requires sampling from two intractable distributions. In this paper, we propose Bi-level doubly variational learning (BiDVL), which is based on a new bi-level optimization framewo… ▽ More

    Submitted 24 March, 2022; originally announced March 2022.

    Comments: CVPR 2022

  44. arXiv:2202.00618  [pdf, other

    stat.ME stat.AP

    Penalized Estimation of Frailty-Based Illness-Death Models for Semi-Competing Risks

    Authors: Harrison T. Reeder, Junwei Lu, Sebastien Haneuse

    Abstract: Semi-competing risks refers to the survival analysis setting where the occurrence of a non-terminal event is subject to whether a terminal event has occurred, but not vice versa. Semi-competing risks arise in a broad range of clinical contexts, with a novel example being the pregnancy condition preeclampsia, which can only occur before the `terminal' event of giving birth. Models that acknowledge… ▽ More

    Submitted 15 April, 2024; v1 submitted 1 February, 2022; originally announced February 2022.

    Comments: This is the final "accepted" version of the article available in Biometrics at the below citation/doi. It has been uploaded following an embargo period from original publication

    Journal ref: Biometrics, 79(3), 1657-1669

  45. A zero-inflated endemic-epidemic model with an application to measles time series in Germany

    Authors: Junyi Lu, Sebastian Meyer

    Abstract: Count data with excessive zeros are often encountered when modelling infectious disease occurrence. The degree of zero inflation can vary over time due to non-epidemic periods as well as by age group or region. The existing endemic-epidemic modelling framework (aka HHH) lacks a proper treatment for surveillance data with excessive zeros as it is limited to Poisson and negative binomial distributio… ▽ More

    Submitted 18 January, 2022; originally announced January 2022.

  46. arXiv:2110.06897  [pdf, other

    math.NA cs.LG math.ST physics.comp-ph stat.ML

    Machine Learning For Elliptic PDEs: Fast Rate Generalization Bound, Neural Scaling Law and Minimax Optimality

    Authors: Yiping Lu, Haoxuan Chen, Jianfeng Lu, Lexing Ying, Jose Blanchet

    Abstract: In this paper, we study the statistical limits of deep learning techniques for solving elliptic partial differential equations (PDEs) from random samples using the Deep Ritz Method (DRM) and Physics-Informed Neural Networks (PINNs). To simplify the problem, we focus on a prototype elliptic PDE: the Schrödinger equation on a hypercube with zero Dirichlet boundary condition, which has wide applicati… ▽ More

    Submitted 12 November, 2021; v1 submitted 13 October, 2021; originally announced October 2021.

    Comments: add a proof Proof Sketch in section 4.1

  47. arXiv:2110.00151  [pdf, other

    stat.ML cs.LG math.ST

    Lagrangian Inference for Ranking Problems

    Authors: Yue Liu, Ethan X. Fang, Junwei Lu

    Abstract: We propose a novel combinatorial inference framework to conduct general uncertainty quantification in ranking problems. We consider the widely adopted Bradley-Terry-Luce (BTL) model, where each item is assigned a positive preference score that determines the Bernoulli distributions of pairwise comparisons' outcomes. Our proposed method aims to infer general ranking properties of the BTL model. The… ▽ More

    Submitted 30 September, 2021; originally announced October 2021.

  48. arXiv:2109.11929  [pdf, other

    stat.ML cs.AI cs.LG

    Deep Bayesian Estimation for Dynamic Treatment Regimes with a Long Follow-up Time

    Authors: Adi Lin, Jie Lu, Junyu Xuan, Fujin Zhu, Guangquan Zhang

    Abstract: Causal effect estimation for dynamic treatment regimes (DTRs) contributes to sequential decision making. However, censoring and time-dependent confounding under DTRs are challenging as the amount of observational data declines over time due to a reducing sample size but the feature dimension increases over time. Long-term follow-up compounds these challenges. Another challenge is the highly comple… ▽ More

    Submitted 20 September, 2021; originally announced September 2021.

  49. arXiv:2108.11753  [pdf, other

    cs.LG cs.AI stat.ML

    A survey on Bayesian inference for Gaussian mixture model

    Authors: Jun Lu

    Abstract: Clustering has become a core technology in machine learning, largely due to its application in the field of unsupervised learning, clustering, classification, and density estimation. A frequentist approach exists to hand clustering based on mixture model which is known as the EM algorithm where the parameters of the mixture model are usually estimated into a maximum likelihood estimation framework… ▽ More

    Submitted 20 August, 2021; originally announced August 2021.

  50. arXiv:2108.09904  [pdf, other

    stat.ME math.ST

    StarTrek: Combinatorial Variable Selection with False Discovery Rate Control

    Authors: Lu Zhang, Junwei Lu

    Abstract: Variable selection on the large-scale networks has been extensively studied in the literature. While most of the existing methods are limited to the local functionals especially the graph edges, this paper focuses on selecting the discrete hub structures of the networks. Specifically, we propose an inferential method, called StarTrek filter, to select the hub nodes with degrees larger than a certa… ▽ More

    Submitted 14 September, 2023; v1 submitted 22 August, 2021; originally announced August 2021.