Showing 1–50 of 240 results for author: Chen, S

Searching in archive stat.
  1. arXiv:2407.17719  [pdf]

    stat.AP

    A new moment-independent uncertainty importance measure based on cumulative residual entropy for developing uncertainty reduction strategies

    Authors: Shi-Shun Chen, Xiao-Yang Li

    Abstract: Uncertainty reduction is vital for improving system reliability and reducing risks. To identify the best target for uncertainty reduction, an uncertainty importance measure is commonly used to prioritize the significance of input variable uncertainties. Then, designers will take steps to reduce the uncertainties of variables with high importance. However, for variables with minimal uncertainty, the c…

    Submitted 24 July, 2024; originally announced July 2024.

  2. arXiv:2407.17718  [pdf]

    stat.AP

    Comparison of global sensitivity analysis methods for a fire spread model with a segmented characteristic

    Authors: Shi-Shun Chen, Xiao-Yang Li

    Abstract: Global sensitivity analysis (GSA) can provide rich information for controlling output uncertainty. In practical applications, segmented models are commonly used to describe an abrupt model change. For segmented models, the complicated uncertainty propagation during the transition region may lead to different importance rankings of different GSA methods. If an unsuitable GSA method is applied, misl…

    Submitted 24 July, 2024; originally announced July 2024.

  3. arXiv:2407.17592  [pdf, other]

    stat.ME stat.CO

    Robust Maximum $L_q$-Likelihood Covariance Estimation for Replicated Spatial Data

    Authors: Sihan Chen, Joydeep Chowdhury, Marc G. Genton

    Abstract: Parameter estimation with the maximum $L_q$-likelihood estimator (ML$q$E) is an alternative to the maximum likelihood estimator (MLE) that considers the $q$-th power of the likelihood values for some $q<1$. In this method, extreme values are down-weighted because of their lower likelihood values, which yields robust estimates. In this work, we study the properties of the ML$q$E for spatial data wi…

    Submitted 24 July, 2024; originally announced July 2024.

  4. arXiv:2406.16605  [pdf, other]

    cs.CL cs.AI cs.LG stat.ME

    CLEAR: Can Language Models Really Understand Causal Graphs?

    Authors: Sirui Chen, Mengying Xu, Kun Wang, Xingyu Zeng, Rui Zhao, Shengjie Zhao, Chaochao Lu

    Abstract: Causal reasoning is a cornerstone of how humans interpret the world. To model and reason about causality, causal graphs offer a concise yet effective solution. Given the impressive advancements in language models, a crucial question arises: can they really understand causal graphs? To this end, we pioneer an investigation into language models' understanding of causal graphs. Specifically, we devel…

    Submitted 24 June, 2024; originally announced June 2024.

  5. arXiv:2406.00924  [pdf, ps, other]

    cs.LG cs.DS math.ST stat.ML

    Faster Diffusion-based Sampling with Randomized Midpoints: Sequential and Parallel

    Authors: Shivam Gupta, Linda Cai, Sitan Chen

    Abstract: In recent years, there has been a surge of interest in proving discretization bounds for diffusion models. These works show that for essentially any data distribution, one can approximately sample in polynomial time given a sufficiently accurate estimate of its score functions at different noise levels. In this work, we propose a new discretization scheme for diffusion models inspired by Shen and…

    Submitted 2 June, 2024; originally announced June 2024.

  6. arXiv:2406.00695  [pdf, other]

    physics.flu-dyn cs.LG cs.SC stat.AP

    Discovering an interpretable mathematical expression for a full wind-turbine wake with artificial intelligence enhanced symbolic regression

    Authors: Ding Wang, Yuntian Chen, Shiyi Chen

    Abstract: The rapid expansion of wind power worldwide underscores the critical significance of engineering-focused analytical wake models in both the design and operation of wind farms. These theoretically-derived analytical wake models have limited predictive capabilities, particularly in the near-wake region close to the turbine rotor, due to assumptions that do not hold. Knowledge discovery methods can…

    Submitted 2 June, 2024; originally announced June 2024.

  7. arXiv:2405.15950  [pdf, ps, other]

    stat.ML cs.LG stat.ME

    A Systematic Bias of Machine Learning Regression Models and Its Correction: an Application to Imaging-based Brain Age Prediction

    Authors: Hwiyoung Lee, Shuo Chen

    Abstract: Machine learning models for continuous outcomes often yield systematically biased predictions, particularly for values that largely deviate from the mean. Specifically, predictions for large-valued outcomes tend to be negatively biased, while those for small-valued outcomes are positively biased. We refer to this linear central tendency warped bias as the "systematic bias of machine learning regre…

    Submitted 24 May, 2024; originally announced May 2024.

  8. arXiv:2404.18893  [pdf, other]

    cs.DS cs.LG stat.ML

    Learning general Gaussian mixtures with efficient score matching

    Authors: Sitan Chen, Vasilis Kontonis, Kulin Shah

    Abstract: We study the problem of learning mixtures of $k$ Gaussians in $d$ dimensions. We make no separation assumptions on the underlying mixture components: we only require that the covariance matrices have bounded condition number and that the means and covariances lie in a ball of bounded radius. We give an algorithm that draws $d^{\mathrm{poly}(k/\varepsilon)}$ samples from the target mixture, runs in…

    Submitted 29 April, 2024; originally announced April 2024.

    Comments: 57 pages

  9. arXiv:2404.10884  [pdf, other]

    stat.ME

    Modeling Interconnected Modules in Multivariate Outcomes: Evaluating the Impact of Alcohol Intake on Plasma Metabolomics

    Authors: Yifan Yang, Chixiang Chen, Hwiyoung Lee, Ming Wang, Shuo Chen

    Abstract: Alcohol consumption has been shown to influence cardiovascular mechanisms in humans, leading to observable alterations in the plasma metabolomic profile. Regression models are commonly employed to investigate these effects, treating metabolomics features as the outcomes and alcohol intake as the exposure. Given the latent dependence structure among the numerous metabolomic features (e.g., co-expre…

    Submitted 16 April, 2024; originally announced April 2024.

    Comments: 25 pages, 5 figures

  10. arXiv:2404.03160  [pdf, other]

    stat.AP

    Simultaneous clustering and estimation of additive shape invariant models for recurrent event data

    Authors: Zitong Zhang, Shizhe Chen

    Abstract: Technological advancements have enabled the recording of spiking activities from large neuron ensembles, presenting an exciting yet challenging opportunity for statistical analysis. This project considers the challenges from a common type of neuroscience experiments, where randomized interventions are applied over the course of each trial. The objective is to identify groups of neurons with unique…

    Submitted 3 April, 2024; originally announced April 2024.

  11. arXiv:2403.17852  [pdf, other]

    cs.LG stat.ML

    Counterfactual Fairness through Transforming Data Orthogonal to Bias

    Authors: Shuyi Chen, Shixiang Zhu

    Abstract: Machine learning models have shown exceptional prowess in solving complex issues across various domains. However, these models can sometimes exhibit biased decision-making, resulting in unequal treatment of different groups. Despite substantial research on counterfactual fairness, methods to reduce the impact of multivariate and continuous sensitive variables on decision-making outcomes are still…

    Submitted 29 June, 2024; v1 submitted 26 March, 2024; originally announced March 2024.

  12. arXiv:2403.08699  [pdf, ps, other]

    cs.LG cs.AI math.OC stat.ML

    Implicit Regularization of Gradient Flow on One-Layer Softmax Attention

    Authors: Heejune Sheen, Siyu Chen, Tianhao Wang, Harrison H. Zhou

    Abstract: We study gradient flow on the exponential loss for a classification problem with a one-layer softmax attention model, where the key and query weight matrices are trained separately. Under a separability assumption on the data, we show that when gradient flow achieves the minimal loss value, it further implicitly minimizes the nuclear norm of the product of the key and query weight matrices. Such i…

    Submitted 13 March, 2024; originally announced March 2024.

    Comments: 34 pages

  13. arXiv:2403.01633  [pdf, other]

    cs.LG cs.CV stat.ML

    Critical windows: non-asymptotic theory for feature emergence in diffusion models

    Authors: Marvin Li, Sitan Chen

    Abstract: We develop theory to understand an intriguing property of diffusion models for image generation that we term critical windows. Empirically, it has been observed that there are narrow time intervals in sampling during which particular features of the final image emerge, e.g. the image class or background color (Ho et al., 2020b; Meng et al., 2022; Choi et al., 2022; Raya & Ambrogioni, 2023; Georgie…

    Submitted 24 May, 2024; v1 submitted 3 March, 2024; originally announced March 2024.

  14. arXiv:2402.19442  [pdf, other]

    cs.LG cs.AI math.OC math.ST stat.ML

    Training Dynamics of Multi-Head Softmax Attention for In-Context Learning: Emergence, Convergence, and Optimality

    Authors: Siyu Chen, Heejune Sheen, Tianhao Wang, Zhuoran Yang

    Abstract: We study the dynamics of gradient flow for training a multi-head softmax attention model for in-context learning of multi-task linear regression. We establish the global convergence of gradient flow under suitable choices of initialization. In addition, we prove that an interesting "task allocation" phenomenon emerges during the gradient flow dynamics, where each attention head focuses on solving…

    Submitted 10 June, 2024; v1 submitted 29 February, 2024; originally announced February 2024.

    Comments: 141 pages, 7 figures

  15. arXiv:2402.09356  [pdf, other]

    stat.CO stat.ME

    On the Impact of Spatial Covariance Matrix Ordering on Tile Low-Rank Estimation of Matérn Parameters

    Authors: Sihan Chen, Sameh Abdulah, Ying Sun, Marc G. Genton

    Abstract: Spatial statistical modeling and prediction involve generating and manipulating an n*n symmetric positive definite covariance matrix, where n denotes the number of spatial locations. However, when n is large, processing this covariance matrix using traditional methods becomes prohibitive. Thus, coupling parallel processing with approximation can be an elegant solution to this challenge by relying…

    Submitted 14 February, 2024; originally announced February 2024.

    Comments: 31 pages, 13 figures

  16. arXiv:2402.07134  [pdf, other]

    q-fin.RM stat.AP

    Tail risk forecasting with semi-parametric regression models by incorporating overnight information

    Authors: Cathy W. S. Chen, Takaaki Koike, Wei-Hsuan Shau

    Abstract: This research incorporates realized volatility and overnight information into risk models, wherein the overnight return often contributes significantly to the total return volatility. Extending a semi-parametric regression model based on asymmetric Laplace distribution, we propose a family of RES-CAViaR-oc models by adding overnight return and realized measures as a nowcasting technique for simult…

    Submitted 11 February, 2024; originally announced February 2024.

  17. arXiv:2402.05569  [pdf, other]

    cs.LG cs.AI eess.SP stat.ML

    Simplifying Hypergraph Neural Networks

    Authors: Bohan Tang, Zexi Liu, Keyue Jiang, Siheng Chen, Xiaowen Dong

    Abstract: Hypergraphs are crucial for modeling higher-order interactions in real-world data. Hypergraph neural networks (HNNs) effectively utilise these structures by message passing to generate informative node features for various downstream tasks like node classification. However, the message passing block in existing HNNs typically requires a computationally intensive training process, which limits thei…

    Submitted 22 May, 2024; v1 submitted 8 February, 2024; originally announced February 2024.

  18. arXiv:2402.04084  [pdf, ps, other]

    cs.LG cs.DS stat.ML

    Provably learning a multi-head attention layer

    Authors: Sitan Chen, Yuanzhi Li

    Abstract: The multi-head attention layer is one of the key components of the transformer architecture that sets it apart from traditional feed-forward models. Given a sequence length $k$, attention matrices $\mathbf{\Theta}_1,\ldots,\mathbf{\Theta}_m\in\mathbb{R}^{d\times d}$, and projection matrices $\mathbf{W}_1,\ldots,\mathbf{W}_m\in\mathbb{R}^{d\times d}$, the corresponding multi-head attention layer…

    Submitted 6 February, 2024; originally announced February 2024.

    Comments: 105 pages, comments welcome

  19. arXiv:2401.13943  [pdf, other]

    stat.AP stat.ME

    Is the age pension in Australia sustainable and fair? Evidence from forecasting the old-age dependency ratio using the Hamilton-Perry model

    Authors: Sizhe Chen, Han Lin Shang, Yang Yang

    Abstract: The age pension aims to assist eligible elderly Australians who meet specific age and residency criteria in maintaining basic living standards. In designing efficient pension systems, government policymakers seek to satisfy the expectations of the overall aging population in Australia. However, the population's unique demographic characteristics at the state and territory level are often overlooked du…

    Submitted 24 January, 2024; originally announced January 2024.

    Comments: 31 pages, 14 figures, 1 table

    MSC Class: 62R10

  20. arXiv:2401.04856  [pdf, other]

    cs.LG stat.ML

    A Good Score Does not Lead to A Good Generative Model

    Authors: Sixu Li, Shi Chen, Qin Li

    Abstract: Score-based Generative Models (SGMs) are a leading method in generative modeling, renowned for their ability to generate high-quality samples from complex, high-dimensional data distributions. The method enjoys empirical success and is supported by rigorous theoretical convergence properties. In particular, it has been shown that SGMs can generate samples from a distribution that is close to the…

    Submitted 27 January, 2024; v1 submitted 9 January, 2024; originally announced January 2024.

  21. arXiv:2401.00624  [pdf, other]

    stat.ME

    Semi-Confirmatory Factor Analysis for High-Dimensional Data with Interconnected Community Structures

    Authors: Yifan Yang, Tianzhou Ma, Chuan Bi, Shuo Chen

    Abstract: Confirmatory factor analysis (CFA) is a statistical method for identifying and confirming the presence of latent factors among observed variables through the analysis of their covariance structure. Compared to alternative factor models, CFA offers interpretable common factors with enhanced specificity and a more adaptable approach to modeling covariance structures. However, the application of CFA…

    Submitted 27 March, 2024; v1 submitted 31 December, 2023; originally announced January 2024.

  22. arXiv:2312.08583  [pdf, other]

    cs.CL stat.ML

    ZeroQuant(4+2): Redefining LLMs Quantization with a New FP6-Centric Strategy for Diverse Generative Tasks

    Authors: Xiaoxia Wu, Haojun Xia, Stephen Youn, Zhen Zheng, Shiyang Chen, Arash Bakhtiari, Michael Wyatt, Reza Yazdani Aminabadi, Yuxiong He, Olatunji Ruwase, Leon Song, Zhewei Yao

    Abstract: This study examines 4-bit quantization methods like GPTQ in large language models (LLMs), highlighting GPTQ's overfitting and limited enhancement in Zero-Shot tasks. While prior works merely focus on zero-shot measurement, we extend task scope to more generative categories such as code generation and abstractive summarization, in which we found that INT4 quantization can significantly underperf…

    Submitted 18 December, 2023; v1 submitted 13 December, 2023; originally announced December 2023.

  23. arXiv:2311.03776  [pdf, other]

    physics.flu-dyn stat.AP stat.ML

    Filtered Partial Differential Equations: a robust surrogate constraint in physics-informed deep learning framework

    Authors: Dashan Zhang, Yuntian Chen, Shiyi Chen

    Abstract: Embedding physical knowledge into neural network (NN) training has been a hot topic. However, when facing the complex real-world, most of the existing methods still strongly rely on the quantity and quality of observation data. Furthermore, the neural networks often struggle to converge when the solution to the real equation is very complex. Inspired by large eddy simulation in computational fluid…

    Submitted 14 May, 2024; v1 submitted 7 November, 2023; originally announced November 2023.

  24. arXiv:2310.18567  [pdf]

    stat.AP

    Reliability modeling and statistical inference of accelerated degradation data with memory effects and unit-to-unit variability

    Authors: Shi-Shun Chen, Xiao-Yang Li, Wenrui Xie

    Abstract: Accelerated degradation testing (ADT) is an effective way to evaluate the lifetime and reliability of highly reliable products. Markovian stochastic processes are usually applied to describe the degradation process. However, the degradation processes of some products are non-Markovian due to the interaction with environments. Besides, owing to the differences in materials and manufacturing process…

    Submitted 24 July, 2024; v1 submitted 27 October, 2023; originally announced October 2023.

  25. arXiv:2310.18533  [pdf, other]

    stat.ME q-bio.NC q-bio.QM stat.CO

    Evaluating the effects of high-throughput structural neuroimaging predictors on whole-brain functional connectome outcomes via network-based vector-on-matrix regression

    Authors: Tong Lu, Yuan Zhang, Vince Lyzinski, Chuan Bi, Peter Kochunov, Elliot Hong, Shuo Chen

    Abstract: The joint analysis of multimodal neuroimaging data is critical in the field of brain research because it reveals complex interactive relationships between neurobiological structures and functions. In this study, we focus on investigating the effects of structural imaging (SI) features, including white matter micro-structure integrity (WMMI) and cortical thickness, on the whole brain functional con…

    Submitted 27 October, 2023; originally announced October 2023.

    Comments: 20 pages, 5 figures, 2 tables

  26. arXiv:2310.18527  [pdf, other]

    stat.ME stat.AP stat.CO

    Multiple Imputation Method for High-Dimensional Neuroimaging Data

    Authors: Tong Lu, Chixiang Chen, Hsin-Hsiung Huang, Peter Kochunov, Elliot Hong, Shuo Chen

    Abstract: Missingness is a common issue for neuroimaging data, and neglecting it in downstream statistical analysis can introduce bias and lead to misguided inferential conclusions. It is therefore crucial to conduct appropriate statistical methods to address this issue. While multiple imputation is a popular technique for handling missing data, its application to neuroimaging data is hindered by high dimen…

    Submitted 27 October, 2023; originally announced October 2023.

    Comments: 13 pages, 5 figures

  27. arXiv:2308.14172  [pdf, other]

    cs.LG cs.AI cs.SI eess.SP stat.ML

    Hypergraph Structure Inference From Data Under Smoothness Prior

    Authors: Bohan Tang, Siheng Chen, Xiaowen Dong

    Abstract: Hypergraphs are important for processing data with higher-order relationships involving more than two entities. In scenarios where explicit hypergraphs are not readily available, it is desirable to infer a meaningful hypergraph structure from the node features to capture the intrinsic relations within the data. However, existing methods either adopt simple pre-defined rules that fail to precisely…

    Submitted 31 August, 2023; v1 submitted 27 August, 2023; originally announced August 2023.

  28. arXiv:2308.12460  [pdf, other]

    stat.ME stat.AP

    Bayesian blockwise inference for joint models of longitudinal and multistate processes

    Authors: Sida Chen, Danilo Alvares, Christopher Jackson, Jessica Barrett

    Abstract: Joint models (JM) for longitudinal and survival data have gained increasing interest and found applications in a wide range of clinical and biomedical settings. These models facilitate the understanding of the relationship between outcomes and enable individualized predictions. In many applications, more complex event processes arise, necessitating joint longitudinal and multistate models. However…

    Submitted 23 August, 2023; originally announced August 2023.

  29. arXiv:2308.08562  [pdf, other]

    stat.AP q-bio.CB

    Bayesian Inference of Phenotypic Plasticity of Cancer Cells Based on Dynamic Model for Temporal Cell Proportion Data

    Authors: Shuli Chen, Yuman Wang, Da Zhou, Jie Hu

    Abstract: Mounting evidence underscores the prevalent hierarchical organization of cancer tissues. At the foundation of this hierarchy reside cancer stem cells, a subset of cells endowed with the pivotal role of engendering the entire cancer tissue through cell differentiation. In recent times, substantial attention has been directed towards the phenomenon of cancer cell plasticity, where the dynamic interc…

    Submitted 14 August, 2023; originally announced August 2023.

  30. arXiv:2308.07047  [pdf, other]

    cs.LG stat.ML

    No Regularization is Needed: An Efficient and Effective Model for Incomplete Label Distribution Learning

    Authors: Xiang Li, Songcan Chen

    Abstract: Label Distribution Learning (LDL) assigns soft labels, a.k.a. degrees, to a sample. In reality, it is always laborious to obtain complete degrees, giving birth to the Incomplete LDL (InLDL). However, InLDL often suffers from performance degeneration. To remedy it, existing methods need one or more explicit regularizations, leading to burdensome parameter tuning and extra computation. We argue that…

    Submitted 14 August, 2023; originally announced August 2023.

    Comments: 9 pages, 4 figures

    Journal ref: The 33rd International Joint Conference on Artificial Intelligence (IJCAI 2024)

  31. arXiv:2307.14085  [pdf, other]

    cs.LG cs.AI math.ST stat.ML

    Actions Speak What You Want: Provably Sample-Efficient Reinforcement Learning of the Quantal Stackelberg Equilibrium from Strategic Feedbacks

    Authors: Siyu Chen, Mengdi Wang, Zhuoran Yang

    Abstract: We study reinforcement learning (RL) for learning a Quantal Stackelberg Equilibrium (QSE) in an episodic Markov game with a leader-follower structure. Specifically, at the outset of the game, the leader announces her policy to the follower and commits to it. The follower observes the leader's policy and, in turn, adopts a quantal response policy by solving an entropy-regularized policy optimization…

    Submitted 26 July, 2023; originally announced July 2023.

    Comments: 129 pages, 1 figure

  32. arXiv:2307.12496  [pdf, ps, other]

    cs.LG cs.DS stat.ML

    A faster and simpler algorithm for learning shallow networks

    Authors: Sitan Chen, Shyam Narayanan

    Abstract: We revisit the well-studied problem of learning a linear combination of $k$ ReLU activations given labeled examples drawn from the standard $d$-dimensional Gaussian measure. Chen et al. [CDG+23] recently gave the first algorithm for this problem to run in $\text{poly}(d,1/\varepsilon)$ time when $k = O(1)$, where $\varepsilon$ is the target error. More precisely, their algorithm runs in time…

    Submitted 23 July, 2023; originally announced July 2023.

    Comments: 14 pages

  33. arXiv:2307.01178  [pdf, ps, other]

    cs.DS cs.LG stat.ML

    Learning Mixtures of Gaussians Using the DDPM Objective

    Authors: Kulin Shah, Sitan Chen, Adam Klivans

    Abstract: Recent works have shown that diffusion models can learn essentially any distribution provided one can perform score estimation. Yet it remains poorly understood under what settings score estimation is possible, let alone when practical gradient-based algorithms for this task can provably succeed. In this work, we give the first provably efficient results along these lines for one of the most fun…

    Submitted 3 July, 2023; originally announced July 2023.

    Comments: 48 pages

  34. arXiv:2305.19461  [pdf, other]

    math.ST stat.ME

    Residual spectrum: Brain functional connectivity detection beyond coherence

    Authors: Yuichi Goto, Xuze Zhang, Benjamin Kedem, Shuo Chen

    Abstract: Coherence is a widely used measure to assess linear relationships between time series. However, it fails to capture nonlinear dependencies. To overcome this limitation, this paper introduces the notion of residual spectral density as a higher-order extension of the squared coherence. The method is based on an orthogonal decomposition of time series regression models. We propose a test for the exis…

    Submitted 17 May, 2024; v1 submitted 30 May, 2023; originally announced May 2023.

    MSC Class: 62M15; 62M10; 62G10

  35. arXiv:2305.11798  [pdf, ps, other]

    cs.LG math.ST stat.ML

    The probability flow ODE is provably fast

    Authors: Sitan Chen, Sinho Chewi, Holden Lee, Yuanzhi Li, Jianfeng Lu, Adil Salim

    Abstract: We provide the first polynomial-time convergence guarantees for the probability flow ODE implementation (together with a corrector step) of score-based generative modeling. Our analysis is carried out in the wake of recent results obtaining such guarantees for the SDE-based implementation (i.e., denoising diffusion probabilistic modeling or DDPM), but requires the development of novel techniques f…

    Submitted 19 May, 2023; originally announced May 2023.

    Comments: 23 pages, 2 figures

  36. arXiv:2305.07642  [pdf, other]

    cs.CV cs.AI cs.LG stat.ML

    The ASNR-MICCAI Brain Tumor Segmentation (BraTS) Challenge 2023: Intracranial Meningioma

    Authors: Dominic LaBella, Maruf Adewole, Michelle Alonso-Basanta, Talissa Altes, Syed Muhammad Anwar, Ujjwal Baid, Timothy Bergquist, Radhika Bhalerao, Sully Chen, Verena Chung, Gian-Marco Conte, Farouk Dako, James Eddy, Ivan Ezhov, Devon Godfrey, Fathi Hilal, Ariana Familiar, Keyvan Farahani, Juan Eugenio Iglesias, Zhifan Jiang, Elaine Johanson, Anahita Fathi Kazerooni, Collin Kent, John Kirkpatrick, Florian Kofler, et al. (35 additional authors not shown)

    Abstract: Meningiomas are the most common primary intracranial tumor in adults and can be associated with significant morbidity and mortality. Radiologists, neurosurgeons, neuro-oncologists, and radiation oncologists rely on multiparametric MRI (mpMRI) for diagnosis, treatment planning, and longitudinal treatment monitoring; yet automated, objective, and quantitative tools for non-invasive assessment of men…

    Submitted 12 May, 2023; originally announced May 2023.

  37. arXiv:2305.04086  [pdf, other]

    stat.ML math.OC

    Efficient Learning for Selecting Top-m Context-Dependent Designs

    Authors: Gongbo Zhang, Sihua Chen, Kuihua Huang, Yijie Peng

    Abstract: We consider a simulation optimization problem for a context-dependent decision-making, which aims to determine the top-m designs for all contexts. Under a Bayesian framework, we formulate the optimal dynamic sampling decision as a stochastic dynamic programming problem, and develop a sequential sampling policy to efficiently learn the performance of each design under each context. The asymptotical…

    Submitted 9 June, 2023; v1 submitted 6 May, 2023; originally announced May 2023.

  38. arXiv:2305.01596  [pdf, other]

    stat.ME stat.AP

    Network method for voxel-pair-level brain connectivity analysis under spatial-contiguity constraints

    Authors: Tong Lu, Yuan Zhang, Peter Kochunov, Elliot Hong, Shuo Chen

    Abstract: Brain connectome analysis commonly compresses high-resolution brain scans (typically composed of millions of voxels) down to only hundreds of regions of interest (ROIs) by averaging within-ROI signals. This huge dimension reduction improves computational speed and the morphological properties of anatomical structures; however, it also comes at the cost of substantial losses in spatial specificity…

    Submitted 2 May, 2023; originally announced May 2023.

    Comments: 25 pages, 6 figures

  39. arXiv:2304.10524  [pdf, other]

    cs.LG cs.DS stat.ML

    Learning Narrow One-Hidden-Layer ReLU Networks

    Authors: Sitan Chen, Zehao Dou, Surbhi Goel, Adam R Klivans, Raghu Meka

    Abstract: We consider the well-studied problem of learning a linear combination of $k$ ReLU activations with respect to a Gaussian distribution on inputs in $d$ dimensions. We give the first polynomial-time algorithm that succeeds whenever $k$ is a constant. All prior polynomial-time learners require additional assumptions on the network, such as positive combining coefficients or the matrix of hidden weigh…

    Submitted 20 April, 2023; originally announced April 2023.

    Comments: 33 pages, comments welcome

  40. arXiv:2304.08553  [pdf, other]

    stat.ME

    A New Representation of Uniform-Block Matrix and Applications

    Authors: Yifan Yang, Hwiyoung Lee, Shuo Chen

    Abstract: A covariance matrix with a special pattern (e.g., sparsity or block structure) is essential for conducting multivariate analysis on high-dimensional data. Recently, a block covariance or correlation pattern has been observed in various biological and biomedical studies, such as gene expression, proteomics, neuroimaging, exposome, and seed quality, among others. Specifically, this pattern partition…

    Submitted 17 April, 2023; originally announced April 2023.

  41. arXiv:2303.11187  [pdf, other]

    cs.LG cs.AI stat.ML

    A Unified Framework of Policy Learning for Contextual Bandit with Confounding Bias and Missing Observations

    Authors: Siyu Chen, Yitan Wang, Zhaoran Wang, Zhuoran Yang

    Abstract: We study the offline contextual bandit problem, where we aim to acquire an optimal policy using observational data. However, this data usually contains two deficiencies: (i) some variables that confound actions are not observed, and (ii) missing observations exist in the collected data. Unobserved confounders lead to a confounding bias and missing observations cause bias and inefficiency problems.…

    Submitted 20 March, 2023; originally announced March 2023.

    Comments: 76 pages, 5 figures

  42. arXiv:2303.08613  [pdf, ps, other]

    cs.LG cs.AI cs.GT econ.TH stat.ML

    Learning to Incentivize Information Acquisition: Proper Scoring Rules Meet Principal-Agent Model

    Authors: Siyu Chen, Jibang Wu, Yifan Wu, Zhuoran Yang

    Abstract: We study the incentivized information acquisition problem, where a principal hires an agent to gather information on her behalf. Such a problem is modeled as a Stackelberg game between the principal and the agent, where the principal announces a scoring rule that specifies the payment, and the agent then chooses an effort level that maximizes her own profit and reports the information. We stu…

    Submitted 6 August, 2023; v1 submitted 15 March, 2023; originally announced March 2023.

    Comments: 35 pages, adding an impossible result (Lemma 3.2) with its proof in Section D.1

  43. arXiv:2303.03520  [pdf, other]

    stat.ME

    The Effect of Alcohol Consumption on Brain Ageing: A New Causal Inference Framework for Incomplete and Massive Phenomic Data

    Authors: Chixiang Chen, Shuo Chen, Zhenyao Ye, Xu Shi, Tianzhou Ma

    Abstract: Although substance use, such as alcohol consumption, is known to be associated with cognitive decline during ageing, its direct influence on the central nervous system remains unclear. In this study, we aim to investigate the potential influence of alcohol intake frequency on accelerated brain ageing by estimating the mean potential brain-age gap (BAG) index, the difference between brain age and a…

    Submitted 4 March, 2024; v1 submitted 6 March, 2023; originally announced March 2023.

    Comments: Contact: [email protected]

  44. arXiv:2303.03512  [pdf, other]

    stat.ME

    An Efficient Data Integration Scheme for Synthesizing Information from Multiple Secondary Datasets for the Parameter Inference of the Main Analysis

    Authors: Chixiang Chen, Ming Wang, Shuo Chen

    Abstract: Many observational studies and clinical trials collect various secondary outcomes that may be highly correlated with the primary endpoint. These secondary outcomes are often analyzed in secondary analyses separately from the main data analysis. However, these secondary outcomes can be used to improve the estimation precision in the main analysis. We propose a method called Multiple Information Bor…

    Submitted 6 March, 2023; originally announced March 2023.

    Comments: Contact Email: [email protected]

  45. arXiv:2303.03497  [pdf, other]

    stat.ME

    Integrative data analysis where partial covariates have complex non-linear effects by using summary information from an external data

    Authors: Jia Liang, Shuo Chen, Peter Kochunov, L Elliot Hong, Chixiang Chen

    Abstract: A full parametric and linear specification may be insufficient to capture complicated patterns in studies exploring complex features, such as those investigating age-related changes in brain functional abilities. Alternatively, a partially linear model (PLM) consisting of both parametric and non-parametric elements may have a better fit. This model has been widely applied in economics, environment…

    Submitted 5 February, 2024; v1 submitted 6 March, 2023; originally announced March 2023.

    Comments: Contact Email: chixiang.chen [at] som [dot] umaryland [dot] edu

  46. arXiv:2303.03384  [pdf, ps, other]

    cs.LG math.ST stat.ML

    Restoration-Degradation Beyond Linear Diffusions: A Non-Asymptotic Analysis For DDIM-Type Samplers

    Authors: Sitan Chen, Giannis Daras, Alexandros G. Dimakis

    Abstract: We develop a framework for non-asymptotic analysis of deterministic samplers used for diffusion generative modeling. Several recent works have analyzed stochastic samplers using tools like Girsanov's theorem and a chain rule variant of the interpolation argument. Unfortunately, these techniques give vacuous bounds when applied to deterministic samplers. We give a new operational interpretation for…

    Submitted 6 March, 2023; originally announced March 2023.

    Comments: 29 pages

  47. Bayesian Nonlinear Tensor Regression with Functional Fused Elastic Net Prior

    Authors: Shuoli Chen, Kejun He, Shiyuan He, Yang Ni, Raymond K. W. Wong

    Abstract: Tensor regression methods have been widely used to predict a scalar response from covariates in the form of a multiway array. In many applications, the regions of tensor covariates used for prediction are often spatially connected with unknown shapes and discontinuous jumps on the boundaries. Moreover, the relationship between the response and the tensor covariates can be nonlinear. In this articl…

    Submitted 16 February, 2023; originally announced February 2023.

    Journal ref: Technometrics, 65:4, 524-536 (2023)

  48. arXiv:2302.04552  [pdf, ps, other]

    cs.LG stat.ML

    Optimistic Online Mirror Descent for Bridging Stochastic and Adversarial Online Convex Optimization

    Authors: Sijia Chen, Yu-Jie Zhang, Wei-Wei Tu, Peng Zhao, Lijun Zhang

    Abstract: The Stochastically Extended Adversarial (SEA) model was introduced by Sachs et al. [2022] as an interpolation between stochastic and adversarial online convex optimization. Under the smoothness condition, they demonstrate that the expected regret of optimistic follow-the-regularized-leader (FTRL) depends on the cumulative stochastic variance $\sigma_{1:T}^2$ and the cumulative adversarial variation…

    Submitted 16 March, 2024; v1 submitted 9 February, 2023; originally announced February 2023.

    Comments: v3 substantially improves the presentation and has a few improvements, including the regret bound for strongly convex functions; v2 is an extended version that enriches the content with improved regret bounds for strongly convex functions, discussions on the optimism design for dynamic regret minimization, and extensions to non-smooth scenarios; v1 is the ICML 2023 conference version

  49. arXiv:2302.01861  [pdf, other]

    stat.ME

    Covariance Matrix Estimation for High-Throughput Biomedical Data with Interconnected Communities

    Authors: Yifan Yang, Chixiang Chen, Shuo Chen

    Abstract: Estimating a covariance matrix is central to high-dimensional data analysis. Empirical analyses of high-dimensional biomedical data, including genomics, proteomics, microbiome, and neuroimaging, among others, consistently reveal strong modularity in the dependence patterns. In these analyses, intercorrelated high-dimensional biomedical features often form communities or modules that can be interco…

    Submitted 15 November, 2023; v1 submitted 3 February, 2023; originally announced February 2023.

    Comments: 24 pages, 3 figures

  50. arXiv:2301.03246  [pdf, other]

    stat.ME

    An instrumental variable method for point processes: generalised Wald estimation based on deconvolution

    Authors: Zhichao Jiang, Shizhe Chen, Peng Ding

    Abstract: Point processes are probabilistic tools for modeling event data. While there exists a fast-growing literature studying the relationships between point processes, it remains unexplored how such relationships connect to causal effects. In the presence of unmeasured confounders, parameters from point process models do not necessarily have causal interpretations. We propose an instrumental variable me…

    Submitted 9 January, 2023; originally announced January 2023.