Skip to main content

Showing 1–50 of 210 results for author: Wang, T

Searching in archive stat. Search in all archives.
.
  1. arXiv:2408.00920  [pdf, other

    cs.LG stat.ML

    Towards Certified Unlearning for Deep Neural Networks

    Authors: Binchi Zhang, Yushun Dong, Tianhao Wang, Jundong Li

    Abstract: In the field of machine unlearning, certified unlearning has been extensively studied in convex machine learning models due to its high efficiency and strong theoretical guarantees. However, its application to deep neural networks (DNNs), known for their highly nonconvex nature, still poses challenges. To bridge the gap between certified unlearning and DNNs, we propose several simple techniques to… ▽ More

    Submitted 1 August, 2024; originally announced August 2024.

    Comments: ICML 2024

  2. arXiv:2407.19446  [pdf, ps, other

    cs.IT stat.ML

    Leave-One-Out Analysis for Nonconvex Robust Matrix Completion with General Thresholding Functions

    Authors: Tianming Wang, Ke Wei

    Abstract: We study the problem of robust matrix completion (RMC), where the partially observed entries of an underlying low-rank matrix is corrupted by sparse noise. Existing analysis of the non-convex methods for this problem either requires the explicit but empirically redundant regularization in the algorithm or requires sample splitting in the analysis. In this paper, we consider a simple yet efficient… ▽ More

    Submitted 28 July, 2024; originally announced July 2024.

  3. arXiv:2407.03619  [pdf, other

    stat.ME

    Multivariate Representations of Univariate Marked Hawkes Processes

    Authors: Louis Davis, Conor Kresin, Boris Baeumer, Ting Wang

    Abstract: Univariate marked Hawkes processes are used to model a range of real-world phenomena including earthquake aftershock sequences, contagious disease spread, content diffusion on social media platforms, and order book dynamics. This paper illustrates a fundamental connection between univariate marked Hawkes processes and multivariate Hawkes processes. Exploiting this connection renders a framework th… ▽ More

    Submitted 4 July, 2024; originally announced July 2024.

    Comments: 26 pages, 3 figures, submitted to the Annals of Statistics

  4. arXiv:2407.02754  [pdf, other

    math.ST stat.ME

    Is Cross-Validation the Gold Standard to Evaluate Model Performance?

    Authors: Garud Iyengar, Henry Lam, Tianyu Wang

    Abstract: Cross-Validation (CV) is the default choice for evaluating the performance of machine learning models. Despite its wide usage, their statistical benefits have remained half-understood, especially in challenging nonparametric regimes. In this paper we fill in this gap and show that in fact, for a wide spectrum of models, CV does not statistically outperform the simple "plug-in" approach where one r… ▽ More

    Submitted 20 August, 2024; v1 submitted 2 July, 2024; originally announced July 2024.

  5. arXiv:2406.12843  [pdf, other

    cs.LG cs.AI stat.ML

    Can Go AIs be adversarially robust?

    Authors: Tom Tseng, Euan McLean, Kellin Pelrine, Tony T. Wang, Adam Gleave

    Abstract: Prior work found that superhuman Go AIs like KataGo can be defeated by simple adversarial strategies. In this paper, we study if simple defenses can improve KataGo's worst-case performance. We test three natural defenses: adversarial training on hand-constructed positions, iterated adversarial training, and changing the network architecture. We find that some of these defenses are able to protect… ▽ More

    Submitted 18 June, 2024; originally announced June 2024.

    Comments: 67 pages

  6. arXiv:2406.11011  [pdf, other

    cs.LG cs.CL stat.ML

    Data Shapley in One Training Run

    Authors: Jiachen T. Wang, Prateek Mittal, Dawn Song, Ruoxi Jia

    Abstract: Data Shapley provides a principled framework for attributing data's contribution within machine learning contexts. However, existing approaches require re-training models on different data subsets, which is computationally intensive, foreclosing their application to large-scale models. Furthermore, they produce the same attribution score for any models produced by running the learning algorithm, m… ▽ More

    Submitted 29 June, 2024; v1 submitted 16 June, 2024; originally announced June 2024.

  7. ZIKQ: An innovative centile chart method for utilizing natural history data in rare disease clinical development

    Authors: Tianying Wang, Wenfei Zhang, Ying Wei

    Abstract: Utilizing natural history data as external control plays an important role in the clinical development of rare diseases, since placebo groups in double-blind randomization trials may not be available due to ethical reasons and low disease prevalence. This article proposed an innovative approach for utilizing natural history data to support rare disease clinical development by constructing referenc… ▽ More

    Submitted 27 May, 2024; originally announced May 2024.

  8. arXiv:2405.17248  [pdf, other

    stat.ML cs.LG

    Transformer In-Context Learning for Categorical Data

    Authors: Aaron T. Wang, Ricardo Henao, Lawrence Carin

    Abstract: Recent research has sought to understand Transformers through the lens of in-context learning with functional data. We extend that line of work with the goal of moving closer to language models, considering categorical outcomes, nonlinear underlying models, and nonlinear attention. The contextual data are of the form $\textsf{C}=(x_1,c_1,\dots,x_N,c_{N})$ where each $c_i\in\{0,\dots,C-1\}$ is draw… ▽ More

    Submitted 27 May, 2024; originally announced May 2024.

  9. arXiv:2405.12437  [pdf

    stat.AP

    Considerations for Single-Arm Trials to Support Accelerated Approval of Oncology Drugs

    Authors: Feinan Lu, Tao Wang, Ying Lu, Jie Chen

    Abstract: In the last two decades, single-arm trials (SATs) have been effectively used to study anticancer therapies in well-defined patient populations using durable response rates as an objective and interpretable clinical endpoints. With a growing trend of regulatory accelerated approval (AA) requiring randomized controlled trials (RCTs), some confusions have arisen about the roles of SATs in AA. This pa… ▽ More

    Submitted 20 May, 2024; originally announced May 2024.

  10. arXiv:2405.09841  [pdf, other

    stat.ML cs.LG

    Simultaneous Identification of Sparse Structures and Communities in Heterogeneous Graphical Models

    Authors: Dapeng Shi, Tiandong Wang, Zhiliang Ying

    Abstract: Exploring and detecting community structures hold significant importance in genetics, social sciences, neuroscience, and finance. Especially in graphical models, community detection can encourage the exploration of sets of variables with group-like properties. In this paper, within the framework of Gaussian graphical models, we introduce a novel decomposition of the underlying graphical structure… ▽ More

    Submitted 16 May, 2024; originally announced May 2024.

    Comments: 61 pages, 11 figures, 4 tables

  11. arXiv:2405.05733  [pdf, other

    stat.ML cs.LG

    Batched Stochastic Bandit for Nondegenerate Functions

    Authors: Yu Liu, Yunlu Shu, Tianyu Wang

    Abstract: This paper studies batched bandit learning problems for nondegenerate functions. We introduce an algorithm that solves the batched bandit problem for nondegenerate functions near-optimally. More specifically, we introduce an algorithm, called Geometric Narrowing (GN), whose regret bound is of order $\widetilde{\mathcal{O}} ( A_{+}^d \sqrt{T} )$. In addition, GN only needs… ▽ More

    Submitted 29 August, 2024; v1 submitted 9 May, 2024; originally announced May 2024.

    Comments: 34 pages, 14 colored figures

  12. arXiv:2405.03875  [pdf, other

    cs.LG stat.ML

    Rethinking Data Shapley for Data Selection Tasks: Misleads and Merits

    Authors: Jiachen T. Wang, Tianji Yang, James Zou, Yongchan Kwon, Ruoxi Jia

    Abstract: Data Shapley provides a principled approach to data valuation and plays a crucial role in data-centric machine learning (ML) research. Data selection is considered a standard application of Data Shapley. However, its data selection performance has shown to be inconsistent across settings in the literature. This study aims to deepen our understanding of this phenomenon. We introduce a hypothesis te… ▽ More

    Submitted 6 May, 2024; originally announced May 2024.

    Comments: ICML 2024

  13. arXiv:2404.13964  [pdf, other

    cs.LG econ.GN stat.ME

    An Economic Solution to Copyright Challenges of Generative AI

    Authors: Jiachen T. Wang, Zhun Deng, Hiroaki Chiba-Okabe, Boaz Barak, Weijie J. Su

    Abstract: Generative artificial intelligence (AI) systems are trained on large data corpora to generate new pieces of text, images, videos, and other media. There is growing concern that such systems may infringe on the copyright interests of training data contributors. To address the copyright challenges of generative AI, we propose a framework that compensates copyright owners proportionally to their cont… ▽ More

    Submitted 24 April, 2024; v1 submitted 22 April, 2024; originally announced April 2024.

  14. arXiv:2404.01478  [pdf, other

    stat.AP

    A Multidimensional Fractional Hawkes Process for Multiple Earthquake Mainshock Aftershock Sequences

    Authors: Louis Davis, Boris Baeumer, Ting Wang

    Abstract: Most point process models for earthquakes currently in the literature assume the magnitude distribution is i.i.d. potentially hindering the ability of the model to describe the main features of data sets containing multiple earthquake mainshock aftershock sequences in succession. This study presents a novel multidimensional fractional Hawkes process model designed to capture magnitude dependent tr… ▽ More

    Submitted 1 April, 2024; originally announced April 2024.

    Comments: 37 pages, 10 tables, 3 figures

  15. arXiv:2403.08699  [pdf, ps, other

    cs.LG cs.AI math.OC stat.ML

    Implicit Regularization of Gradient Flow on One-Layer Softmax Attention

    Authors: Heejune Sheen, Siyu Chen, Tianhao Wang, Harrison H. Zhou

    Abstract: We study gradient flow on the exponential loss for a classification problem with a one-layer softmax attention model, where the key and query weight matrices are trained separately. Under a separability assumption on the data, we show that when gradient flow achieves the minimal loss value, it further implicitly minimizes the nuclear norm of the product of the key and query weight matrices. Such i… ▽ More

    Submitted 13 March, 2024; originally announced March 2024.

    Comments: 34 pages

  16. arXiv:2403.06783  [pdf

    stat.ME

    A doubly robust estimator for the Mann Whitney Wilcoxon Rank Sum Test when applied for causal inference in observational studies

    Authors: Ruohui Chen, Tuo Lin, Lin Liu, Jinyuan Liu, Ruifeng Chen, Jingjing Zou, Chenyu Liu, Loki Natarajan, Tang Wang, Xinlian Zhang, Xin Tu

    Abstract: The Mann-Whitney-Wilcoxon rank sum test (MWWRST) is a widely used method for comparing two treatment groups in randomized control trials, particularly when dealing with highly skewed data. However, when applied to observational study data, the MWWRST often yields invalid results for causal inference. To address this limitation, Wu et al. (2014) introduced an approach that incorporates inverse prob… ▽ More

    Submitted 11 March, 2024; originally announced March 2024.

  17. arXiv:2403.03183  [pdf, other

    cs.LG cs.AI math.OC stat.ML

    How Well Can Transformers Emulate In-context Newton's Method?

    Authors: Angeliki Giannou, Liu Yang, Tianhao Wang, Dimitris Papailiopoulos, Jason D. Lee

    Abstract: Transformer-based models have demonstrated remarkable in-context learning capabilities, prompting extensive research into its underlying mechanisms. Recent studies have suggested that Transformers can implement first-order optimization algorithms for in-context learning and even second order ones for the case of linear regression. In this work, we study whether Transformers can perform higher orde… ▽ More

    Submitted 5 March, 2024; originally announced March 2024.

  18. arXiv:2403.00142  [pdf, other

    stat.AP

    A Fractional Model for Earthquakes

    Authors: Louis Davis, Boris Baeumer, Ting Wang

    Abstract: This paper extends the existing fractional Hawkes process to better model mainshock-aftershock sequences of earthquakes. The fractional Hawkes process is a self-exciting point process model with temporal decay kernel being a Mittag-Leffler function. A maximum likelihood estimation scheme is developed and its consistency is checked. It is then compared to the ETAS model on three earthquake sequence… ▽ More

    Submitted 29 February, 2024; originally announced March 2024.

    Comments: 16 pages, 7 figure, submitted to the Journal of the Royal Statistical Society Series C

  19. arXiv:2402.19442  [pdf, other

    cs.LG cs.AI math.OC math.ST stat.ML

    Training Dynamics of Multi-Head Softmax Attention for In-Context Learning: Emergence, Convergence, and Optimality

    Authors: Siyu Chen, Heejune Sheen, Tianhao Wang, Zhuoran Yang

    Abstract: We study the dynamics of gradient flow for training a multi-head softmax attention model for in-context learning of multi-task linear regression. We establish the global convergence of gradient flow under suitable choices of initialization. In addition, we prove that an interesting "task allocation" phenomenon emerges during the gradient flow dynamics, where each attention head focuses on solving… ▽ More

    Submitted 10 June, 2024; v1 submitted 29 February, 2024; originally announced February 2024.

    Comments: 141 pages, 7 figures

  20. arXiv:2402.16661  [pdf, other

    stat.ML cs.LG stat.ME

    Penalized Generative Variable Selection

    Authors: Tong Wang, Jian Huang, Shuangge Ma

    Abstract: Deep networks are increasingly applied to a wide variety of data, including data with high-dimensional predictors. In such analysis, variable selection can be needed along with estimation/model building. Many of the existing deep network studies that incorporate variable selection have been limited to methodological and numerical developments. In this study, we consider modeling/estimation using t… ▽ More

    Submitted 26 February, 2024; originally announced February 2024.

  21. arXiv:2402.15620  [pdf, other

    stat.AP econ.GN physics.soc-ph

    Comparison of sectoral structures between China and Japan: A network perspective

    Authors: Tao Wang, Shiying Xiao, Jun Yan

    Abstract: Economic structure comparisons between China and Japan have long captivated development economists. To delve deeper into their sectoral differences from 1995 to 2018, we used the annual input-output tables (IOTs) of both nations to construct weighted and directed input-output networks (IONs). This facilitated deeper network analyses. Strength distributions underscored variations in inter-sector ec… ▽ More

    Submitted 23 February, 2024; originally announced February 2024.

  22. arXiv:2402.14090  [pdf, other

    cs.AI econ.GN stat.ML

    Social Environment Design

    Authors: Edwin Zhang, Sadie Zhao, Tonghan Wang, Safwan Hossain, Henry Gasztowtt, Stephan Zheng, David C. Parkes, Milind Tambe, Yiling Chen

    Abstract: Artificial Intelligence (AI) holds promise as a technology that can be used to improve government and economic policy-making. This paper proposes a new research agenda towards this end by introducing Social Environment Design, a general framework for the use of AI for automated policy-making that connects with the Reinforcement Learning, EconCS, and Computational Social Choice communities. The fra… ▽ More

    Submitted 17 June, 2024; v1 submitted 21 February, 2024; originally announced February 2024.

    Comments: ICML 2024 Position Paper. Website at https://sed.eddie.win

  23. arXiv:2402.13259  [pdf, other

    stat.ME cs.CE math.NA math.PR

    Fast Discrete-Event Simulation of Markovian Queueing Networks through Euler Approximation

    Authors: L. Jeff Hong, Yingda Song, Tan Wang

    Abstract: The efficient management of large-scale queueing networks is critical for a variety of sectors, including healthcare, logistics, and customer service, where system performance has profound implications for operational effectiveness and cost management. To address this key challenge, our paper introduces simulation techniques tailored for complex, large-scale Markovian queueing networks. We develop… ▽ More

    Submitted 2 February, 2024; originally announced February 2024.

  24. arXiv:2402.09702  [pdf, other

    cs.LG stat.ML

    Sparse and Faithful Explanations Without Sparse Models

    Authors: Yiyang Sun, Zhi Chen, Vittorio Orlandi, Tong Wang, Cynthia Rudin

    Abstract: Even if a model is not globally sparse, it is possible for decisions made from that model to be accurately and faithfully described by a small number of features. For instance, an application for a large loan might be denied to someone because they have no credit history, which overwhelms any evidence towards their creditworthiness. In this work, we introduce the Sparse Explanation Value (SEV), a… ▽ More

    Submitted 8 March, 2024; v1 submitted 14 February, 2024; originally announced February 2024.

    Comments: Accepted in AISTATS 2024

  25. arXiv:2402.07806  [pdf, other

    stat.AP stat.ME

    A comparison of mixed-models for the analysis of non-linear longitudinal data: application to late-life cognitive trajectories

    Authors: Maude Wagner, Donald R. Hedeker, Tianhao Wang, Graciela Muniz-Terrera, Ana W. Capuano

    Abstract: Several mixed-effects models for longitudinal data have been proposed to accommodate the non-linearity of late-life cognitive trajectories and assess the putative influence of covariates on it. No prior research provides a side-by-side examination of these models to offer guidance on their proper application and interpretation. In this work, we examined five statistical approaches previously used… ▽ More

    Submitted 6 March, 2024; v1 submitted 12 February, 2024; originally announced February 2024.

    Comments: 34 pages, 7 Figures, 1 Table

  26. arXiv:2401.13094  [pdf, other

    stat.ME

    On cross-validated estimation of skew normal model

    Authors: Jian Zhang, Tong Wang

    Abstract: Skew normal model suffers from inferential drawbacks, namely singular Fisher information in the vicinity of symmetry and diverging of maximum likelihood estimation. To address the above drawbacks, Azzalini and Arellano-Valle (2013) introduced maximum penalised likelihood estimation (MPLE) by subtracting a penalty function from the log-likelihood function with a pre-specified penalty coefficient. H… ▽ More

    Submitted 23 January, 2024; originally announced January 2024.

  27. arXiv:2401.11103  [pdf, other

    cs.DS cs.LG stat.ML

    Efficient Data Shapley for Weighted Nearest Neighbor Algorithms

    Authors: Jiachen T. Wang, Prateek Mittal, Ruoxi Jia

    Abstract: This work aims to address an open problem in data valuation literature concerning the efficient computation of Data Shapley for weighted $K$ nearest neighbor algorithm (WKNN-Shapley). By considering the accuracy of hard-label KNN with discretized weights as the utility function, we reframe the computation of WKNN-Shapley into a counting problem and introduce a quadratic-time algorithm, presenting… ▽ More

    Submitted 19 January, 2024; originally announced January 2024.

    Comments: AISTATS 2024 Oral

  28. arXiv:2312.16260  [pdf, other

    stat.ME

    Multinomial Link Models

    Authors: Tianmeng Wang, Liping Tong, Jie Yang

    Abstract: We propose a unified multinomial link model for analyzing categorical responses. It not only covers the existing multinomial logistic models and their extensions as special cases, but also includes new models that can incorporate the observations with NA or Unknown responses in the data analysis. We provide explicit formulae and detailed algorithms for finding the maximum likelihood estimates of t… ▽ More

    Submitted 18 June, 2024; v1 submitted 26 December, 2023; originally announced December 2023.

    Comments: 39 pages, 5 figures

  29. arXiv:2312.06204  [pdf, ps, other

    stat.ME

    Multilayer Network Regression with Eigenvector Centrality and Community Structure

    Authors: Zhuoye Han, Tiandong Wang

    Abstract: In the analysis of complex networks, centrality measures and community structures are two important aspects. For multilayer networks, one crucial task is to integrate information across different layers, especially taking the dependence structure within and between layers into consideration. In this study, we introduce a novel two-stage regression model (CC-MNetR) that leverages the eigenvector ce… ▽ More

    Submitted 11 May, 2024; v1 submitted 11 December, 2023; originally announced December 2023.

  30. arXiv:2312.03561  [pdf

    stat.ME cs.CY cs.LG

    Blueprinting the Future: Automatic Item Categorization using Hierarchical Zero-Shot and Few-Shot Classifiers

    Authors: Ting Wang, Keith Stelter, Jenn Floyd, Thomas O'Neill, Nathaniel Hendrix, Andrew Bazemore, Kevin Rode, Warren Newton

    Abstract: In testing industry, precise item categorization is pivotal to align exam questions with the designated content domains outlined in the assessment blueprint. Traditional methods either entail manual classification, which is laborious and error-prone, or utilize machine learning requiring extensive training data, often leading to model underfit or overfit issues. This study unveils a novel approach… ▽ More

    Submitted 6 December, 2023; originally announced December 2023.

  31. arXiv:2311.02618  [pdf, other

    stat.AP

    Regionalization of China's PM2.5 through Robust Spatio temporal Functional Clustering Method

    Authors: Tingyin Wang, Xueqin Wang, Xiaobo Guo, Heping Zhang

    Abstract: The patterns of particulate matter with diameters that are generally 2.5 micrometers and smaller (PM2.5) are heterogeneous in China nationwide but can be homogeneous region-wide. To reduce the adverse effects from PM2.5, policymakers need to develop location-specific regulations based on nationwide clustering analysis of PM2.5 concentrations. However, such an analysis is challenging because the da… ▽ More

    Submitted 5 November, 2023; originally announced November 2023.

  32. arXiv:2310.09999  [pdf, other

    stat.ML cs.LG eess.SP

    Outlier Detection Using Generative Models with Theoretical Performance Guarantees

    Authors: Jirong Yi, Jingchao Gao, Tianming Wang, Xiaodong Wu, Weiyu Xu

    Abstract: This paper considers the problem of recovering signals modeled by generative models from linear measurements contaminated with sparse outliers. We propose an outlier detection approach for reconstructing the ground-truth signals modeled by generative models under sparse outliers. We establish theoretical recovery guarantees for reconstruction of signals using generative models in the presence of o… ▽ More

    Submitted 15 October, 2023; originally announced October 2023.

    Comments: arXiv admin note: substantial text overlap with arXiv:1810.11335

  33. arXiv:2310.06715  [pdf, other

    cs.LG eess.SP stat.ML

    S4Sleep: Elucidating the design space of deep-learning-based sleep stage classification models

    Authors: Tiezhi Wang, Nils Strodthoff

    Abstract: Scoring sleep stages in polysomnography recordings is a time-consuming task plagued by significant inter-rater variability. Therefore, it stands to benefit from the application of machine learning algorithms. While many algorithms have been proposed for this purpose, certain critical architectural decisions have not received systematic exploration. In this study, we meticulously investigate these… ▽ More

    Submitted 21 August, 2024; v1 submitted 10 October, 2023; originally announced October 2023.

    Comments: 33 pages, 3 figures, code available at https://github.com/AI4HealthUOL/s4sleep

  34. arXiv:2309.16578  [pdf, other

    stat.ML cs.LG physics.chem-ph

    Overcoming the Barrier of Orbital-Free Density Functional Theory for Molecular Systems Using Deep Learning

    Authors: He Zhang, Siyuan Liu, Jiacheng You, Chang Liu, Shuxin Zheng, Ziheng Lu, Tong Wang, Nanning Zheng, Bin Shao

    Abstract: Orbital-free density functional theory (OFDFT) is a quantum chemistry formulation that has a lower cost scaling than the prevailing Kohn-Sham DFT, which is increasingly desired for contemporary molecular research. However, its accuracy is limited by the kinetic energy density functional, which is notoriously hard to approximate for non-periodic molecular systems. Here we propose M-OFDFT, an OFDFT… ▽ More

    Submitted 9 March, 2024; v1 submitted 28 September, 2023; originally announced September 2023.

    Comments: Published in Nature Computational Science, March 2024. Full paper with supplementary information

  35. arXiv:2308.15709  [pdf, other

    cs.LG cs.CR cs.GT stat.ML

    Threshold KNN-Shapley: A Linear-Time and Privacy-Friendly Approach to Data Valuation

    Authors: Jiachen T. Wang, Yuqing Zhu, Yu-Xiang Wang, Ruoxi Jia, Prateek Mittal

    Abstract: Data valuation aims to quantify the usefulness of individual data sources in training machine learning (ML) models, and is a critical aspect of data-centric ML research. However, data valuation faces significant yet frequently overlooked privacy challenges despite its importance. This paper studies these challenges with a focus on KNN-Shapley, one of the most practical data valuation methods nowad… ▽ More

    Submitted 25 November, 2023; v1 submitted 29 August, 2023; originally announced August 2023.

    Comments: NeurIPS 2023 Spotlight

  36. arXiv:2308.13298  [pdf, other

    cs.LG eess.SP stat.ML

    Federated Linear Bandit Learning via Over-the-Air Computation

    Authors: Jiali Wang, Yuning Jiang, Xin Liu, Ting Wang, Yuanming Shi

    Abstract: In this paper, we investigate federated contextual linear bandit learning within a wireless system that comprises a server and multiple devices. Each device interacts with the environment, selects an action based on the received reward, and sends model updates to the server. The primary objective is to minimize cumulative regret across all devices within a finite time horizon. To reduce the commun… ▽ More

    Submitted 28 August, 2023; v1 submitted 25 August, 2023; originally announced August 2023.

  37. arXiv:2308.10113  [pdf, other

    stat.ML cs.LG cs.SI stat.AP stat.CO

    Modeling Random Networks with Heterogeneous Reciprocity

    Authors: Daniel Cirkovic, Tiandong Wang

    Abstract: Reciprocity, or the tendency of individuals to mirror behavior, is a key measure that describes information exchange in a social network. Users in social networks tend to engage in different levels of reciprocal behavior. Differences in such behavior may indicate the existence of communities that reciprocate links at varying rates. In this paper, we develop methodology to model the diverse recipro… ▽ More

    Submitted 19 August, 2023; originally announced August 2023.

  38. arXiv:2306.15163  [pdf, other

    stat.ML cs.LG

    Wasserstein Generative Regression

    Authors: Shanshan Song, Tong Wang, Guohao Shen, Yuanyuan Lin, Jian Huang

    Abstract: In this paper, we propose a new and unified approach for nonparametric regression and conditional distribution learning. Our approach simultaneously estimates a regression function and a conditional generator using a generative learning framework, where a conditional generator is a function that can generate samples from a conditional distribution. The main idea is to estimate a conditional genera… ▽ More

    Submitted 26 June, 2023; originally announced June 2023.

    Comments: 50 pages, including appendix. 5 figures and 6 tables in the main text. 1 figure and 7 tables in the appendix

    MSC Class: 62G08; 68T07

  39. arXiv:2306.10475  [pdf, other

    stat.ME cs.SI physics.soc-ph

    SpreadDetect: Detection of spreading change in a network over time

    Authors: Hanqing Cai, Tengyao Wang

    Abstract: Change-point analysis has been successfully applied to the detect changes in multivariate data streams over time. In many applications, when data are observed over a graph/network, change does not occur simultaneously but instead spread from an initial source coordinate to the neighbouring coordinates over time. We propose a new method, SpreadDetect, that estimates both the source coordinate and t… ▽ More

    Submitted 18 June, 2023; originally announced June 2023.

    Comments: 26 pages,3 figures, 2 tables

  40. arXiv:2306.08280  [pdf, other

    cs.IT cs.CR cs.LG eess.SP stat.ML

    Differentially Private Wireless Federated Learning Using Orthogonal Sequences

    Authors: Xizixiang Wei, Tianhao Wang, Ruiquan Huang, Cong Shen, Jing Yang, H. Vincent Poor

    Abstract: We propose a privacy-preserving uplink over-the-air computation (AirComp) method, termed FLORAS, for single-input single-output (SISO) wireless federated learning (FL) systems. From the perspective of communication designs, FLORAS eliminates the requirement of channel state information at the transmitters (CSIT) by leveraging the properties of orthogonal sequences. From the privacy perspective, we… ▽ More

    Submitted 21 November, 2023; v1 submitted 14 June, 2023; originally announced June 2023.

    Comments: 33 pages, 5 figures

  41. arXiv:2305.18987  [pdf, other

    math.ST stat.ME

    Robust mean change point testing in high-dimensional data with heavy tails

    Authors: Mengchu Li, Yudong Chen, Tengyao Wang, Yi Yu

    Abstract: We study a mean change point testing problem for high-dimensional data, with exponentially- or polynomially-decaying tails. In each case, depending on the $\ell_0$-norm of the mean change vector, we separately consider dense and sparse regimes. We characterise the boundary between the dense and sparse regimes under the above two tail conditions for the first time in the change point literature and… ▽ More

    Submitted 17 June, 2023; v1 submitted 30 May, 2023; originally announced May 2023.

    Comments: 51 pages, 1 figure

  42. arXiv:2305.17284  [pdf, other

    cs.LG stat.ML

    GC-Flow: A Graph-Based Flow Network for Effective Clustering

    Authors: Tianchun Wang, Farzaneh Mirzazadeh, Xiang Zhang, Jie Chen

    Abstract: Graph convolutional networks (GCNs) are \emph{discriminative models} that directly model the class posterior $p(y|\mathbf{x})$ for semi-supervised classification of graph data. While being effective, as a representation learning approach, the node representations extracted from a GCN often miss useful information for effective clustering, because the objectives are different. In this work, we desi… ▽ More

    Submitted 26 May, 2023; originally announced May 2023.

    Comments: ICML 2023. Code is available at https://github.com/xztcwang/GCFlow

  43. arXiv:2305.11509  [pdf, other

    cs.LG stat.ML

    From Random Search to Bandit Learning in Metric Measure Spaces

    Authors: Chuying Han, Yasong Feng, Tianyu Wang

    Abstract: Random Search is one of the most widely-used method for Hyperparameter Optimization, and is critical to the success of deep learning models. Despite its astonishing performance, little non-heuristic theory has been developed to describe the underlying working mechanism. This paper gives a theoretical accounting of Random Search. We introduce the concept of \emph{scattering dimension} that describe… ▽ More

    Submitted 12 February, 2024; v1 submitted 19 May, 2023; originally announced May 2023.

  44. arXiv:2305.06465  [pdf, other

    stat.ME

    Occam Factor for Random Graphs: Erdös-Rényi, Independent Edge, and Rank-1 Stochastic Blockmodel

    Authors: Tianyu Wang, Zachary M. Pisano, Carey E. Priebe

    Abstract: We investigate the evidence/flexibility (i.e., "Occam") paradigm and demonstrate the theoretical and empirical consistency of Bayesian evidence for the task of determining an appropriate generative model for network data. This model selection framework involves determining a collection of candidate models, equipping each of these models' parameters with prior distributions derived via the encompas… ▽ More

    Submitted 7 May, 2024; v1 submitted 10 May, 2023; originally announced May 2023.

  45. arXiv:2305.00054  [pdf, other

    cs.LG cs.AI stat.ML

    LAVA: Data Valuation without Pre-Specified Learning Algorithms

    Authors: Hoang Anh Just, Feiyang Kang, Jiachen T. Wang, Yi Zeng, Myeongseob Ko, Ming Jin, Ruoxi Jia

    Abstract: Traditionally, data valuation (DV) is posed as a problem of equitably splitting the validation performance of a learning algorithm among the training data. As a result, the calculated data values depend on many design choices of the underlying learning algorithm. However, this dependence is undesirable for many DV use cases, such as setting priorities over different data sources in a data acquisit… ▽ More

    Submitted 19 December, 2023; v1 submitted 28 April, 2023; originally announced May 2023.

    Comments: ICLR 2023 Spotlight Latest Updated Version: 2023/12/19

  46. arXiv:2304.09154  [pdf, other

    stat.ME math.ST stat.ML

    Sharp-SSL: Selective high-dimensional axis-aligned random projections for semi-supervised learning

    Authors: Tengyao Wang, Edgar Dobriban, Milana Gataric, Richard J. Samworth

    Abstract: We propose a new method for high-dimensional semi-supervised learning problems based on the careful aggregation of the results of a low-dimensional procedure applied to many axis-aligned random projections of the data. Our primary goal is to identify important variables for distinguishing between the classes; existing low-dimensional methods can then be applied for final class assignment. Motivate… ▽ More

    Submitted 18 April, 2023; originally announced April 2023.

    Comments: 49 pages, 4 figures

    MSC Class: 62H30

  47. arXiv:2304.04258  [pdf, ps, other

    stat.ML cs.LG

    A Note on "Efficient Task-Specific Data Valuation for Nearest Neighbor Algorithms"

    Authors: Jiachen T. Wang, Ruoxi Jia

    Abstract: Data valuation is a growing research field that studies the influence of individual data points for machine learning (ML) models. Data Shapley, inspired by cooperative game theory and economics, is an effective method for data valuation. However, it is well-known that the Shapley value (SV) can be computationally expensive. Fortunately, Jia et al. (2019) showed that for K-Nearest Neighbors (KNN) m… ▽ More

    Submitted 25 November, 2023; v1 submitted 9 April, 2023; originally announced April 2023.

    Comments: Technical Note

  48. arXiv:2302.14275  [pdf, other

    stat.ME

    Self-normalized score-based tests to detect parameter heterogeneity for mixed models

    Authors: Ting Wang, Edgar Merkle

    Abstract: Score-based tests have been used to study parameter heterogeneity across many types of statistical models. This chapter describes a new self-normalization approach for score-based tests of mixed models, which addresses situations where there is dependence between scores. This differs from the traditional score-based tests, which require independence of scores. We first review traditional score-bas… ▽ More

    Submitted 11 June, 2023; v1 submitted 27 February, 2023; originally announced February 2023.

  49. arXiv:2302.11431  [pdf, ps, other

    stat.ML cs.LG

    A Note on "Towards Efficient Data Valuation Based on the Shapley Value''

    Authors: Jiachen T. Wang, Ruoxi Jia

    Abstract: The Shapley value (SV) has emerged as a promising method for data valuation. However, computing or estimating the SV is often computationally expensive. To overcome this challenge, Jia et al. (2019) propose an advanced SV estimation algorithm called ``Group Testing-based SV estimator'' which achieves favorable asymptotic sample complexity. In this technical note, we present several improvements in… ▽ More

    Submitted 22 February, 2023; originally announced February 2023.

  50. arXiv:2302.07348  [pdf, other

    cs.LG cs.AI stat.ML

    Cliff-Learning

    Authors: Tony T. Wang, Igor Zablotchi, Nir Shavit, Jonathan S. Rosenfeld

    Abstract: We study the data-scaling of transfer learning from foundation models in the low-downstream-data regime. We observe an intriguing phenomenon which we call cliff-learning. Cliff-learning refers to regions of data-scaling laws where performance improves at a faster than power law rate (i.e. regions of concavity on a log-log scaling plot). We conduct an in-depth investigation of foundation-model clif… ▽ More

    Submitted 6 June, 2023; v1 submitted 14 February, 2023; originally announced February 2023.

    Comments: 16 pages; v2 updates: improved layout, added acknowledgements