Skip to main content

Showing 1–50 of 69 results for author: Huang, K

Searching in archive stat. Search in all archives.
.
  1. arXiv:2405.06798  [pdf, other

    stat.CO

    Estimating Value at Risk and Expected Shortfall: A Brief Review and Some New Developments

    Authors: Kanon Kamronnaher, Andrew Bellucco, Whitney K. Huang, Colin M. Gallagher

    Abstract: Value-at-risk (VaR) and expected shortfall (ES) are two commonly utilized metrics for quantifying financial risk. In this study, we review the widely employed Generalized Autoregressive Conditional Heteroskedasticity (GARCH) models. These models are explored with diverse distributional assumptions on innovation, including parametric, non-parametric, and `semi-parametric' that incorporates a parame… ▽ More

    Submitted 10 May, 2024; originally announced May 2024.

    Comments: 54 pages, 21 figures

    MSC Class: 62-08

  2. arXiv:2306.02857  [pdf, other

    stat.AP physics.data-an

    Topological Data Analysis Assisted Automated Sleep Stage Scoring Using Airflow Signals

    Authors: Yu-Min Chung, Whitney K. Huang, Hau-Tieng Wu

    Abstract: Objective: Breathing pattern variability (BPV), as a universal physiological feature, encodes rich health information. We aim to show that, a high-quality automatic sleep stage scoring based on a proper quantification of BPV extracting from the single airflow signal can be achieved. Methods: Topological data analysis (TDA) is applied to characterize BPV from the intrinsically nonstationary airfl… ▽ More

    Submitted 5 June, 2023; originally announced June 2023.

  3. arXiv:2305.14535  [pdf, other

    cs.LG stat.ML

    Uncertainty Quantification over Graph with Conformalized Graph Neural Networks

    Authors: Kexin Huang, Ying Jin, Emmanuel Candès, Jure Leskovec

    Abstract: Graph Neural Networks (GNNs) are powerful machine learning prediction models on graph-structured data. However, GNNs lack rigorous uncertainty estimates, limiting their reliable deployment in settings where the cost of errors is significant. We propose conformalized GNN (CF-GNN), extending conformal prediction (CP) to graph-based models for guaranteed uncertainty estimates. Given an entity in the… ▽ More

    Submitted 30 October, 2023; v1 submitted 23 May, 2023; originally announced May 2023.

    Comments: Published at NeurIPS 2023

  4. arXiv:2305.14067  [pdf, other

    cs.LG stat.ML

    DIVA: A Dirichlet Process Mixtures Based Incremental Deep Clustering Algorithm via Variational Auto-Encoder

    Authors: Zhenshan Bing, Yuan Meng, Yuqi Yun, Hang Su, Xiaojie Su, Kai Huang, Alois Knoll

    Abstract: Generative model-based deep clustering frameworks excel in classifying complex data, but are limited in handling dynamic and complex features because they require prior knowledge of the number of clusters. In this paper, we propose a nonparametric deep clustering framework that employs an infinite mixture of Gaussians as a prior. Our framework utilizes a memoized online variational inference metho… ▽ More

    Submitted 24 November, 2023; v1 submitted 23 May, 2023; originally announced May 2023.

    Comments: static datasets comparision updated

  5. arXiv:2305.04086  [pdf, other

    stat.ML math.OC

    Efficient Learning for Selecting Top-m Context-Dependent Designs

    Authors: Gongbo Zhang, Sihua Chen, Kuihua Huang, Yijie Peng

    Abstract: We consider a simulation optimization problem for a context-dependent decision-making, which aims to determine the top-m designs for all contexts. Under a Bayesian framework, we formulate the optimal dynamic sampling decision as a stochastic dynamic programming problem, and develop a sequential sampling policy to efficiently learn the performance of each design under each context. The asymptotical… ▽ More

    Submitted 9 June, 2023; v1 submitted 6 May, 2023; originally announced May 2023.

  6. arXiv:2304.05223  [pdf, other

    cs.LG cs.SI stat.ML

    Inhomogeneous graph trend filtering via a l2,0 cardinality penalty

    Authors: Xiaoqing Huang, Andersen Ang, Kun Huang, Jie Zhang, Yijie Wang

    Abstract: We study estimation of piecewise smooth signals over a graph. We propose a $\ell_{2,0}$-norm penalized Graph Trend Filtering (GTF) model to estimate piecewise smooth graph signals that exhibit inhomogeneous levels of smoothness across the nodes. We prove that the proposed GTF model is simultaneously a k-means clustering on the signal over the nodes and a minimum graph cut on the edges of the graph… ▽ More

    Submitted 4 June, 2024; v1 submitted 11 April, 2023; originally announced April 2023.

    Comments: 13 pages, 3 figures, 4 tables

    MSC Class: 65F50; 68U01; 68R01 ACM Class: G.1.6; G.1.10

  7. arXiv:2304.05187  [pdf, other

    cs.LG cs.AI cs.NE math.NA stat.ML

    Automatic Gradient Descent: Deep Learning without Hyperparameters

    Authors: Jeremy Bernstein, Chris Mingard, Kevin Huang, Navid Azizan, Yisong Yue

    Abstract: The architecture of a deep neural network is defined explicitly in terms of the number of layers, the width of each layer and the general network topology. Existing optimisation frameworks neglect this information in favour of implicit architectural information (e.g. second-order methods) or architecture-agnostic distance functions (e.g. mirror descent). Meanwhile, the most popular optimiser in pr… ▽ More

    Submitted 11 April, 2023; originally announced April 2023.

  8. arXiv:2302.07194  [pdf, other

    cs.LG stat.ML

    Score Approximation, Estimation and Distribution Recovery of Diffusion Models on Low-Dimensional Data

    Authors: Minshuo Chen, Kaixuan Huang, Tuo Zhao, Mengdi Wang

    Abstract: Diffusion models achieve state-of-the-art performance in various generation tasks. However, their theoretical foundations fall far behind. This paper studies score approximation, estimation, and distribution recovery of diffusion models, when data are supported on an unknown low-dimensional linear subspace. Our result provides sample complexity bounds for distribution estimation using diffusion mo… ▽ More

    Submitted 14 February, 2023; originally announced February 2023.

    Comments: 52 pages, 4 figures

  9. arXiv:2302.05881  [pdf, other

    cs.CV cs.AI cs.LG math.NA stat.ML

    A generalizable framework for low-rank tensor completion with numerical priors

    Authors: Shiran Yuan, Kaizhu Huang

    Abstract: Low-Rank Tensor Completion, a method which exploits the inherent structure of tensors, has been studied extensively as an effective approach to tensor completion. Whilst such methods attained great success, none have systematically considered exploiting the numerical priors of tensor elements. Ignoring numerical priors causes loss of important information regarding the data, and therefore prevents… ▽ More

    Submitted 18 June, 2024; v1 submitted 12 February, 2023; originally announced February 2023.

    Comments: Accepted to Pattern Recognition

  10. arXiv:2302.05686  [pdf, other

    math.ST cs.LG stat.ML

    A High-dimensional Convergence Theorem for U-statistics with Applications to Kernel-based Testing

    Authors: Kevin H. Huang, Xing Liu, Andrew B. Duncan, Axel Gandy

    Abstract: We prove a convergence theorem for U-statistics of degree two, where the data dimension $d$ is allowed to scale with sample size $n$. We find that the limiting distribution of a U-statistic undergoes a phase transition from the non-degenerate Gaussian limit to the degenerate limit, regardless of its degeneracy and depending only on a moment ratio. A surprising consequence is that a non-degenerate… ▽ More

    Submitted 2 July, 2023; v1 submitted 11 February, 2023; originally announced February 2023.

    Comments: COLT camera-ready version

  11. arXiv:2301.12677  [pdf, other

    math.OC cs.LG stat.ML

    Distributed Stochastic Optimization under a General Variance Condition

    Authors: Kun Huang, Xiao Li, Shi Pu

    Abstract: Distributed stochastic optimization has drawn great attention recently due to its effectiveness in solving large-scale machine learning problems. Though numerous algorithms have been proposed and successfully applied to general practical problems, their theoretical guarantees mainly rely on certain boundedness conditions on the stochastic gradients, varying from uniform boundedness to the relaxed… ▽ More

    Submitted 13 December, 2023; v1 submitted 30 January, 2023; originally announced January 2023.

    Comments: 16 pages, 2 figure

  12. arXiv:2210.14843  [pdf, other

    stat.ML cs.AI cs.LG

    TuneUp: A Simple Improved Training Strategy for Graph Neural Networks

    Authors: Weihua Hu, Kaidi Cao, Kexin Huang, Edward W Huang, Karthik Subbian, Kenji Kawaguchi, Jure Leskovec

    Abstract: Despite recent advances in Graph Neural Networks (GNNs), their training strategies remain largely under-explored. The conventional training strategy learns over all nodes in the original graph(s) equally, which can be sub-optimal as certain nodes are often more difficult to learn than others. Here we present TuneUp, a simple curriculum-based training strategy for improving the predictive performan… ▽ More

    Submitted 26 August, 2023; v1 submitted 26 October, 2022; originally announced October 2022.

  13. arXiv:2206.14846  [pdf, other

    cs.LG cs.SI stat.ML

    Provably Efficient Reinforcement Learning for Online Adaptive Influence Maximization

    Authors: Kaixuan Huang, Yu Wu, Xuezhou Zhang, Shenyinying Tu, Qingyun Wu, Mengdi Wang, Huazheng Wang

    Abstract: Online influence maximization aims to maximize the influence spread of a content in a social network with unknown network model by selecting a few seed nodes. Recent studies followed a non-adaptive setting, where the seed nodes are selected before the start of the diffusion process and network parameters are updated when the diffusion stops. We consider an adaptive version of content-dependent onl… ▽ More

    Submitted 29 June, 2022; originally announced June 2022.

  14. arXiv:2204.01960  [pdf, other

    cs.CV cs.AI stat.ML

    FaceSigns: Semi-Fragile Neural Watermarks for Media Authentication and Countering Deepfakes

    Authors: Paarth Neekhara, Shehzeen Hussain, Xinqiao Zhang, Ke Huang, Julian McAuley, Farinaz Koushanfar

    Abstract: Deepfakes and manipulated media are becoming a prominent threat due to the recent advances in realistic image and video synthesis techniques. There have been several attempts at combating Deepfakes using machine learning classifiers. However, such classifiers do not generalize well to black-box image synthesis techniques and have been shown to be vulnerable to adversarial examples. To address thes… ▽ More

    Submitted 4 April, 2022; originally announced April 2022.

    Comments: 13 pages, 8 figures

  15. arXiv:2202.09134  [pdf, other

    cs.LG math.ST stat.ML

    Data Augmentation in the Underparameterized and Overparameterized Regimes

    Authors: Kevin Han Huang, Peter Orbanz, Morgane Austern

    Abstract: We provide results that exactly quantify how data augmentation affects the variance and limiting distribution of estimates, and analyze several specific models in detail. The results confirm some observations made in machine learning practice, but also lead to unexpected findings: Data augmentation may increase rather than decrease the uncertainty of estimates, such as the empirical prediction ris… ▽ More

    Submitted 28 September, 2023; v1 submitted 18 February, 2022; originally announced February 2022.

    Comments: Changed title and added an analysis on the effect of augmentations on the double-descent risk curve of a high-dimensional ridgeless estimator

  16. arXiv:2202.00071  [pdf, other

    cs.LG cs.IR stat.ML

    JULIA: Joint Multi-linear and Nonlinear Identification for Tensor Completion

    Authors: Cheng Qian, Kejun Huang, Lucas Glass, Rakshith S. Srinivasa, Jimeng Sun

    Abstract: Tensor completion aims at imputing missing entries from a partially observed tensor. Existing tensor completion methods often assume either multi-linear or nonlinear relationships between latent components. However, real-world tensors have much more complex patterns where both multi-linear and nonlinear relationships may coexist. In such cases, the existing methods are insufficient to describe t… ▽ More

    Submitted 31 January, 2022; originally announced February 2022.

  17. arXiv:2112.07746  [pdf, other

    cs.LG eess.SY math.OC stat.ML

    CEM-GD: Cross-Entropy Method with Gradient Descent Planner for Model-Based Reinforcement Learning

    Authors: Kevin Huang, Sahin Lale, Ugo Rosolia, Yuanyuan Shi, Anima Anandkumar

    Abstract: Current state-of-the-art model-based reinforcement learning algorithms use trajectory sampling methods, such as the Cross-Entropy Method (CEM), for planning in continuous control settings. These zeroth-order optimizers require sampling a large number of trajectory rollouts to select an optimal action, which scales poorly for large prediction horizons or high dimensional action spaces. First-order… ▽ More

    Submitted 14 December, 2021; originally announced December 2021.

  18. arXiv:2107.06466  [pdf, other

    cs.LG stat.ML

    Going Beyond Linear RL: Sample Efficient Neural Function Approximation

    Authors: Baihe Huang, Kaixuan Huang, Sham M. Kakade, Jason D. Lee, Qi Lei, Runzhe Wang, Jiaqi Yang

    Abstract: Deep Reinforcement Learning (RL) powered by neural net approximation of the Q function has had enormous empirical success. While the theory of RL has traditionally focused on linear function approximation (or eluder dimension) approaches, little is known about nonlinear RL with neural net approximations of the Q functions. This is the focus of this work, where we study function approximation with… ▽ More

    Submitted 25 December, 2021; v1 submitted 13 July, 2021; originally announced July 2021.

  19. arXiv:2107.04518  [pdf, ps, other

    cs.LG stat.ML

    Optimal Gradient-based Algorithms for Non-concave Bandit Optimization

    Authors: Baihe Huang, Kaixuan Huang, Sham M. Kakade, Jason D. Lee, Qi Lei, Runzhe Wang, Jiaqi Yang

    Abstract: Bandit problems with linear or concave reward have been extensively studied, but relatively few works have studied bandits with non-concave reward. This work considers a large family of bandit problems where the unknown underlying reward function is non-concave, including the low-rank generalized linear bandit problems and two-layer neural network with polynomial activation bandit problem. For the… ▽ More

    Submitted 9 July, 2021; originally announced July 2021.

  20. arXiv:2107.02377  [pdf, ps, other

    cs.LG cs.AI math.OC stat.ML

    A Short Note on the Relationship of Information Gain and Eluder Dimension

    Authors: Kaixuan Huang, Sham M. Kakade, Jason D. Lee, Qi Lei

    Abstract: Eluder dimension and information gain are two widely used methods of complexity measures in bandit and reinforcement learning. Eluder dimension was originally proposed as a general complexity measure of function classes, but the common examples of where it is known to be small are function spaces (vector spaces). In these cases, the primary tool to upper bound the eluder dimension is the elliptic… ▽ More

    Submitted 6 July, 2021; originally announced July 2021.

  21. FastAdaBelief: Improving Convergence Rate for Belief-based Adaptive Optimizers by Exploiting Strong Convexity

    Authors: Yangfan Zhou, Kaizhu Huang, Cheng Cheng, Xuguang Wang, Amir Hussain, Xin Liu

    Abstract: AdaBelief, one of the current best optimizers, demonstrates superior generalization ability compared to the popular Adam algorithm by viewing the exponential moving average of observed gradients. AdaBelief is theoretically appealing in that it has a data-dependent $O(\sqrt{T})$ regret bound when objective functions are convex, where $T$ is a time horizon. It remains however an open problem whether… ▽ More

    Submitted 25 May, 2022; v1 submitted 28 April, 2021; originally announced April 2021.

  22. arXiv:2009.11098  [pdf, other

    stat.ME

    Modeling short-ranged dependence in block extrema with application to polar temperature data

    Authors: Brook T. Russell, Whitney K. Huang

    Abstract: The block maxima approach is an important method in univariate extreme value analysis. While assuming that block maxima are independent results in straightforward analysis, the resulting inferences maybe invalid when a series of block maxima exhibits dependence. We propose a model, based on a first-order Markov assumption, that incorporates dependence between successive block maxima through the us… ▽ More

    Submitted 23 September, 2020; originally announced September 2020.

    Comments: 40 pages, 8 figures, and 9 tables

    MSC Class: 62G32

  23. arXiv:2008.11721  [pdf, other

    cs.HC cs.AI cs.LG stat.ML

    How Useful Are the Machine-Generated Interpretations to General Users? A Human Evaluation on Guessing the Incorrectly Predicted Labels

    Authors: Hua Shen, Ting-Hao Kenneth Huang

    Abstract: Explaining to users why automated systems make certain mistakes is important and challenging. Researchers have proposed ways to automatically produce interpretations for deep neural network models. However, it is unclear how useful these interpretations are in helping users figure out why they are getting an error. If an interpretation effectively explains to users how the underlying deep neural n… ▽ More

    Submitted 27 August, 2020; v1 submitted 26 August, 2020; originally announced August 2020.

    Comments: Accepted by The 8th AAAI Conference on Human Computation and Crowdsourcing (HCOMP 2020) https://github.com/huashen218/GuessWrongLabel

  24. arXiv:2008.04473  [pdf, other

    stat.ML cs.LG eess.SP

    Airflow recovery from thoracic and abdominal movements using Synchrosqueezing Transform and Locally Stationary Gaussian Process Regression

    Authors: Whitney K. Huang, Yu-Min Chung, Yu-Bo Wang, Jeff E. Mandel, Hau-Tieng Wu

    Abstract: Airflow signal encodes rich information about respiratory system. While the gold standard for measuring airflow is to use a spirometer with an occlusive seal, this is not practical for ambulatory monitoring of patients. Advances in sensor technology have made measurement of motion of the thorax and abdomen feasible with small inexpensive devices, but estimation of airflow from these time series is… ▽ More

    Submitted 10 August, 2020; originally announced August 2020.

  25. arXiv:2008.03776  [pdf, other

    q-bio.QM cs.LG q-bio.GN stat.ML

    Low-Rank Reorganization via Proportional Hazards Non-negative Matrix Factorization Unveils Survival Associated Gene Clusters

    Authors: Zhi Huang, Paul Salama, Wei Shao, Jie Zhang, Kun Huang

    Abstract: One of the central goals in precision health is the understanding and interpretation of high-dimensional biological data to identify genes and markers associated with disease initiation, development, and outcomes. Though significant effort has been committed to harness gene expression data for multiple analyses while accounting for time-to-event modeling by including survival times, many tradition… ▽ More

    Submitted 17 September, 2020; v1 submitted 9 August, 2020; originally announced August 2020.

  26. arXiv:2006.08720  [pdf, other

    stat.AP

    Estimating Concurrent Climate Extremes: A Conditional Approach

    Authors: Whitney K. Huang, Adam H. Monahan, Francis W. Zwiers

    Abstract: Simultaneous concurrence of extreme values across multiple climate variables can result in large societal and environmental impacts. Therefore, there is growing interest in understanding these concurrent extremes. In many applications, not only the frequency but also the magnitude of concurrent extremes are of interest. One way to approach this problem is to study the distribution of one climate v… ▽ More

    Submitted 12 March, 2021; v1 submitted 15 June, 2020; originally announced June 2020.

    Comments: 39 pages, 18 figures, 1 table

    MSC Class: 62G32; 62H10; 62P12

  27. arXiv:2006.07889  [pdf, other

    cs.LG stat.ML

    Graph Meta Learning via Local Subgraphs

    Authors: Kexin Huang, Marinka Zitnik

    Abstract: Prevailing methods for graphs require abundant label and edge information for learning. When data for a new task are scarce, meta-learning can learn from prior experiences and form much-needed inductive biases for fast adaption to new tasks. Here, we introduce G-Meta, a novel meta-learning algorithm for graphs. G-Meta uses local subgraphs to transfer subgraph-specific information and learn transfe… ▽ More

    Submitted 8 January, 2021; v1 submitted 14 June, 2020; originally announced June 2020.

    Comments: NeurIPS 2020

  28. arXiv:2005.00718  [pdf, other

    cs.LG stat.ML

    Large-scale Uncertainty Estimation and Its Application in Revenue Forecast of SMEs

    Authors: Zebang Zhang, Kui Zhao, Kai Huang, Quanhui Jia, Yanming Fang, Quan Yu

    Abstract: The economic and banking importance of the small and medium enterprise (SME) sector is well recognized in contemporary society. Business credit loans are very important for the operation of SMEs, and the revenue is a key indicator of credit limit management. Therefore, it is very beneficial to construct a reliable revenue forecasting model. If the uncertainty of an enterprise's revenue forecasting… ▽ More

    Submitted 2 May, 2020; originally announced May 2020.

  29. arXiv:2004.13344  [pdf, ps, other

    cs.LG stat.ML

    Robust Generative Adversarial Network

    Authors: Shufei Zhang, Zhuang Qian, Kaizhu Huang, Jimin Xiao, Yuan He

    Abstract: Generative adversarial networks (GANs) are powerful generative models, but usually suffer from instability and generalization problem which may lead to poor generations. Most existing works focus on stabilizing the training of the discriminator while ignoring the generalization properties. In this work, we aim to improve the generalization capability of GANs by promoting the local robustness withi… ▽ More

    Submitted 28 April, 2020; originally announced April 2020.

    Comments: This paper has been submitted to ICLR in Sep 25. 2019

  30. arXiv:2004.08919  [pdf, other

    cs.LG q-bio.QM stat.ML

    DeepPurpose: a Deep Learning Library for Drug-Target Interaction Prediction

    Authors: Kexin Huang, Tianfan Fu, Lucas Glass, Marinka Zitnik, Cao Xiao, Jimeng Sun

    Abstract: Accurate prediction of drug-target interactions (DTI) is crucial for drug discovery. Recently, deep learning (DL) models for show promising performance for DTI prediction. However, these models can be difficult to use for both computer scientists entering the biomedical field and bioinformaticians with limited DL experience. We present DeepPurpose, a comprehensive and easy-to-use deep learning lib… ▽ More

    Submitted 9 December, 2020; v1 submitted 19 April, 2020; originally announced April 2020.

    Comments: Published in Bioinformatics (2020)

  31. arXiv:2002.08675  [pdf, other

    cs.LG cs.CV stat.ML

    Unsupervised Domain Adaptation via Discriminative Manifold Embedding and Alignment

    Authors: You-Wei Luo, Chuan-Xian Ren, Pengfei Ge, Ke-Kun Huang, Yu-Feng Yu

    Abstract: Unsupervised domain adaptation is effective in leveraging the rich information from the source domain to the unsupervised target domain. Though deep learning and adversarial strategy make an important breakthrough in the adaptability of features, there are two issues to be further explored. First, the hard-assigned pseudo labels on the target domain are risky to the intrinsic data structure. Secon… ▽ More

    Submitted 28 February, 2020; v1 submitted 20 February, 2020; originally announced February 2020.

    Comments: Accepted to AAAI 2020. Code available: \<https://github.com/LavieLuo/DRMEA>

  32. arXiv:2002.06262  [pdf, other

    cs.LG stat.ML

    Why Do Deep Residual Networks Generalize Better than Deep Feedforward Networks? -- A Neural Tangent Kernel Perspective

    Authors: Kaixuan Huang, Yuqing Wang, Molei Tao, Tuo Zhao

    Abstract: Deep residual networks (ResNets) have demonstrated better generalization performance than deep feedforward networks (FFNets). However, the theory behind such a phenomenon is still largely unknown. This paper studies this fundamental problem in deep learning from a so-called "neural tangent kernel" perspective. Specifically, we first show that under proper conditions, as the width goes to infinity,… ▽ More

    Submitted 22 December, 2020; v1 submitted 14 February, 2020; originally announced February 2020.

    Comments: Accepted in NeurIPS 2020

  33. arXiv:1912.05122  [pdf, other

    cs.LG stat.ML

    Towards Better Forecasting by Fusing Near and Distant Future Visions

    Authors: Jiezhu Cheng, Kaizhu Huang, Zibin Zheng

    Abstract: Multivariate time series forecasting is an important yet challenging problem in machine learning. Most existing approaches only forecast the series value of one future moment, ignoring the interactions between predictions of future moments with different temporal distance. Such a deficiency probably prevents the model from getting enough information about the future, thus limiting the forecasting… ▽ More

    Submitted 11 December, 2019; originally announced December 2019.

    Comments: Accepted by AAAI 2020

  34. arXiv:1911.08723  [pdf, other

    cs.LG stat.ML

    Deep Minimax Probability Machine

    Authors: Lirong He, Ziyi Guo, Kaizhu Huang, Zenglin Xu

    Abstract: Deep neural networks enjoy a powerful representation and have proven effective in a number of applications. However, recent advances show that deep neural networks are vulnerable to adversarial attacks incurred by the so-called adversarial examples. Although the adversarial example is only slightly different from the input sample, the neural network classifies it as the wrong class. In order to al… ▽ More

    Submitted 20 November, 2019; originally announced November 2019.

  35. arXiv:1911.06479   

    cs.LG cs.CR cs.CV stat.ML

    On Model Robustness Against Adversarial Examples

    Authors: Shufei Zhang, Kaizhu Huang, Zenglin Xu

    Abstract: We study the model robustness against adversarial examples, referred to as small perturbed input data that may however fool many state-of-the-art deep learning models. Unlike previous research, we establish a novel theory addressing the robustness issue from the perspective of stability of the loss function in the small neighborhood of natural examples. We propose to exploit an energy function to… ▽ More

    Submitted 10 June, 2020; v1 submitted 15 November, 2019; originally announced November 2019.

    Comments: some theoretical bounds need to be revised

  36. arXiv:1911.06446  [pdf, other

    cs.LG q-bio.QM stat.ML

    CASTER: Predicting Drug Interactions with Chemical Substructure Representation

    Authors: Kexin Huang, Cao Xiao, Trong Nghia Hoang, Lucas M. Glass, Jimeng Sun

    Abstract: Adverse drug-drug interactions (DDIs) remain a leading cause of morbidity and mortality. Identifying potential DDIs during the drug design process is critical for patients and society. Although several computational models have been proposed for DDI prediction, there are still limitations: (1) specialized design of drug representation for DDI predictions is lacking; (2) predictions are based on li… ▽ More

    Submitted 19 November, 2019; v1 submitted 14 November, 2019; originally announced November 2019.

    Comments: Accepted by AAAI 2020

  37. arXiv:1909.12325  [pdf, other

    cs.LG stat.ML

    Crowdsourcing via Pairwise Co-occurrences: Identifiability and Algorithms

    Authors: Shahana Ibrahim, Xiao Fu, Nikos Kargas, Kejun Huang

    Abstract: The data deluge comes with high demands for data labeling. Crowdsourcing (or, more generally, ensemble learning) techniques aim to produce accurate labels via integrating noisy, non-expert labeling from annotators. The classic Dawid-Skene estimator and its accompanying expectation maximization (EM) algorithm have been widely used, but the theoretical properties are not fully understood. Tensor met… ▽ More

    Submitted 26 September, 2019; originally announced September 2019.

    Comments: 28 pages, 5 figures, to appear in 33rd NeurIPS conference, Vancouver, Canada

  38. arXiv:1907.04450  [pdf, ps, other

    math.OC cs.CC stat.ML

    SNAP: Finding Approximate Second-Order Stationary Solutions Efficiently for Non-convex Linearly Constrained Problems

    Authors: Songtao Lu, Meisam Razaviyayn, Bo Yang, Kejun Huang, Mingyi Hong

    Abstract: This paper proposes low-complexity algorithms for finding approximate second-order stationary points (SOSPs) of problems with smooth non-convex objective and linear constraints. While finding (approximate) SOSPs is computationally intractable, we first show that generic instances of the problem can be solved efficiently. More specifically, for a generic problem instance, certain strict complementa… ▽ More

    Submitted 9 July, 2019; originally announced July 2019.

  39. arXiv:1907.02189  [pdf, other

    stat.ML cs.LG math.OC

    On the Convergence of FedAvg on Non-IID Data

    Authors: Xiang Li, Kaixuan Huang, Wenhao Yang, Shusen Wang, Zhihua Zhang

    Abstract: Federated learning enables a large amount of edge computing devices to jointly learn a model without data sharing. As a leading algorithm in this setting, Federated Averaging (\texttt{FedAvg}) runs Stochastic Gradient Descent (SGD) in parallel on a small subset of the total devices and averages the sequences only once in a while. Despite its simplicity, it lacks theoretical guarantees under realis… ▽ More

    Submitted 25 June, 2020; v1 submitted 3 July, 2019; originally announced July 2019.

    Comments: 2020 International Conference on Learning Representations

  40. arXiv:1901.08169  [pdf, other

    stat.ME

    New Exploratory Tools for Extremal Dependence: Chi Networks and Annual Extremal Networks

    Authors: Whitney K. Huang, Daniel S. Cooley, Imme Ebert-Uphoff, Chen Chen, Snigdhansu Chatterjee

    Abstract: Understanding dependence structure among extreme values plays an important role in risk assessment in environmental studies. In this work we propose the $χ$ network and the annual extremal network for exploring the extremal dependence structure of environmental processes. A $χ$ network is constructed by connecting pairs whose estimated upper tail dependence coefficient, $\hat χ$, exceeds a prescri… ▽ More

    Submitted 23 January, 2019; originally announced January 2019.

    Comments: 26 pages, 7 figures, 1 table

    MSC Class: 62

  41. arXiv:1901.01860  [pdf, other

    cs.LG stat.ML

    JECL: Joint Embedding and Cluster Learning for Image-Text Pairs

    Authors: Sean T. Yang, Kuan-Hao Huang, Bill Howe

    Abstract: We propose JECL, a method for clustering image-caption pairs by training parallel encoders with regularized clustering and alignment objectives, simultaneously learning both representations and cluster assignments. These image-caption pairs arise frequently in high-value applications where structured training data is expensive to produce, but free-text descriptions are common. JECL trains by minim… ▽ More

    Submitted 16 October, 2020; v1 submitted 4 January, 2019; originally announced January 2019.

    Comments: ICPR2020

  42. Learning Nonlinear Mixtures: Identifiability and Algorithm

    Authors: Bo Yang, Xiao Fu, Nicholas D. Sidiropoulos, Kejun Huang

    Abstract: Linear mixture models have proven very useful in a plethora of applications, e.g., topic modeling, clustering, and source separation. As a critical aspect of the linear mixture models, identifiability of the model parameters is well-studied, under frameworks such as independent component analysis and constrained matrix factorization. Nevertheless, when the linear mixtures are distorted by an unkno… ▽ More

    Submitted 6 January, 2019; originally announced January 2019.

    Comments: 15 pages

  43. arXiv:1901.00032  [pdf, other

    cond-mat.mtrl-sci cs.AI stat.ML

    Inorganic Materials Synthesis Planning with Literature-Trained Neural Networks

    Authors: Edward Kim, Zach Jensen, Alexander van Grootel, Kevin Huang, Matthew Staib, Sheshera Mysore, Haw-Shiuan Chang, Emma Strubell, Andrew McCallum, Stefanie Jegelka, Elsa Olivetti

    Abstract: Leveraging new data sources is a key step in accelerating the pace of materials design and discovery. To complement the strides in synthesis planning driven by historical, experimental, and computed data, we present an automated method for connecting scientific literature to synthesis insights. Starting from natural language text, we apply word embeddings from language models, which are fed into a… ▽ More

    Submitted 17 February, 2019; v1 submitted 31 December, 2018; originally announced January 2019.

    Comments: Added new funding support to the acknowledgments section in this version

  44. Pre-Defined Sparse Neural Networks with Hardware Acceleration

    Authors: Sourya Dey, Kuan-Wen Huang, Peter A. Beerel, Keith M. Chugg

    Abstract: Neural networks have proven to be extremely powerful tools for modern artificial intelligence applications, but computational and storage complexity remain limiting factors. This paper presents two compatible contributions towards reducing the time, energy, computational, and storage complexities associated with multilayer perceptrons. Pre-defined sparsity is proposed to reduce the complexity duri… ▽ More

    Submitted 3 December, 2018; originally announced December 2018.

    Comments: This work has been submitted to the IEEE Journal on Emerging and Selected Topics in Circuits and Systems for possible publication. Copyright may be transferred without notice, after which this version may no longer be accessible

  45. arXiv:1809.09143  [pdf, other

    cs.LG q-bio.QM stat.ML

    EpiRL: A Reinforcement Learning Agent to Facilitate Epistasis Detection

    Authors: Kexin Huang, Rodrigo Nogueira

    Abstract: Epistasis (gene-gene interaction) is crucial to predicting genetic disease. Our work tackles the computational challenges faced by previous works in epistasis detection by modeling it as a one-step Markov Decision Process where the state is genome data, the actions are the interacted genes, and the reward is an interaction measurement for the selected actions. A reinforcement learning agent using… ▽ More

    Submitted 24 September, 2018; originally announced September 2018.

  46. arXiv:1808.02229  [pdf, other

    cs.LG cs.CV cs.IT eess.SP stat.ML

    Grassmannian Learning: Embedding Geometry Awareness in Shallow and Deep Learning

    Authors: Jiayao Zhang, Guangxu Zhu, Robert W. Heath Jr., Kaibin Huang

    Abstract: Modern machine learning algorithms have been adopted in a range of signal-processing applications spanning computer vision, natural language processing, and artificial intelligence. Many relevant problems involve subspace-structured features, orthogonality constrained or low-rank constrained objective functions, or subspace distances. These mathematical characteristics are expressed naturally usin… ▽ More

    Submitted 12 August, 2018; v1 submitted 7 August, 2018; originally announced August 2018.

    Comments: Submitted to IEEE Signal Processing Magazine

  47. arXiv:1807.05832  [pdf, ps, other

    cs.LG stat.ML

    Manifold Adversarial Learning

    Authors: Shufei Zhang, Kaizhu Huang, Jianke Zhu, Yang Liu

    Abstract: Recently proposed adversarial training methods show the robustness to both adversarial and original examples and achieve state-of-the-art results in supervised and semi-supervised learning. All the existing adversarial training methods consider only how the worst perturbed examples (i.e., adversarial examples) could affect the model output. Despite their success, we argue that such setting may be… ▽ More

    Submitted 14 November, 2019; v1 submitted 16 July, 2018; originally announced July 2018.

    Comments: 11 pages, 26 figures

  48. arXiv:1805.08311  [pdf, other

    cs.LG cs.AI cs.CV cs.NE stat.ML

    AgileNet: Lightweight Dictionary-based Few-shot Learning

    Authors: Mohammad Ghasemzadeh, Fang Lin, Bita Darvish Rouhani, Farinaz Koushanfar, Ke Huang

    Abstract: The success of deep learning models is heavily tied to the use of massive amount of labeled data and excessively long training time. With the emergence of intelligent edge applications that use these models, the critical challenge is to obtain the same inference capability on a resource-constrained device while providing adaptability to cope with the dynamic changes in the data. We propose AgileNe… ▽ More

    Submitted 21 May, 2018; originally announced May 2018.

    Comments: 10 Pages

  49. arXiv:1803.01257  [pdf, other

    eess.SP cs.LG stat.ML

    Nonnegative Matrix Factorization for Signal and Data Analytics: Identifiability, Algorithms, and Applications

    Authors: Xiao Fu, Kejun Huang, Nicholas D. Sidiropoulos, Wing-Kin Ma

    Abstract: Nonnegative matrix factorization (NMF) has become a workhorse for signal and data analytics, triggered by its model parsimony and interpretability. Perhaps a bit surprisingly, the understanding to its model identifiability---the major reason behind the interpretability in many applications such as topic mining and hyperspectral imaging---had been rather limited until recent years. Beginning from t… ▽ More

    Submitted 16 November, 2018; v1 submitted 3 March, 2018; originally announced March 2018.

    Comments: accepted version, IEEE Signal Processing Magazine; supplementary materials added. Some minor revisions implemented

  50. arXiv:1802.09387  [pdf, other

    stat.ME

    Estimating Precipitation Extremes using Log-Histospline

    Authors: Whitney K. Huang, Douglas W. Nychka, Hao Zhang

    Abstract: One of the commonly used approaches to modeling extremes is the peaks-over-threshold (POT) method. The POT method models exceedances over a threshold that is sufficiently high or low so that the exceedance has approximately a generalized Pareto distribution (GPD). This method requires the selection of a threshold that might affect the estimates. Here we propose an alternative method, the Log-Histo… ▽ More

    Submitted 4 October, 2018; v1 submitted 26 February, 2018; originally announced February 2018.

    Comments: 32 pages, 13 figures, 2 table

    MSC Class: 62