Showing 1–18 of 18 results for author: Salim, A

Searching in archive stat.
  1. arXiv:2311.12825  [pdf, ps, other]

    cs.AI cs.LG stat.ME

    A PSO Based Method to Generate Actionable Counterfactuals for High Dimensional Data

    Authors: Shashank Shekhar, Asif Salim, Adesh Bansode, Vivaswan Jinturkar, Anirudha Nayak

    Abstract: Counterfactual explanations (CFE) are methods that explain a machine learning model by giving an alternate class prediction of a data point with some minimal changes in its features. It helps the users to identify their data attributes that caused an undesirable prediction like a loan or credit card rejection. We describe an efficient and an actionable counterfactual (CF) generation method based o…

    Submitted 30 November, 2023; v1 submitted 30 September, 2023; originally announced November 2023.

    Comments: Accepted in IEEE CSDE 2023

  2. arXiv:2306.16308  [pdf, other]

    math.PR cs.LG math.ST stat.ML

    Gaussian random field approximation via Stein's method with applications to wide random neural networks

    Authors: Krishnakumar Balasubramanian, Larry Goldstein, Nathan Ross, Adil Salim

    Abstract: We derive upper bounds on the Wasserstein distance ($W_1$), with respect to $\sup$-norm, between any continuous $\mathbb{R}^d$ valued random field indexed by the $n$-sphere and the Gaussian, based on Stein's method. We develop a novel Gaussian smoothing technique that allows us to transfer a bound in a smoother metric to the $W_1$ distance. The smoothing is based on covariance functions constructe…

    Submitted 30 April, 2024; v1 submitted 28 June, 2023; originally announced June 2023.

    Comments: To appear in Applied and Computational Harmonic Analysis

  3. arXiv:2305.11798  [pdf, ps, other]

    cs.LG math.ST stat.ML

    The probability flow ODE is provably fast

    Authors: Sitan Chen, Sinho Chewi, Holden Lee, Yuanzhi Li, Jianfeng Lu, Adil Salim

    Abstract: We provide the first polynomial-time convergence guarantees for the probability flow ODE implementation (together with a corrector step) of score-based generative modeling. Our analysis is carried out in the wake of recent results obtaining such guarantees for the SDE-based implementation (i.e., denoising diffusion probabilistic modeling or DDPM), but requires the development of novel techniques f…

    Submitted 19 May, 2023; originally announced May 2023.

    Comments: 23 pages, 2 figures

  4. arXiv:2203.12859  [pdf, other]

    stat.ME

    Making SMART decisions in prophylaxis and treatment studies

    Authors: Robert K. Mahar, Katherine J. Lee, Bibhas Chakraborty, Agus Salim, Julie A. Simpson

    Abstract: The optimal prophylaxis, and treatment if the prophylaxis fails, for a disease may be best evaluated using a sequential multiple assignment randomised trial (SMART). A SMART is a multi-stage study that randomises a participant to an initial treatment, observes some response to that treatment and then, depending on their observed response, randomises the same participant to an alternative treatment…

    Submitted 24 March, 2022; originally announced March 2022.

  5. arXiv:2202.06386  [pdf, ps, other]

    math.ST stat.ML

    Improved analysis for a proximal algorithm for sampling

    Authors: Yongxin Chen, Sinho Chewi, Adil Salim, Andre Wibisono

    Abstract: We study the proximal sampler of Lee, Shen, and Tian (2021) and obtain new convergence guarantees under weaker assumptions than strong log-concavity: namely, our results hold for (1) weakly log-concave targets, and (2) targets satisfying isoperimetric assumptions which allow for non-log-concavity. We demonstrate our results by obtaining new state-of-the-art sampling guarantees for several classes…

    Submitted 13 February, 2022; originally announced February 2022.

    Comments: 34 pages

  6. arXiv:2202.05214  [pdf, other]

    math.ST stat.ML

    Towards a Theory of Non-Log-Concave Sampling: First-Order Stationarity Guarantees for Langevin Monte Carlo

    Authors: Krishnakumar Balasubramanian, Sinho Chewi, Murat A. Erdogdu, Adil Salim, Matthew Zhang

    Abstract: For the task of sampling from a density $\pi \propto \exp(-V)$ on $\mathbb{R}^d$, where $V$ is possibly non-convex but $L$-gradient Lipschitz, we prove that averaged Langevin Monte Carlo outputs a sample with $\varepsilon$-relative Fisher information after $O(L^2 d^2/\varepsilon^2)$ iterations. This is the sampling analogue of complexity bounds for finding an $\varepsilon$-approximate first-order st…

    Submitted 10 February, 2022; originally announced February 2022.

  7. arXiv:2009.13801  [pdf, other]

    cs.LG stat.ML

    Framework for Designing Filters of Spectral Graph Convolutional Neural Networks in the Context of Regularization Theory

    Authors: Asif Salim, Sumitra S

    Abstract: Graph convolutional neural networks (GCNNs) have been widely used in graph learning. It has been observed that the smoothness functional on graphs can be defined in terms of the graph Laplacian. This fact points out in the direction of using Laplacian in deriving regularization operators on graphs and its consequent use with spectral GCNN filter designs. In this work, we explore the regularization…

    Submitted 29 September, 2020; originally announced September 2020.

  8. arXiv:2006.11773  [pdf, other]

    math.OC stat.ML

    Optimal and Practical Algorithms for Smooth and Strongly Convex Decentralized Optimization

    Authors: Dmitry Kovalev, Adil Salim, Peter Richtárik

    Abstract: We consider the task of decentralized minimization of the sum of smooth strongly convex functions stored across the nodes of a network. For this problem, lower bounds on the number of gradient computations and the number of communication rounds required to achieve $\varepsilon$ accuracy have recently been proven. We propose two new algorithms for this decentralized optimization problem and equip t…

    Submitted 13 November, 2020; v1 submitted 21 June, 2020; originally announced June 2020.

  9. arXiv:2006.09797  [pdf, other]

    stat.ML cs.LG

    A Non-Asymptotic Analysis for Stein Variational Gradient Descent

    Authors: Anna Korba, Adil Salim, Michael Arbel, Giulia Luise, Arthur Gretton

    Abstract: We study the Stein Variational Gradient Descent (SVGD) algorithm, which optimises a set of particles to approximate a target probability distribution $\pi \propto e^{-V}$ on $\mathbb{R}^d$. In the population limit, SVGD performs gradient descent in the space of probability distributions on the KL divergence with respect to $\pi$, where the gradient is smoothed through a kernel integral operator. In thi…

    Submitted 3 January, 2021; v1 submitted 17 June, 2020; originally announced June 2020.

    Comments: Accepted to NeurIPS 2020

  10. arXiv:2006.09270  [pdf, other]

    stat.ML cs.LG math.OC

    Primal Dual Interpretation of the Proximal Stochastic Gradient Langevin Algorithm

    Authors: Adil Salim, Peter Richtárik

    Abstract: We consider the task of sampling with respect to a log concave probability distribution. The potential of the target distribution is assumed to be composite, i.e., written as the sum of a smooth convex term, and a nonsmooth convex term possibly taking infinite values. The target distribution can be seen as a minimizer of the Kullback-Leibler divergence defined on the Wasserstein space (\t…

    Submitted 22 February, 2021; v1 submitted 16 June, 2020; originally announced June 2020.

  11. arXiv:2004.02635  [pdf, other]

    math.OC cs.LG stat.ML

    Dualize, Split, Randomize: Toward Fast Nonsmooth Optimization Algorithms

    Authors: Adil Salim, Laurent Condat, Konstantin Mishchenko, Peter Richtárik

    Abstract: We consider minimizing the sum of three convex functions, where the first one F is smooth, the second one is nonsmooth and proximable and the third one is the composition of a nonsmooth proximable function with a linear operator L. This template problem has many applications, for instance, in image processing and machine learning. First, we propose a new primal-dual algorithm, which we call PDDY,…

    Submitted 26 July, 2022; v1 submitted 3 April, 2020; originally announced April 2020.

  12. arXiv:2002.03035  [pdf, other]

    math.OC stat.ML

    The Wasserstein Proximal Gradient Algorithm

    Authors: Adil Salim, Anna Korba, Giulia Luise

    Abstract: Wasserstein gradient flows are continuous time dynamics that define curves of steepest descent to minimize an objective function over the space of probability measures (i.e., the Wasserstein space). This objective is typically a divergence w.r.t. a fixed target distribution. In recent years, these continuous time dynamics have been used to study the convergence of machine learning algorithms aimin…

    Submitted 21 February, 2021; v1 submitted 7 February, 2020; originally announced February 2020.

  13. arXiv:1906.04370  [pdf, other]

    stat.ML cs.LG

    Maximum Mean Discrepancy Gradient Flow

    Authors: Michael Arbel, Anna Korba, Adil Salim, Arthur Gretton

    Abstract: We construct a Wasserstein gradient flow of the maximum mean discrepancy (MMD) and study its convergence properties. The MMD is an integral probability metric defined for a reproducing kernel Hilbert space (RKHS), and serves as a metric on probability measures for a sufficiently rich RKHS. We obtain conditions for convergence of the gradient flow towards a global optimum, that can be related to…

    Submitted 3 December, 2019; v1 submitted 10 June, 2019; originally announced June 2019.

  14. arXiv:1905.11768  [pdf, other]

    stat.ML cs.LG math.OC math.ST

    Stochastic Proximal Langevin Algorithm: Potential Splitting and Nonasymptotic Rates

    Authors: Adil Salim, Dmitry Kovalev, Peter Richtárik

    Abstract: We propose a new algorithm, the Stochastic Proximal Langevin Algorithm (SPLA), for sampling from a log concave distribution. Our method is a generalization of the Langevin algorithm to potentials expressed as the sum of one stochastic smooth term and multiple stochastic nonsmooth terms. In each iteration, our splitting technique only requires access to a stochastic gradient of the smooth term and a…

    Submitted 16 June, 2020; v1 submitted 28 May, 2019; originally announced May 2019.

    Journal ref: NeurIPS 2019 (Spotlight)

  15. arXiv:1901.08170  [pdf, ps, other]

    math.OC stat.ML

    A Fully Stochastic Primal-Dual Algorithm

    Authors: Pascal Bianchi, Walid Hachem, Adil Salim

    Abstract: A new stochastic primal-dual algorithm for solving a composite optimization problem is proposed. It is assumed that all the functions/operators that enter the optimization problem are given as statistical expectations. These expectations are unknown but revealed across time through i.i.d. realizations. The proposed algorithm is proven to converge to a saddle point of the Lagrangian function. In t…

    Submitted 22 June, 2020; v1 submitted 23 January, 2019; originally announced January 2019.

  16. arXiv:1808.06444  [pdf]

    cs.LG stat.ML

    Synthetic Patient Generation: A Deep Learning Approach Using Variational Autoencoders

    Authors: Ally Salim Jr

    Abstract: Artificial Intelligence in healthcare is a new and exciting frontier and the possibilities are endless. With deep learning approaches beating human performances in many areas, the logical next step is to attempt their application in the health space. For these and other Machine Learning approaches to produce good results and have their potential realized, the need for, and importance of, large amo…

    Submitted 20 August, 2018; originally announced August 2018.

    MSC Class: 68T00

  17. arXiv:1804.00934  [pdf, other]

    math.OC stat.ML

    A Constant Step Stochastic Douglas-Rachford Algorithm with Application to Non Separable Regularizations

    Authors: Adil Salim, Pascal Bianchi, Walid Hachem

    Abstract: The Douglas Rachford algorithm is an algorithm that converges to a minimizer of a sum of two convex functions. The algorithm consists in fixed point iterations involving computations of the proximity operators of the two functions separately. The paper investigates a stochastic version of the algorithm where both functions are random and the step size is constant. We establish that the iterates of…

    Submitted 3 April, 2018; originally announced April 2018.

  18. arXiv:1712.07027  [pdf, other]

    math.OC cs.LG stat.ML

    Snake: a Stochastic Proximal Gradient Algorithm for Regularized Problems over Large Graphs

    Authors: Adil Salim, Pascal Bianchi, Walid Hachem

    Abstract: A regularized optimization problem over a large unstructured graph is studied, where the regularization term is tied to the graph geometry. Typical regularization examples include the total variation and the Laplacian regularizations over the graph. When applying the proximal gradient algorithm to solve this problem, there exist quite affordable methods to implement the proximity operator (backwar…

    Submitted 19 December, 2017; originally announced December 2017.