Skip to main content

Showing 1–32 of 32 results for author: Ali, A

Searching in archive stat. Search in all archives.
.
  1. arXiv:2406.17445  [pdf, other

    stat.ME math.ST

    Copula-Based Estimation of Causal Effects in Multiple Linear and Path Analysis Models

    Authors: Alam Ali, Ashok Kumar Pathak, Mohd Arshad, Ayyub Sheikhi

    Abstract: Regression analysis is one of the most popularly used statistical technique which only measures the direct effect of independent variables on dependent variable. Path analysis looks for both direct and indirect effects of independent variables and may overcome several hurdles allied with regression models. It utilizes one or more structural regression equations in the model which are used to estim… ▽ More

    Submitted 25 June, 2024; originally announced June 2024.

    Comments: 23 pages, 3 figures, 11 tables

    MSC Class: 62H05; 62J05; 62F10

  2. arXiv:2405.11624  [pdf, other

    stat.ME math.ST

    On Generalized Transmuted Lifetime Distribution

    Authors: Alok Kumar Pandey, Alam Ali, Ashok Kumar Pathak

    Abstract: This article presents a new class of generalized transmuted lifetime distributions which includes a large number of lifetime distributions as sub-family. Several important mathematical quantities such as density function, distribution function, quantile function, moments, moment generating function, stress-strength reliability function, order statistics, Rényi and q-entropy, residual and reversed… ▽ More

    Submitted 19 May, 2024; originally announced May 2024.

    Comments: 26 pages, 8 figures

    MSC Class: 60E05; 62F10; 62E15; 65C05; 33B20

  3. arXiv:2401.12667  [pdf, ps, other

    stat.ML cs.LG

    Feature Selection via Robust Weighted Score for High Dimensional Binary Class-Imbalanced Gene Expression Data

    Authors: Zardad Khan, Amjad Ali, Saeed Aldahmani

    Abstract: In this paper, a robust weighted score for unbalanced data (ROWSU) is proposed for selecting the most discriminative feature for high dimensional gene expression binary classification with class-imbalance problem. The method addresses one of the most challenging problems of highly skewed class distributions in gene expression datasets that adversely affect the performance of classification algorit… ▽ More

    Submitted 23 January, 2024; originally announced January 2024.

    Comments: 25 pages

    MSC Class: 14J60

  4. arXiv:2401.05591  [pdf

    astro-ph.SR physics.data-an stat.AP

    Time Series of Magnetic Field Parameters of Merged MDI and HMI Space-Weather Active Region Patches as Potential Tool for Solar Flare Forecasting

    Authors: Paul A. Kosovich, Viacheslav M. Sadykov, Alexander G. Kosovichev, Spiridon Kasapis, Irina N. Kitiashvili, Patrick M. O'Keefe, Aatiya Ali, Vincent Oria, Samuel Granovsky, Chun Jie Chong, Gelu M. Nita

    Abstract: Solar flare prediction studies have been recently conducted with the use of Space-Weather MDI (Michelson Doppler Imager onboard Solar and Heliospheric Observatory) Active Region Patches (SMARP) and Space-Weather HMI (Helioseismic and Magnetic Imager onboard Solar Dynamics Observatory) Active Region Patches (SHARP), which are two currently available data products containing magnetic field character… ▽ More

    Submitted 22 February, 2024; v1 submitted 10 January, 2024; originally announced January 2024.

  5. arXiv:2303.12210  [pdf, ps, other

    stat.ML cs.LG

    A Random Projection k Nearest Neighbours Ensemble for Classification via Extended Neighbourhood Rule

    Authors: Amjad Ali, Muhammad Hamraz, Dost Muhammad Khan, Wajdan Deebani, Zardad Khan

    Abstract: Ensembles based on k nearest neighbours (kNN) combine a large number of base learners, each constructed on a sample taken from a given training data. Typical kNN based ensembles determine the k closest observations in the training data bounded to a test sample point by a spherical region to predict its class. In this paper, a novel random projection extended neighbourhood rule (RPExNRule) ensemble… ▽ More

    Submitted 21 March, 2023; originally announced March 2023.

    Comments: 23 pages, 8 diagrams, 69 references

    ACM Class: F.2.2

  6. arXiv:2211.11278  [pdf, ps, other

    stat.ML cs.LG

    Optimal Extended Neighbourhood Rule $k$ Nearest Neighbours Ensemble

    Authors: Amjad Ali, Zardad Khan, Dost Muhammad Khan, Saeed Aldahmani

    Abstract: The traditional k nearest neighbor (kNN) approach uses a distance formula within a spherical region to determine the k closest training observations to a test sample point. However, this approach may not work well when test point is located outside this region. Moreover, aggregating many base kNN learners can result in poor ensemble performance due to high classification errors. To address these i… ▽ More

    Submitted 15 February, 2024; v1 submitted 21 November, 2022; originally announced November 2022.

    Comments: This manuscript has been submitted for publication in the esteemed journal Pattern Recognition Letters

    MSC Class: 14J60

  7. arXiv:2202.04166  [pdf, other

    stat.ME stat.ML

    The Lifecycle of a Statistical Model: Model Failure Detection, Identification, and Refitting

    Authors: Alnur Ali, Maxime Cauchois, John C. Duchi

    Abstract: The statistical machine learning community has demonstrated considerable resourcefulness over the years in developing highly expressive tools for estimation, prediction, and inference. The bedrock assumptions underlying these developments are that the data comes from a fixed population and displays little heterogeneity. But reality is significantly more complex: statistical models now routinely fa… ▽ More

    Submitted 8 February, 2022; originally announced February 2022.

  8. arXiv:2201.08315  [pdf, other

    stat.ML cs.LG

    Predictive Inference with Weak Supervision

    Authors: Maxime Cauchois, Suyash Gupta, Alnur Ali, John Duchi

    Abstract: The expense of acquiring labels in large-scale statistical machine learning makes partially and weakly-labeled data attractive, though it is not always apparent how to leverage such data for model fitting or validation. We present a methodology to bridge the gap between partial supervision and validation, developing a conformal prediction framework to provide valid predictive confidence sets -- se… ▽ More

    Submitted 9 February, 2022; v1 submitted 20 January, 2022; originally announced January 2022.

  9. arXiv:2201.08311  [pdf, other

    stat.ML cs.LG math.OC

    Accelerated Gradient Flow: Risk, Stability, and Implicit Regularization

    Authors: Yue Sheng, Alnur Ali

    Abstract: Acceleration and momentum are the de facto standard in modern applications of machine learning and optimization, yet the bulk of the work on implicit regularization focuses instead on unaccelerated methods. In this paper, we study the statistical risk of the iterates generated by Nesterov's accelerated gradient method and Polyak's heavy ball method, when applied to least squares regression, drawin… ▽ More

    Submitted 20 January, 2022; originally announced January 2022.

  10. arXiv:2103.02559  [pdf, other

    cs.LG math.OC stat.ML

    Minimum-Distortion Embedding

    Authors: Akshay Agrawal, Alnur Ali, Stephen Boyd

    Abstract: We consider the vector embedding problem. We are given a finite set of items, with the goal of assigning a representative vector to each one, possibly under some constraints (such as the collection of vectors being standardized, i.e., having zero mean and unit covariance). We are given data indicating that some pairs of items are similar, and optionally, some other pairs are dissimilar. For pairs… ▽ More

    Submitted 24 August, 2021; v1 submitted 3 March, 2021; originally announced March 2021.

  11. arXiv:2011.03668  [pdf, other

    math.ST stat.ME

    Confidence bands for a log-concave density

    Authors: Guenther Walther, Alnur Ali, Xinyue Shen, Stephen Boyd

    Abstract: We present a new approach for inference about a log-concave distribution: Instead of using the method of maximum likelihood, we propose to incorporate the log-concavity constraint in an appropriate nonparametric confidence set for the cdf $F$. This approach has the advantage that it automatically provides a measure of statistical uncertainty and it thus overcomes a marked limitation of the maximum… ▽ More

    Submitted 6 May, 2022; v1 submitted 6 November, 2020; originally announced November 2020.

    Comments: Added a discussion section, minor changes

  12. arXiv:2008.04267  [pdf, other

    stat.ML cs.LG stat.ME

    Robust Validation: Confident Predictions Even When Distributions Shift

    Authors: Maxime Cauchois, Suyash Gupta, Alnur Ali, John C. Duchi

    Abstract: While the traditional viewpoint in machine learning and statistics assumes training and testing samples come from the same population, practice belies this fiction. One strategy -- coming from robust statistics and optimization -- is thus to build a model robust to distributional perturbations. In this paper, we take a different approach to describe procedures for robust predictive inference, wher… ▽ More

    Submitted 4 July, 2024; v1 submitted 10 August, 2020; originally announced August 2020.

    Comments: Published in the Journal of the American Statistical Association (JASA 2024)

  13. arXiv:2006.07187  [pdf, other

    eess.IV cs.AI cs.CV cs.LG stat.ML

    HMIC: Hierarchical Medical Image Classification, A Deep Learning Approach

    Authors: Kamran Kowsari, Rasoul Sali, Lubaina Ehsan, William Adorno, Asad Ali, Sean Moore, Beatrice Amadi, Paul Kelly, Sana Syed, Donald Brown

    Abstract: Image classification is central to the big data revolution in medicine. Improved information processing methods for diagnosis and classification of digital medical images have shown to be successful via deep learning approaches. As this field is explored, there are limitations to the performance of traditional supervised classifiers. This paper outlines an approach that is different from the curre… ▽ More

    Submitted 23 June, 2020; v1 submitted 12 June, 2020; originally announced June 2020.

    Journal ref: Information 11, no. 6 (2020): 318

  14. arXiv:2005.03868  [pdf, other

    eess.IV cs.LG stat.ML

    Hierarchical Deep Convolutional Neural Networks for Multi-category Diagnosis of Gastrointestinal Disorders on Histopathological Images

    Authors: Rasoul Sali, Sodiq Adewole, Lubaina Ehsan, Lee A. Denson, Paul Kelly, Beatrice C. Amadi, Lori Holtz, Syed Asad Ali, Sean R. Moore, Sana Syed, Donald E. Brown

    Abstract: Deep convolutional neural networks(CNNs) have been successful for a wide range of computer vision tasks, including image classification. A specific area of the application lies in digital pathology for pattern recognition in the tissue-based diagnosis of gastrointestinal(GI) diseases. This domain can utilize CNNs to translate histopathological images into precise diagnostics. This is challenging s… ▽ More

    Submitted 6 August, 2020; v1 submitted 8 May, 2020; originally announced May 2020.

    Comments: accepted at IEEE International Conference on Healthcare Informatics (ICHI 2020)

  15. arXiv:2003.09018  [pdf, other

    cs.CV cs.AI cs.LG stat.ML

    Human Activity Recognition from Wearable Sensor Data Using Self-Attention

    Authors: Saif Mahmud, M Tanjid Hasan Tonmoy, Kishor Kumar Bhaumik, A K M Mahbubur Rahman, M Ashraful Amin, Mohammad Shoyaib, Muhammad Asif Hossain Khan, Amin Ahsan Ali

    Abstract: Human Activity Recognition from body-worn sensor data poses an inherent challenge in capturing spatial and temporal dependencies of time-series signals. In this regard, the existing recurrent or convolutional or their hybrid models for activity recognition struggle to capture spatio-temporal context from the feature space of sensor reading sequence. To address this complex problem, we propose a se… ▽ More

    Submitted 17 March, 2020; originally announced March 2020.

    Comments: Accepted for publication at the 24th European Conference on Artificial Intelligence (ECAI-2020); 8 pages, 4 figures

  16. arXiv:2003.07802  [pdf, other

    stat.ML cs.LG math.OC

    The Implicit Regularization of Stochastic Gradient Flow for Least Squares

    Authors: Alnur Ali, Edgar Dobriban, Ryan J. Tibshirani

    Abstract: We study the implicit regularization of mini-batch stochastic gradient descent, when applied to the fundamental problem of least squares regression. We leverage a continuous-time stochastic differential equation having the same moments as stochastic gradient descent, which we call stochastic gradient flow. We give a bound on the excess risk of stochastic gradient flow at time $t$, over ridge regre… ▽ More

    Submitted 19 June, 2020; v1 submitted 17 March, 2020; originally announced March 2020.

    Comments: ICML 2020

  17. arXiv:2001.09249  [pdf, other

    cs.LG cs.PF stat.ML

    TiFL: A Tier-based Federated Learning System

    Authors: Zheng Chai, Ahsan Ali, Syed Zawad, Stacey Truex, Ali Anwar, Nathalie Baracaldo, Yi Zhou, Heiko Ludwig, Feng Yan, Yue Cheng

    Abstract: Federated Learning (FL) enables learning a shared model across many clients without violating the privacy requirements. One of the key attributes in FL is the heterogeneity that exists in both resource and data due to the differences in computation and communication capacity, as well as the quantity and content of data among different clients. We conduct a case study to show that heterogeneity in… ▽ More

    Submitted 24 January, 2020; originally announced January 2020.

  18. arXiv:2001.09001  [pdf, other

    cs.LG cs.MA cs.RO stat.ML

    MagNet: Discovering Multi-agent Interaction Dynamics using Neural Network

    Authors: Priyabrata Saha, Arslan Ali, Burhan A. Mudassar, Yun Long, Saibal Mukhopadhyay

    Abstract: We present the MagNet, a neural network-based multi-agent interaction model to discover the governing dynamics and predict evolution of a complex multi-agent system from observations. We formulate a multi-agent system as a coupled non-linear network with a generic ordinary differential equation (ODE) based state evolution, and develop a neural network-based realization of its time-discretized mode… ▽ More

    Submitted 3 March, 2020; v1 submitted 24 January, 2020; originally announced January 2020.

    Comments: Accepted manuscript by ICRA 2020

    Journal ref: ICRA 2020, pp. 8158-8164

  19. arXiv:1909.04525  [pdf, other

    cs.LG cs.CV stat.ML

    Skin cancer detection based on deep learning and entropy to detect outlier samples

    Authors: Andre G. C. Pacheco, Abder-Rahman Ali, Thomas Trappenberg

    Abstract: We describe our methods that achieved the 3rd and 4th places in tasks 1 and 2, respectively, at ISIC challenge 2019. The goal of this challenge is to provide the diagnostic for skin cancer using images and meta-data. There are nine classes in the dataset, nonetheless, one of them is an outlier and is not present on it. To tackle the challenge, we apply an ensemble of classifiers, which has 13 conv… ▽ More

    Submitted 5 January, 2020; v1 submitted 10 September, 2019; originally announced September 2019.

    Comments: 3rd and 4th places in tasks 1 and 2 respectively, at ISIC challenge 2019 @ MICCAI workshop 2019

  20. arXiv:1904.05773  [pdf, other

    eess.IV cs.CV cs.LG q-bio.QM stat.ML

    Diagnosis of Celiac Disease and Environmental Enteropathy on Biopsy Images Using Color Balancing on Convolutional Neural Networks

    Authors: Kamran Kowsari, Rasoul Sali, Marium N. Khan, William Adorno, S. Asad Ali, Sean R. Moore, Beatrice C. Amadi, Paul Kelly, Sana Syed, Donald E. Brown

    Abstract: Celiac Disease (CD) and Environmental Enteropathy (EE) are common causes of malnutrition and adversely impact normal childhood development. CD is an autoimmune disorder that is prevalent worldwide and is caused by an increased sensitivity to gluten. Gluten exposure destructs the small intestinal epithelial barrier, resulting in nutrient mal-absorption and childhood under-nutrition. EE also results… ▽ More

    Submitted 9 October, 2019; v1 submitted 10 April, 2019; originally announced April 2019.

  21. arXiv:1902.07855  [pdf, other

    stat.ML cs.LG q-fin.GN

    Stacking with Neural network for Cryptocurrency investment

    Authors: Avinash Barnwal, Hari Pad Bharti, Aasim Ali, Vishal Singh

    Abstract: Predicting the direction of assets have been an active area of study and a difficult task. Machine learning models have been used to build robust models to model the above task. Ensemble methods is one of them showing results better than a single supervised method. In this paper, we have used generative and discriminative classifiers to create the stack, particularly 3 generative and 6 discriminat… ▽ More

    Submitted 22 February, 2019; v1 submitted 20 February, 2019; originally announced February 2019.

    Comments: 20 pages,7 figues

  22. arXiv:1810.10082  [pdf, other

    stat.ML cs.LG

    A Continuous-Time View of Early Stopping for Least Squares

    Authors: Alnur Ali, J. Zico Kolter, Ryan J. Tibshirani

    Abstract: We study the statistical properties of the iterates generated by gradient descent, applied to the fundamental problem of least squares regression. We take a continuous-time view, i.e., consider infinitesimal step sizes in gradient descent, in which case the iterates form a trajectory called gradient flow. Our primary focus is to compare the risk of gradient flow to that of ridge regression. Under… ▽ More

    Submitted 23 February, 2019; v1 submitted 23 October, 2018; originally announced October 2018.

  23. arXiv:1810.05041  [pdf, other

    cs.LG cs.AI stat.ML

    A General Framework for Fair Regression

    Authors: Jack Fitzsimons, AbdulRahman Al Ali, Michael Osborne, Stephen Roberts

    Abstract: Fairness, through its many forms and definitions, has become an important issue facing the machine learning community. In this work, we consider how to incorporate group fairness constraints in kernel regression methods, applicable to Gaussian processes, support vector machines, neural network regression and decision tree regression. Further, we focus on examining the effect of incorporating these… ▽ More

    Submitted 2 February, 2019; v1 submitted 10 October, 2018; originally announced October 2018.

    Comments: 8 pages, 4 figures, 2 pages references

  24. arXiv:1710.10769  [pdf, other

    stat.ML cs.DC cs.LG

    Communication-Avoiding Optimization Methods for Distributed Massive-Scale Sparse Inverse Covariance Estimation

    Authors: Penporn Koanantakool, Alnur Ali, Ariful Azad, Aydin Buluc, Dmitriy Morozov, Leonid Oliker, Katherine Yelick, Sang-Yun Oh

    Abstract: Across a variety of scientific disciplines, sparse inverse covariance estimation is a popular tool for capturing the underlying dependency relationships in multivariate data. Unfortunately, most estimators are not scalable enough to handle the sizes of modern high-dimensional data sets (often on the order of terabytes), and assume Gaussian samples. To address these deficiencies, we introduce HP-CO… ▽ More

    Submitted 8 April, 2018; v1 submitted 30 October, 2017; originally announced October 2017.

    Comments: Main paper: 15 pages, appendix: 24 pages

    Journal ref: Artificial Intelligence and Statistics vol. 84 1376-1386 (2018)

  25. arXiv:1607.00515  [pdf, other

    stat.ME

    The Multiple Quantile Graphical Model

    Authors: Alnur Ali, J. Zico Kolter, Ryan J. Tibshirani

    Abstract: We introduce the Multiple Quantile Graphical Model (MQGM), which extends the neighborhood selection approach of Meinshausen and Buhlmann for learning sparse graphical models. The latter is defined by the basic subproblem of modeling the conditional mean of one variable as a sparse function of all others. Our approach models a set of conditional quantiles of one variable as a sparse function of all… ▽ More

    Submitted 27 October, 2016; v1 submitted 2 July, 2016; originally announced July 2016.

  26. arXiv:1606.00033  [pdf, other

    stat.ME

    Generalized Pseudolikelihood Methods for Inverse Covariance Estimation

    Authors: Alnur Ali, Kshitij Khare, Sang-Yun Oh, Bala Rajaratnam

    Abstract: We introduce PseudoNet, a new pseudolikelihood-based estimator of the inverse covariance matrix, that has a number of useful statistical and computational properties. We show, through detailed experiments with synthetic and also real-world finance as well as wind power data, that PseudoNet outperforms related methods in terms of estimation error and support recovery, making it well-suited for use… ▽ More

    Submitted 14 October, 2016; v1 submitted 31 May, 2016; originally announced June 2016.

  27. arXiv:1506.09060  [pdf, other

    cs.IT stat.AP

    Nonlinear Distortion Reduction in OFDM from Reliable Perturbations in Data Carriers

    Authors: Ebrahim B. Al-Safadi, Tareq Y. Al-Naffouri, Mudassir Masood, Anum Ali

    Abstract: A novel method for correcting the effect of nonlinear distortion in orthogonal frequency division multiplexing signals is proposed. The method depends on adaptively selecting the distortion over a subset of the data carriers, and then using tools from compressed sensing and sparse Bayesian recovery to estimate the distortion over the other carriers. Central to this method is the fact that carriers… ▽ More

    Submitted 30 June, 2015; originally announced June 2015.

    Comments: 27 pages, 11 Figures

  28. arXiv:1412.6137  [pdf, ps, other

    cs.IT stat.AP

    Narrowband Interference Mitigation in SC-FDMA Using Bayesian Sparse Recovery

    Authors: Anum Ali, Mudassir Masood, Muhammad S. Sohail, Samir Al-Ghadhban, Tareq Y. Al-Naffouri

    Abstract: This paper presents a novel narrowband interference (NBI) mitigation scheme for SC-FDMA systems. The proposed NBI cancellation scheme exploits the frequency domain sparsity of the unknown signal and adopts a low complexity Bayesian sparse recovery procedure. At the transmitter, a few randomly chosen sub-carriers are kept data free to sense the NBI signal at the receiver. Further, it is noted that… ▽ More

    Submitted 8 October, 2014; originally announced December 2014.

  29. arXiv:1410.2457  [pdf, other

    stat.AP cs.IT

    Receiver-based Recovery of Clipped OFDM Signals for PAPR Reduction: A Bayesian Approach

    Authors: Anum Ali, Abdullatif Al-Rabah, Mudassir Masood, Tareq Y. Al-Naffouri

    Abstract: Clipping is one of the simplest peak-to-average power ratio (PAPR) reduction schemes for orthogonal frequency division multiplexing (OFDM). Deliberately clipping the transmission signal degrades system performance, and clipping mitigation is required at the receiver for information restoration. In this work, we acknowledge the sparse nature of the clipping signal and propose a low-complexity Bayes… ▽ More

    Submitted 21 October, 2014; v1 submitted 8 October, 2014; originally announced October 2014.

  30. arXiv:1301.0550  [pdf

    cs.AI stat.ME

    Markov Equivalence Classes for Maximal Ancestral Graphs

    Authors: Ayesha R. Ali, Thomas S. Richardson

    Abstract: Ancestral graphs are a class of graphs that encode conditional independence relations arising in DAG models with latent and selection variables, corresponding to marginalization and conditioning. However, for any ancestral graph, there may be several other graphs to which it is Markov equivalent. We introduce a simple representation of a Markov equivalence class of ancestral graphs, thereby faci… ▽ More

    Submitted 12 December, 2012; originally announced January 2013.

    Comments: Appears in Proceedings of the Eighteenth Conference on Uncertainty in Artificial Intelligence (UAI2002)

    Report number: UAI-P-2002-PG-1-9

  31. arXiv:1207.1365  [pdf

    stat.ME cs.AI

    Towards Characterizing Markov Equivalence Classes for Directed Acyclic Graphs with Latent Variables

    Authors: Ayesha R. Ali, Thomas S. Richardson, Peter L. Spirtes, Jiji Zhang

    Abstract: It is well known that there may be many causal explanations that are consistent with a given set of data. Recent work has been done to represent the common aspects of these explanations into one representation. In this paper, we address what is less well known: how do the relationships common to every causal explanation among the observed variables of some DAG process change in the presence of lat… ▽ More

    Submitted 4 July, 2012; originally announced July 2012.

    Comments: Appears in Proceedings of the Twenty-First Conference on Uncertainty in Artificial Intelligence (UAI2005)

    Report number: UAI-P-2005-PG-10-17

  32. arXiv:0905.1540  [pdf, ps, other

    stat.ML

    Supplementary material for Markov equivalence for ancestral graphs

    Authors: R. A. Ali, T. Richardson, P. Spirtes

    Abstract: We prove that the criterion for Markov equivalence provided by Zhao et al. (2005) may involve a set of features of a graph that is exponential in the number of vertices.

    Submitted 11 May, 2009; originally announced May 2009.

    Comments: 2 pages, 1 figure, supplement to paper to appear in the Annals of Statistics