Showing 1–10 of 10 results for author: Abdulah, S

Searching in archive stat.

  1. arXiv:2408.04440

    stat.CO

    Boosting Earth System Model Outputs And Saving PetaBytes in their Storage Using Exascale Climate Emulators

    Authors: Sameh Abdulah, Allison H. Baker, George Bosilca, Qinglei Cao, Stefano Castruccio, Marc G. Genton, David E. Keyes, Zubair Khalid, Hatem Ltaief, Yan Song, Georgiy L. Stenchikov, Ying Sun

    Abstract: We present the design and scalable implementation of an exascale climate emulator for addressing the escalating computational and storage requirements of high-resolution Earth System Model simulations. We utilize the spherical harmonic transform to stochastically model spatio-temporal variations in climate data. This provides tunable spatio-temporal resolution and significantly improves the fideli…

    Submitted 11 August, 2024; v1 submitted 8 August, 2024; originally announced August 2024.

  2. arXiv:2406.02701

    stat.CO

    MPCR: Multi- and Mixed-Precision Computations Package in R

    Authors: Mary Lai O. Salvana, Sameh Abdulah, Minwoo Kim, David Helmy, Ying Sun, Marc G. Genton

    Abstract: Computational statistics has traditionally utilized double-precision (64-bit) data structures and full-precision operations, resulting in higher-than-necessary accuracy for certain applications. Recently, there has been a growing interest in exploring low-precision options that could reduce computational complexity while still achieving the required level of accuracy. This trend has been amplified…

    Submitted 4 June, 2024; originally announced June 2024.

  3. arXiv:2405.14892

    cs.DC stat.CO

    Parallel Approximations for High-Dimensional Multivariate Normal Probability Computation in Confidence Region Detection Applications

    Authors: Xiran Zhang, Sameh Abdulah, Jian Cao, Hatem Ltaief, Ying Sun, Marc G. Genton, David E. Keyes

    Abstract: Addressing the statistical challenge of computing the multivariate normal (MVN) probability in high dimensions holds significant potential for enhancing various applications. One common way to compute high-dimensional MVN probabilities is the Separation-of-Variables (SOV) algorithm. This algorithm is known for its high computational complexity of O(n^3) and space complexity of O(n^2), mainly due t…

    Submitted 18 May, 2024; originally announced May 2024.

  4. arXiv:2403.07412

    stat.CO cs.DC

    GPU-Accelerated Vecchia Approximations of Gaussian Processes for Geospatial Data using Batched Matrix Computations

    Authors: Qilong Pan, Sameh Abdulah, Marc G. Genton, David E. Keyes, Hatem Ltaief, Ying Sun

    Abstract: Gaussian processes (GPs) are commonly used for geospatial analysis, but they suffer from high computational complexity when dealing with massive data. For instance, the log-likelihood function required in estimating the statistical model parameters for geospatial data is a computationally intensive procedure that involves computing the inverse of a covariance matrix with size n × n, where n repres…

    Submitted 3 April, 2024; v1 submitted 12 March, 2024; originally announced March 2024.

  5. arXiv:2402.09356

    stat.CO stat.ME

    On the Impact of Spatial Covariance Matrix Ordering on Tile Low-Rank Estimation of Matérn Parameters

    Authors: Sihan Chen, Sameh Abdulah, Ying Sun, Marc G. Genton

    Abstract: Spatial statistical modeling and prediction involve generating and manipulating an n × n symmetric positive definite covariance matrix, where n denotes the number of spatial locations. However, when n is large, processing this covariance matrix using traditional methods becomes prohibitive. Thus, coupling parallel processing with approximation can be an elegant solution to this challenge by relying…

    Submitted 14 February, 2024; originally announced February 2024.

    Comments: 31 pages, 13 figures

  6. arXiv:2309.12000

    stat.ME stat.CO

    Which Parameterization of the Matérn Covariance Function?

    Authors: Kesen Wang, Sameh Abdulah, Ying Sun, Marc G. Genton

    Abstract: The Matérn family of covariance functions is currently the most widely used model in spatial statistics, geostatistics, and machine learning to specify the correlation between two geographical locations based on spatial distance. Compared to existing covariance functions, the Matérn family has more flexibility in data fitting because it allows the control of the field smoothness through a dedic…

    Submitted 28 August, 2023; originally announced September 2023.

  7. arXiv:2306.11487

    stat.ML cs.LG stat.CO

    Efficient Large-scale Nonstationary Spatial Covariance Function Estimation Using Convolutional Neural Networks

    Authors: Pratik Nag, Yiping Hong, Sameh Abdulah, Ghulam A. Qadir, Marc G. Genton, Ying Sun

    Abstract: Spatial processes observed in various fields, such as climate and environmental science, often occur on a large scale and demonstrate spatial nonstationarity. Fitting a Gaussian process with a nonstationary Matérn covariance is challenging. Previous studies in the literature have tackled this challenge by employing spatial partitioning techniques to estimate the parameters that vary spatially in t…

    Submitted 20 June, 2023; originally announced June 2023.

  8. arXiv:2211.03119

    stat.OT

    The Second Competition on Spatial Statistics for Large Datasets

    Authors: Sameh Abdulah, Faten Alamri, Pratik Nag, Ying Sun, Hatem Ltaief, David E. Keyes, Marc G. Genton

    Abstract: In the last few decades, the size of spatial and spatio-temporal datasets in many research areas has rapidly increased with the development of data collection technologies. As a result, classical statistical methods in spatial statistics are facing computational challenges. For example, the kriging predictor in geostatistics becomes prohibitive on traditional hardware architectures for large datas…

    Submitted 6 November, 2022; originally announced November 2022.

  9. Efficiency Assessment of Approximated Spatial Predictions for Large Datasets

    Authors: Yiping Hong, Sameh Abdulah, Marc G. Genton, Ying Sun

    Abstract: Due to the well-known computational showstopper of the exact Maximum Likelihood Estimation (MLE) for large geospatial observations, a variety of approximation methods have been proposed in the literature, which usually require tuning certain inputs. For example, the recently developed Tile Low-Rank approximation (TLR) method involves many tuning parameters, including numerical accuracy. To properl…

    Submitted 9 June, 2021; v1 submitted 11 November, 2019; originally announced November 2019.

    Comments: 43 pages + 8 pages of Supplementary Material, 8 figures, 8 tables + 8 tables in Supplementary Material. The Abstract is slightly abridged compared to the article. Corrected the affiliation of Sameh Abdulah

    Journal ref: Spatial Statistics, 43, 100517 (2021)

  10. arXiv:1908.06936

    cs.DC stat.CO

    Large-scale Environmental Data Science with ExaGeoStatR

    Authors: Sameh Abdulah, Yuxiao Li, Jian Cao, Hatem Ltaief, David E. Keyes, Marc G. Genton, Ying Sun

    Abstract: Parallel computing in Gaussian process calculations becomes necessary for avoiding computational and memory restrictions associated with large-scale environmental data science applications. The evaluation of the Gaussian log-likelihood function requires O(n^2) storage and O(n^3) operations where n is the number of geographical locations. Thus, computing the log-likelihood function with a large num…

    Submitted 18 October, 2022; v1 submitted 23 July, 2019; originally announced August 2019.