Search | arXiv e-print repository

An Adaptive Second-order Method for a Class of Nonconvex Nonsmooth Composite Optimization

Authors: Hao Wang, Xiangyu Yang, Yichen Zhu

Abstract: This paper explores a specific type of nonconvex sparsity-promoting regularization problems, namely those involving $\ell_p$-norm regularization, in conjunction with a twice continuously differentiable loss function. We propose a novel second-order algorithm designed to effectively address this class of challenging nonconvex and nonsmooth problems, showcasing several innovative features: (i) The u… ▽ More This paper explores a specific type of nonconvex sparsity-promoting regularization problems, namely those involving $\ell_p$-norm regularization, in conjunction with a twice continuously differentiable loss function. We propose a novel second-order algorithm designed to effectively address this class of challenging nonconvex and nonsmooth problems, showcasing several innovative features: (i) The use of an alternating strategy to solve a reweighted $\ell_1$ regularized subproblem and the subspace approximate Newton step. (ii) The reweighted $\ell_1$ regularized subproblem relies on a convex approximation to the nonconvex regularization term, enabling a closed-form solution characterized by the soft-thresholding operator. This feature allows our method to be applied to various nonconvex regularization problems. (iii) Our algorithm ensures that the iterates maintain their sign values and that nonzero components are kept away from 0 for a sufficient number of iterations, eventually transitioning to a perturbed Newton method. (iv) We provide theoretical guarantees of global convergence, local superlinear convergence in the presence of the Kurdyka-Łojasiewicz (KL) property, and local quadratic convergence when employing the exact Newton step in our algorithm. We also showcase the effectiveness of our approach through experiments on a diverse set of model prediction problems. △ Less

Submitted 24 July, 2024; originally announced July 2024.

MSC Class: 90C26; 49M15; 90C53

arXiv:2407.16360 [pdf, ps, other]

Anisotropic grand Herz type spaces with variable exponents and their applications

Authors: Hongbin Wang, Zongguang Liu

Abstract: In this paper, we introduce some anisotropic grand Herz type spaces with variable exponents, including anisotropic grand Herz spaces, anisotropic grand Herz-Morrey spaces and anisotropic grand Herz-type Hardy spaces with variable exponents. We obtain some properties and characterizations of these spaces in terms of some decompositions. Using their decompositions, we obtain some boundedness on the… ▽ More In this paper, we introduce some anisotropic grand Herz type spaces with variable exponents, including anisotropic grand Herz spaces, anisotropic grand Herz-Morrey spaces and anisotropic grand Herz-type Hardy spaces with variable exponents. We obtain some properties and characterizations of these spaces in terms of some decompositions. Using their decompositions, we obtain some boundedness on the anisotropic grand Herz type spaces with variable exponents for some singular integral operators. △ Less

Submitted 23 July, 2024; originally announced July 2024.

arXiv:2407.16275 [pdf, ps, other]

A higher index on finite-volume locally symmetric spaces

Authors: Hao Guo, Peter Hochs, Hang Wang

Abstract: Let $G$ be a connected, real semisimple Lie group. Let $K<G$ be maximal compact, and let $Γ< G$ be discrete and such that $Γ\backslash G$ has finite volume. If the real rank of $G$ is $1$ and $Γ$ is torsion-free, then Barbasch and Moscovici obtained an index theorem for Dirac operators on the locally symmetric space $Γ\backslash G/K$. We obtain a higher version of this, by constructing an index of… ▽ More Let $G$ be a connected, real semisimple Lie group. Let $K<G$ be maximal compact, and let $Γ< G$ be discrete and such that $Γ\backslash G$ has finite volume. If the real rank of $G$ is $1$ and $Γ$ is torsion-free, then Barbasch and Moscovici obtained an index theorem for Dirac operators on the locally symmetric space $Γ\backslash G/K$. We obtain a higher version of this, by constructing an index of Dirac operators on $G/K$ in the $K$-theory of an algebra on which the conjugation-invariant terms in Barbasch and Moscovici's index theorem define continuous traces. The resulting index theorems also apply when $Γ$ has torsion. The cases of these index theorems for traces defined by semisimple orbital integrals extend to Song and Tang's higher orbital integrals, and yield nonzero and computable results even when $\operatorname{rank}(G)> \operatorname{rank}(K)$, or the real rank of $G$ is larger than $1$. △ Less

Submitted 23 July, 2024; originally announced July 2024.

arXiv:2407.15384 [pdf, other]

Inversion Diameter and Treewidth

Authors: Yichen Wang, Haozhe Wang, Yuxuan Yang, Mei Lu

Abstract: In an oriented graph $\overrightarrow{G}$, the inversion of a subset $X$ of vertices consists in reversing the orientation of all arcs with both end-vertices in $X$. The inversion graph of a graph $G$, denoted by $\mathcal{I}(G)$, is the graph whose vertices are orientations of $G$ in which two orientations $\overrightarrow{G_1}$ and $\overrightarrow{G_2}$ are adjacent if and only if there is an i… ▽ More In an oriented graph $\overrightarrow{G}$, the inversion of a subset $X$ of vertices consists in reversing the orientation of all arcs with both end-vertices in $X$. The inversion graph of a graph $G$, denoted by $\mathcal{I}(G)$, is the graph whose vertices are orientations of $G$ in which two orientations $\overrightarrow{G_1}$ and $\overrightarrow{G_2}$ are adjacent if and only if there is an inversion $X$ transforming $\overrightarrow{G_1}$ into $\overrightarrow{G_2}$. The inversion diameter of a graph $G$ is the diameter of its inversion graph $\mathcal{I}(G)$ denoted by $diam(\mathcal{I}(G))$. Havet, Hörsch, and Rambaud~(2024) first proved that for $G$ of treewidth $k$, $diam(\mathcal{I}(G)) \le 2k$, and there are graphs of treewidth $k$ with inversion diameter $k+2$. In this paper, we construct graphs of treewidth $k$ with inversion diameter $2k$, which implies that the previous upper bound $diam(\mathcal{I}(G)) \le 2k$ is tight. Moreover, for graphs with maximum degree $Δ$, Havet, Hörsch, and Rambaud~(2024) proved $diam(\mathcal{I}(G)) \le 2Δ-1$ and conjectured that $diam(\mathcal{I}(G)) \le Δ$. We prove the conjecture when $Δ=3$ with the help of computer calculations. △ Less

Submitted 22 July, 2024; originally announced July 2024.

arXiv:2407.14867 [pdf, ps, other]

On a level analog of Selberg's result on $S(t)$

Authors: Qingfeng Sun, Hui Wang

Abstract: Let $S(t,f)=π^{-1}\arg L(1/2+it, f)$, where $f$ is a holomorphic Hecke cusp form of weight $2$ and prime level $q$. In this paper, we establish an asymptotic formula for the moments of $S(t,f)$ without assuming the GRH. Let $S(t,f)=π^{-1}\arg L(1/2+it, f)$, where $f$ is a holomorphic Hecke cusp form of weight $2$ and prime level $q$. In this paper, we establish an asymptotic formula for the moments of $S(t,f)$ without assuming the GRH. △ Less

Submitted 20 July, 2024; originally announced July 2024.

Comments: 19 pages

arXiv:2407.13984 [pdf, ps, other]

The first Neumann eigenvalue and the width

Authors: Haibin Wang, Guoyi Xu

Abstract: We prove the sharp lower bound of the first Neumann eigenvalue for bounded convex planar domain in term of its diameter and width. We prove the sharp lower bound of the first Neumann eigenvalue for bounded convex planar domain in term of its diameter and width. △ Less

Submitted 18 July, 2024; originally announced July 2024.

arXiv:2407.13120 [pdf, other]

HPPP: Halpern-type Preconditioned Proximal Point Algorithms and Applications to Image Restoration

Authors: Shuchang Zhang, Hui Zhang, Hongxia Wang

Abstract: Preconditioned Proximal Point (PPP) algorithms provide a unified framework for splitting methods in image restoration. Recent advancements with RED (Regularization by Denoising) and PnP (Plug-and-Play) priors have achieved state-of-the-art performance in this domain, emphasizing the need for a meaningful particular solution. However, degenerate PPP algorithms typically exhibit weak convergence in… ▽ More Preconditioned Proximal Point (PPP) algorithms provide a unified framework for splitting methods in image restoration. Recent advancements with RED (Regularization by Denoising) and PnP (Plug-and-Play) priors have achieved state-of-the-art performance in this domain, emphasizing the need for a meaningful particular solution. However, degenerate PPP algorithms typically exhibit weak convergence in infinite-dimensional Hilbert space, leading to uncertain solutions. To address this issue, we propose the Halpern-type Preconditioned Proximal Point (HPPP) algorithm, which leverages the strong convergence properties of Halpern iteration to achieve a particular solution. Based on the implicit regularization defined by gradient RED, we further introduce the Gradient REgularization by Denoising via HPPP called GraRED-HP3 algorithm. The HPPP algorithm is shown to have the regularity converging to a particular solution by a toy example. Additionally, experiments in image deblurring and inpainting validate the effectiveness of GraRED-HP3, showing it surpasses classical methods such as Chambolle-Pock (CP), PPP, RED, and RED-PRO. △ Less

Submitted 21 July, 2024; v1 submitted 17 July, 2024; originally announced July 2024.

arXiv:2407.12845 [pdf, other]

Time-dependent Regularized 13-Moment Equations with Onsager Boundary Conditions in the Linear Regime

Authors: Bo Lin, Haoxuan Wang, Siyao Yang, Zhenning Cai

Abstract: We develop the time-dependent regularized 13-moment equations for general elastic collision models under the linear regime. Detailed derivation shows the proposed equations have super-Burnett order for small Knudsen numbers, and the moment equations enjoy a symmetric structure. A new modification of Onsager boundary conditions is proposed to ensure stability as well as the removal of undesired bou… ▽ More We develop the time-dependent regularized 13-moment equations for general elastic collision models under the linear regime. Detailed derivation shows the proposed equations have super-Burnett order for small Knudsen numbers, and the moment equations enjoy a symmetric structure. A new modification of Onsager boundary conditions is proposed to ensure stability as well as the removal of undesired boundary layers. Numerical examples of one-dimensional channel flows is conducted to verified our model. △ Less

Submitted 5 July, 2024; originally announced July 2024.

Comments: 30 pages, 24 figures

MSC Class: 76P05; 82C40

arXiv:2407.11465 [pdf, ps, other]

Testing by Betting while Borrowing and Bargaining

Authors: Hongjian Wang, Aaditya Ramdas

Abstract: Testing by betting has been a cornerstone of the game-theoretic statistics literature. In this framework, a betting score (or more generally an e-process), as opposed to a traditional p-value, is used to quantify the evidence against a null hypothesis: the higher the betting score, the more money one has made betting against the null, and thus the larger the evidence that the null is false. A key… ▽ More Testing by betting has been a cornerstone of the game-theoretic statistics literature. In this framework, a betting score (or more generally an e-process), as opposed to a traditional p-value, is used to quantify the evidence against a null hypothesis: the higher the betting score, the more money one has made betting against the null, and thus the larger the evidence that the null is false. A key ingredient assumed throughout past works is that one cannot bet more money than one currently has. In this paper, we ask what happens if the bettor is allowed to borrow money after going bankrupt, allowing further financial flexibility in this game of hypothesis testing. We propose various definitions of (adjusted) evidence relative to the wealth borrowed, indebted, and accumulated. We also ask what happens if the bettor can "bargain", in order to obtain odds bettor than specified by the null hypothesis. The adjustment of wealth in order to serve as evidence appeals to the characterization of arbitrage, interest rates, and numéraire-adjusted pricing in this setting. △ Less

Submitted 16 July, 2024; originally announced July 2024.

arXiv:2407.10205 [pdf, other]

Parallel Ising Annealer via Gradient-based Hamiltonian Monte Carlo

Authors: Hao Wang, Zixuan Liu, Zhixin Xie, Langyu Li, Zibo Miao, Wei Cui, Yu Pan

Abstract: Ising annealer is a promising quantum-inspired computing architecture for combinatorial optimization problems. In this paper, we introduce an Ising annealer based on the Hamiltonian Monte Carlo, which updates the variables of all dimensions in parallel. The main innovation is the fusion of an approximate gradient-based approach into the Ising annealer which introduces significant acceleration and… ▽ More Ising annealer is a promising quantum-inspired computing architecture for combinatorial optimization problems. In this paper, we introduce an Ising annealer based on the Hamiltonian Monte Carlo, which updates the variables of all dimensions in parallel. The main innovation is the fusion of an approximate gradient-based approach into the Ising annealer which introduces significant acceleration and allows a portable and scalable implementation on the commercial FPGA. Comprehensive simulation and hardware experiments show that the proposed Ising annealer has promising performance and scalability on all types of benchmark problems when compared to other Ising annealers including the state-of-the-art hardware. In particular, we have built a prototype annealer which solves Ising problems of both integer and fraction coefficients with up to 200 spins on a single low-cost FPGA board, whose performance is demonstrated to be better than the state-of-the-art quantum hardware D-Wave 2000Q and similar to the expensive coherent Ising machine. The sub-linear scalability of the annealer signifies its potential in solving challenging combinatorial optimization problems and evaluating the advantage of quantum hardware. △ Less

Submitted 14 July, 2024; originally announced July 2024.

arXiv:2407.09901 [pdf, other]

Stochastic generalized Kolmogorov systems with small diffusion: II. Explicit approximations for periodic solutions in distribution

Authors: Baoquan Zhou, Hao Wang, Tianxu Wang, Daqing Jiang

Abstract: This paper is Part II of a two-part series on coexistence states study in stochastic generalized Kolmogorov systems under small diffusion. Part I provided a complete characterization for approximating invariant probability measures and density functions, while here, we focus on explicit approximations for periodic solutions in distribution. Two easily implementable methods are introduced: periodic… ▽ More This paper is Part II of a two-part series on coexistence states study in stochastic generalized Kolmogorov systems under small diffusion. Part I provided a complete characterization for approximating invariant probability measures and density functions, while here, we focus on explicit approximations for periodic solutions in distribution. Two easily implementable methods are introduced: periodic normal approximation (PNOA) and periodic log-normal approximation (PLNA). These methods offer unified algorithms to calculate the mean and covariance matrix, and verify positive definiteness, without additional constraints like non-degenerate diffusion. Furthermore, we explore essential properties of the covariance matrix, particularly its connection under periodic and non-periodic drift coefficients. Our new approximation methods significantly relax the minimal criteria for positive definiteness of the solution of the discrete-type Lyapunov equation. Some numerical experiments are provided to support our theoretical results. △ Less

Submitted 13 July, 2024; originally announced July 2024.

Comments: 39 pages, 5 figures

MSC Class: 37H05; 37H30; 45M15; 60H10

arXiv:2407.06453 [pdf, ps, other]

Dual minus partial order

Authors: Ju Gao, Hongxing Wang, Xiaoji Liu

Abstract: In this paper, we introduce the Dual-minus partial order, get some characterizations of the partial order, and prove that both the dual star partial order and the dual sharp partial order are Dual-minus-type partial orders. Based on the Dual-minus partial order, we introduce the Dual-minus sharp partial order and the Dual-minus star partial order, which are also Dual-minus-type partial orders. In… ▽ More In this paper, we introduce the Dual-minus partial order, get some characterizations of the partial order, and prove that both the dual star partial order and the dual sharp partial order are Dual-minus-type partial orders. Based on the Dual-minus partial order, we introduce the Dual-minus sharp partial order and the Dual-minus star partial order, which are also Dual-minus-type partial orders. In addition, we discuss relationships among the Dual-minus sharp partial order, the D-sharp partial order and the G-sharp partial order(the Dual-minus star partial order, the D-star partial order and the P-star partial order). △ Less

Submitted 8 July, 2024; originally announced July 2024.

Comments: 23 pages

MSC Class: 15A09; 15A24; 62G30

arXiv:2407.00414 [pdf, ps, other]

Safe and Stable Filter Design Using a Relaxed Compatibitlity Control Barrier -- Lyapunov Condition

Authors: Han Wang, Kostas Margellos, Antonis Papachristodoulou

Abstract: In this paper, we propose a quadratic programming-based filter for safe and stable controller design, via a Control Barrier Function (CBF) and a Control Lyapunov Function (CLF). Our method guarantees safety and local asymptotic stability without the need for an asymptotically stabilizing control law. Feasibility of the proposed program is ensured under a mild regularity condition, termed relaxed c… ▽ More In this paper, we propose a quadratic programming-based filter for safe and stable controller design, via a Control Barrier Function (CBF) and a Control Lyapunov Function (CLF). Our method guarantees safety and local asymptotic stability without the need for an asymptotically stabilizing control law. Feasibility of the proposed program is ensured under a mild regularity condition, termed relaxed compatibility between the CLF and CBF. The resulting optimal control law is guaranteed to be locally Lipschitz continuous. We also analyze the closed-loop behaviour by characterizing the equilibrium points, and verifying that there are no equilibrium points in the interior of the control invariant set except at the origin. For a polynomial system and a semi-algebraic safe set, we provide a sum-of-squares program to design a relaxed compatible pair of CLF and CBF. The proposed approach is compared with other methods in the literature using numerical examples, exhibits superior filter performance and guarantees safety and local stability. △ Less

Submitted 29 June, 2024; originally announced July 2024.

arXiv:2406.15713 [pdf, other]

Efficient Low-rank Identification via Accelerated Iteratively Reweighted Nuclear Norm Minimization

Authors: Hao Wang, Ye Wang, Xiangyu Yang

Abstract: This paper considers the problem of minimizing the sum of a smooth function and the Schatten-$p$ norm of the matrix. Our contribution involves proposing accelerated iteratively reweighted nuclear norm methods designed for solving the nonconvex low-rank minimization problem. Two major novelties characterize our approach. Firstly, the proposed method possesses a rank identification property, enablin… ▽ More This paper considers the problem of minimizing the sum of a smooth function and the Schatten-$p$ norm of the matrix. Our contribution involves proposing accelerated iteratively reweighted nuclear norm methods designed for solving the nonconvex low-rank minimization problem. Two major novelties characterize our approach. Firstly, the proposed method possesses a rank identification property, enabling the provable identification of the "correct" rank of the stationary point within a finite number of iterations. Secondly, we introduce an adaptive updating strategy for smoothing parameters. This strategy automatically fixes parameters associated with zero singular values as constants upon detecting the "correct" rank while quickly driving the rest of the parameters to zero. This adaptive behavior transforms the algorithm into one that effectively solves smooth problems after a few iterations, setting our work apart from existing iteratively reweighted methods for low-rank optimization. We prove the global convergence of the proposed algorithm, guaranteeing that every limit point of the iterates is a critical point. Furthermore, a local convergence rate analysis is provided under the Kurdyka-Łojasiewicz property. We conduct numerical experiments using both synthetic and real data to showcase our algorithm's efficiency and superiority over existing methods. △ Less

Submitted 26 June, 2024; v1 submitted 21 June, 2024; originally announced June 2024.

Comments: Copyright may be transferred without notice, after which this version may no longer be accessible

arXiv:2406.13166 [pdf]

Enhancing supply chain security with automated machine learning

Authors: Haibo Wang, Lutfu S. Sua, Bahram Alidaee

Abstract: This study tackles the complexities of global supply chains, which are increasingly vulnerable to disruptions caused by port congestion, material shortages, and inflation. To address these challenges, we explore the application of machine learning methods, which excel in predicting and optimizing solutions based on large datasets. Our focus is on enhancing supply chain security through fraud detec… ▽ More This study tackles the complexities of global supply chains, which are increasingly vulnerable to disruptions caused by port congestion, material shortages, and inflation. To address these challenges, we explore the application of machine learning methods, which excel in predicting and optimizing solutions based on large datasets. Our focus is on enhancing supply chain security through fraud detection, maintenance prediction, and material backorder forecasting. We introduce an automated machine learning framework that streamlines data analysis, model construction, and hyperparameter optimization for these tasks. By automating these processes, our framework improves the efficiency and effectiveness of supply chain security measures. Our research identifies key factors that influence machine learning performance, including sampling methods, categorical encoding, feature selection, and hyperparameter optimization. We demonstrate the importance of considering these factors when applying machine learning to supply chain challenges. Traditional mathematical programming models often struggle to cope with the complexity of large-scale supply chain problems. Our study shows that machine learning methods can provide a viable alternative, particularly when dealing with extensive datasets and complex patterns. The automated machine learning framework presented in this study offers a novel approach to supply chain security, contributing to the existing body of knowledge in the field. Its comprehensive automation of machine learning processes makes it a valuable contribution to the domain of supply chain management. △ Less

Submitted 18 June, 2024; originally announced June 2024.

Comments: 22 pages

arXiv:2406.07941 [pdf, other]

Global-in-time energy stability: a powerful analysis tool for the gradient flow problem without maximum principle or Lipschitz assumption

Authors: J. Sun, H. Wang, H. Zhang, X. Qian, S. Song

Abstract: Before proving (unconditional) energy stability for gradient flows, most existing studies either require a strong Lipschitz condition regarding the non-linearity or certain $L^{\infty}$ bounds on the numerical solutions (the maximum principle). However, proving energy stability without such premises is a very challenging task. In this paper, we aim to develop a novel analytical tool, namely global… ▽ More Before proving (unconditional) energy stability for gradient flows, most existing studies either require a strong Lipschitz condition regarding the non-linearity or certain $L^{\infty}$ bounds on the numerical solutions (the maximum principle). However, proving energy stability without such premises is a very challenging task. In this paper, we aim to develop a novel analytical tool, namely global-in-time energy stability, to demonstrate energy dissipation without assuming any strong Lipschitz condition or $L^{\infty}$ boundedness. The fourth-order-in-space Swift-Hohenberg equation is used to elucidate the theoretical results in detail. We also propose a temporal second-order accurate scheme for efficiently solving such a strongly stiff equation. Furthermore, we present the corresponding optimal $L^2$ error estimate and provide several numerical simulations to demonstrate the dynamics. △ Less

Submitted 12 June, 2024; originally announced June 2024.

arXiv:2406.07666 [pdf]

A Unified Framework for Integer Programming Formulation of Graph Matching Problems

Authors: Bahram Alidaee, Haibo Wang, Hugh Sloan

Abstract: Graph theory has been a powerful tool in solving difficult and complex problems arising in all disciplines. In particular, graph matching is a classical problem in pattern analysis with enormous applications. Many graph problems have been formulated as a mathematical program and then solved using exact, heuristic, and/or approximated-guaranteed procedures. On the other hand, graph theory has been… ▽ More Graph theory has been a powerful tool in solving difficult and complex problems arising in all disciplines. In particular, graph matching is a classical problem in pattern analysis with enormous applications. Many graph problems have been formulated as a mathematical program and then solved using exact, heuristic, and/or approximated-guaranteed procedures. On the other hand, graph theory has been a powerful tool in visualizing and understanding complex mathematical programming problems, especially integer programs. Formulating a graph problem as a natural integer program (IP) is often a challenging task. However, an IP formulation of the problem has many advantages. Several researchers have noted the need for natural IP formulation of graph theoretic problems. The present study aims to provide a unified framework for IP formulation of graph-matching problems. Although there are many surveys on graph matching problems, none is concerned with IP formulation. This paper is the first to provide a comprehensive IP formulation for such problems. The framework includes a variety of graph optimization problems in the literature. While these problems have been studied by different research communities, however, the framework presented here helps to bring efforts from different disciplines to tackle such diverse and complex problems. We hope the present study can significantly help to simplify some of the difficult problems arising in practice, especially in pattern analysis. △ Less

Submitted 11 June, 2024; originally announced June 2024.

Comments: 34 pages

arXiv:2406.07382 [pdf]

Fast Adaptive Meta-Heuristic for Large-Scale Facility Location Problem

Authors: Bahram Alidaee, Haibo Wang

Abstract: Facility location problems have been a major research area of interest in the last several decades. In particular, uncapacitated location problems (ULP) have enormous applications. Variations of ULP often appear, especially as large-scale subproblems in more complex combinatorial optimization problems. Although many researchers have studied different versions of ULP (e.g., uncapacitated facility l… ▽ More Facility location problems have been a major research area of interest in the last several decades. In particular, uncapacitated location problems (ULP) have enormous applications. Variations of ULP often appear, especially as large-scale subproblems in more complex combinatorial optimization problems. Although many researchers have studied different versions of ULP (e.g., uncapacitated facility location problem (UCFLP) and p-Median problem), most of these authors have considered small to moderately sized problems. In this paper, we address the ULP and provide a fast adaptive meta-heuristic for large-scale problems. The approach is based on critical event memory tabu search. For the diversification component of the algorithm, we have chosen a procedure based on a sequencing problem commonly used for traveling salesman-type problems. The efficacy of this approach is evaluated across a diverse range of benchmark problems sourced from the Internet, with a comprehensive comparison against four prominent algorithms in the literature. The proposed adaptive critical event tabu search (ACETS) demonstrates remarkable effectiveness for large-scale problems. The algorithm successfully solved all problems optimally within a short computing time. Notably, ACETS discovered three best new solutions for benchmark problems, specifically for Asymmetric 500A-1, Asymmetric 750A-1, and Symmetric 750B-4, underscoring its innovative and robust nature. △ Less

Submitted 17 June, 2024; v1 submitted 11 June, 2024; originally announced June 2024.

Comments: 18 pages

arXiv:2406.06884 [pdf, ps, other]

Szemerédi-Trotter bounds for tubes and applications

Authors: Ciprian Demeter, Hong Wang

Abstract: We prove sharp estimates for incidences involving planar tubes that satisfy packing conditions. We apply them to improve the estimates for the Fourier transform of fractal measures supported on planar curves. We prove sharp estimates for incidences involving planar tubes that satisfy packing conditions. We apply them to improve the estimates for the Fourier transform of fractal measures supported on planar curves. △ Less

Submitted 21 June, 2024; v1 submitted 10 June, 2024; originally announced June 2024.

Comments: Updated bibliography

arXiv:2406.06876 [pdf, other]

Boundedness for maximal operators over hypersurfaces in $\mathbb{R}^3$

Authors: Wenjuan Li, Huiju Wang

Abstract: In this article, we study maximal functions related to hypersurfaces with vanishing Gaussian curvature in $\mathbb{R}^3$. Firstly, we characterize the $L^p\rightarrow L^q$ boundedness of local maximal operators along homogeneous hypersurfaces. Moreover, weighted $L^p$-estimates are obtained for the corresponding global operators. Secondly, for a class of hypersurfaces that lack a homogeneous struc… ▽ More In this article, we study maximal functions related to hypersurfaces with vanishing Gaussian curvature in $\mathbb{R}^3$. Firstly, we characterize the $L^p\rightarrow L^q$ boundedness of local maximal operators along homogeneous hypersurfaces. Moreover, weighted $L^p$-estimates are obtained for the corresponding global operators. Secondly, for a class of hypersurfaces that lack a homogeneous structure and pass through the origin, we attempt to look for other geometric properties instead of height of hypersurfaces to characterize the optimal $L^p$-boundedness of the corresponding global maximal operators. △ Less

Submitted 10 June, 2024; originally announced June 2024.

MSC Class: 42B20; 42B25

arXiv:2406.02941 [pdf, ps, other]

Numerical approximation for variable-exponent fractional diffusion-wave equation

Authors: Xiangcheng Zheng, Hong Wang, Wenlin Qiu

Abstract: This work considers the variable-exponent fractional diffusion-wave equation, which describes, e.g. the propagation of mechanical diffusive waves in viscoelastic media with varying material properties. Rigorous mathematical and numerical analysis for this model is not available in the literature, partly because the variable-exponent Abel kernel may not be positive definite or monotonic. We overcom… ▽ More This work considers the variable-exponent fractional diffusion-wave equation, which describes, e.g. the propagation of mechanical diffusive waves in viscoelastic media with varying material properties. Rigorous mathematical and numerical analysis for this model is not available in the literature, partly because the variable-exponent Abel kernel may not be positive definite or monotonic. We overcome these difficulties to design two numerical schemes and derive their stability and error estimate based on the proved solution regularity, with $α(0)$-order and second-order accuracy in time, respectively. Numerical experiments are presented to substantiate the theoretical findings. △ Less

Submitted 2 July, 2024; v1 submitted 5 June, 2024; originally announced June 2024.

MSC Class: 35R11; 65M12; 65M60

arXiv:2406.01986 [pdf, ps, other]

Congruence properties of the coefficients of the classical modular polynomials

Authors: Haiyang Wang

Abstract: The classical modular polynomials $Φ_\ell(X,Y)$ give plane curve models for the modular curves $X_0(\ell)/\mathbb{Q}$ and have been extensively studied. In this article, we provide closed formulas for $\ell$ nontrivial coefficients of the classical modular polynomials $Φ_\ell(X,Y)$ in terms of the Fourier coefficients of the modular invariant function $j(z)$ for a prime $\ell$. Our interest in the… ▽ More The classical modular polynomials $Φ_\ell(X,Y)$ give plane curve models for the modular curves $X_0(\ell)/\mathbb{Q}$ and have been extensively studied. In this article, we provide closed formulas for $\ell$ nontrivial coefficients of the classical modular polynomials $Φ_\ell(X,Y)$ in terms of the Fourier coefficients of the modular invariant function $j(z)$ for a prime $\ell$. Our interest in the formulas were motivated by our conjectures on congruences modulo powers of the primes $2,3$ and $5$ satisfied by the coefficients of these polynomials. We deduce congruences from these formulas supporting the conjectures. △ Less

Submitted 4 June, 2024; originally announced June 2024.

arXiv:2406.01985 [pdf, ps, other]

On the Kodaira types of elliptic curves with potentially good supersingular reduction

Authors: Haiyang Wang

Abstract: Let $\mathcal{O}_K$ be a Henselian discrete valuation domain with field of fractions $K$. Assume that $\mathcal{O}_K$ has algebraically closed residue field $k$. Let $E/K$ be an elliptic curve with additive reduction. The semi-stable reduction theorem asserts that there exists a minimal extension $L/K$ such that the base change $E_L/L$ has semi-stable reduction. It is natural to wonder whether s… ▽ More Let $\mathcal{O}_K$ be a Henselian discrete valuation domain with field of fractions $K$. Assume that $\mathcal{O}_K$ has algebraically closed residue field $k$. Let $E/K$ be an elliptic curve with additive reduction. The semi-stable reduction theorem asserts that there exists a minimal extension $L/K$ such that the base change $E_L/L$ has semi-stable reduction. It is natural to wonder whether specific properties of the semi-stable reduction and of the extension $L/K$ impose restrictions on what types of Kodaira type the special fiber of $E/K$ may have. In this paper we study the restrictions imposed on the reduction type when the extension $L/K$ is wildly ramified of degree $2$, and the curve $E/K$ has potentially good supersingular reduction. We also analyze the possible reduction types of two isogenous elliptic curves with these properties. △ Less

Submitted 4 June, 2024; originally announced June 2024.

arXiv:2406.00701 [pdf, other]

Profiled Transfer Learning for High Dimensional Linear Model

Authors: Ziqian Lin, Junlong Zhao, Fang Wang, Hansheng Wang

Abstract: We develop here a novel transfer learning methodology called Profiled Transfer Learning (PTL). The method is based on the \textit{approximate-linear} assumption between the source and target parameters. Compared with the commonly assumed \textit{vanishing-difference} assumption and \textit{low-rank} assumption in the literature, the \textit{approximate-linear} assumption is more flexible and less… ▽ More We develop here a novel transfer learning methodology called Profiled Transfer Learning (PTL). The method is based on the \textit{approximate-linear} assumption between the source and target parameters. Compared with the commonly assumed \textit{vanishing-difference} assumption and \textit{low-rank} assumption in the literature, the \textit{approximate-linear} assumption is more flexible and less stringent. Specifically, the PTL estimator is constructed by two major steps. Firstly, we regress the response on the transferred feature, leading to the profiled responses. Subsequently, we learn the regression relationship between profiled responses and the covariates on the target data. The final estimator is then assembled based on the \textit{approximate-linear} relationship. To theoretically support the PTL estimator, we derive the non-asymptotic upper bound and minimax lower bound. We find that the PTL estimator is minimax optimal under appropriate regularity conditions. Extensive simulation studies are presented to demonstrate the finite sample performance of the new method. A real data example about sentence prediction is also presented with very encouraging results. △ Less

Submitted 5 June, 2024; v1 submitted 2 June, 2024; originally announced June 2024.

arXiv:2405.19499 [pdf, other]

Momentum for the Win: Collaborative Federated Reinforcement Learning across Heterogeneous Environments

Authors: Han Wang, Sihong He, Zhili Zhang, Fei Miao, James Anderson

Abstract: We explore a Federated Reinforcement Learning (FRL) problem where $N$ agents collaboratively learn a common policy without sharing their trajectory data. To date, existing FRL work has primarily focused on agents operating in the same or ``similar" environments. In contrast, our problem setup allows for arbitrarily large levels of environment heterogeneity. To obtain the optimal policy which maxim… ▽ More We explore a Federated Reinforcement Learning (FRL) problem where $N$ agents collaboratively learn a common policy without sharing their trajectory data. To date, existing FRL work has primarily focused on agents operating in the same or ``similar" environments. In contrast, our problem setup allows for arbitrarily large levels of environment heterogeneity. To obtain the optimal policy which maximizes the average performance across all potentially completely different environments, we propose two algorithms: FedSVRPG-M and FedHAPG-M. In contrast to existing results, we demonstrate that both FedSVRPG-M and FedHAPG-M, both of which leverage momentum mechanisms, can exactly converge to a stationary point of the average performance function, regardless of the magnitude of environment heterogeneity. Furthermore, by incorporating the benefits of variance-reduction techniques or Hessian approximation, both algorithms achieve state-of-the-art convergence results, characterized by a sample complexity of $\mathcal{O}\left(ε^{-\frac{3}{2}}/N\right)$. Notably, our algorithms enjoy linear convergence speedups with respect to the number of agents, highlighting the benefit of collaboration among agents in finding a common policy. △ Less

Submitted 29 May, 2024; originally announced May 2024.

Journal ref: Proceedings of the 41st International Conference on Machine Learning, 2024 Learning

arXiv:2405.17527 [pdf, other]

Unisolver: PDE-Conditional Transformers Are Universal PDE Solvers

Authors: Hang Zhou, Yuezhou Ma, Haixu Wu, Haowen Wang, Mingsheng Long

Abstract: Deep models have recently emerged as a promising tool to solve partial differential equations (PDEs), known as neural PDE solvers. While neural solvers trained from either simulation data or physics-informed loss can solve the PDEs reasonably well, they are mainly restricted to a specific set of PDEs, e.g. a certain equation or a finite set of coefficients. This bottleneck limits the generalizabil… ▽ More Deep models have recently emerged as a promising tool to solve partial differential equations (PDEs), known as neural PDE solvers. While neural solvers trained from either simulation data or physics-informed loss can solve the PDEs reasonably well, they are mainly restricted to a specific set of PDEs, e.g. a certain equation or a finite set of coefficients. This bottleneck limits the generalizability of neural solvers, which is widely recognized as its major advantage over numerical solvers. In this paper, we present the Universal PDE solver (Unisolver) capable of solving a wide scope of PDEs by leveraging a Transformer pre-trained on diverse data and conditioned on diverse PDEs. Instead of simply scaling up data and parameters, Unisolver stems from the theoretical analysis of the PDE-solving process. Our key finding is that a PDE solution is fundamentally under the control of a series of PDE components, e.g. equation symbols, coefficients, and initial and boundary conditions. Inspired by the mathematical structure of PDEs, we define a complete set of PDE components and correspondingly embed them as domain-wise (e.g. equation symbols) and point-wise (e.g. boundaries) conditions for Transformer PDE solvers. Integrating physical insights with recent Transformer advances, Unisolver achieves consistent state-of-the-art results on three challenging large-scale benchmarks, showing impressive gains and endowing favorable generalizability and scalability. △ Less

Submitted 1 June, 2024; v1 submitted 27 May, 2024; originally announced May 2024.

arXiv:2405.14710 [pdf, ps, other]

Sums of four polygonal numbers: precise formulas

Authors: Jialin Li, Haowu Wang

Abstract: In this paper we give unified formulas for the numbers of representations of positive integers as sums of four generalized $m$-gonal numbers, and as restricted sums of four squares under a linear condition, respectively. These formulas are given as $\mathbb{Z}$-linear combinations of Hurwitz class numbers. As applications, we prove several Zhi-Wei Sun's conjectures. As by-products, we obtain formu… ▽ More In this paper we give unified formulas for the numbers of representations of positive integers as sums of four generalized $m$-gonal numbers, and as restricted sums of four squares under a linear condition, respectively. These formulas are given as $\mathbb{Z}$-linear combinations of Hurwitz class numbers. As applications, we prove several Zhi-Wei Sun's conjectures. As by-products, we obtain formulas for expressing the Fourier coefficients of $\vartheta(τ,z)^4$, $η(τ)^{12}$, $η(τ)^4$ and $η(τ)^8η(2τ)^8$ in terms of the Hurwitz class numbers, respectively. The proof is based on the theory of Jacobi forms. △ Less

Submitted 23 May, 2024; originally announced May 2024.

Comments: 12 pages, comments welcome!

MSC Class: 11E25; 11F50; 11F30; 11D85

arXiv:2405.08599 [pdf, other]

The distributed biased min-consensus protocol revisited: pre-specified finite time control strategies and small-gain based analysis

Authors: Yuanqiu Mo, He Wang

Abstract: Unlike the classical distributed consensus protocols enabling the group of agents as a whole to reach an agreement regarding a certain quantity of interest in a distributed fashion, the distributed biased min-consensus protocol (DBMC) has been proven to generate advanced complexity pertaining to solving the shortest path problem. As such a protocol is commonly incorporated as the first step of a h… ▽ More Unlike the classical distributed consensus protocols enabling the group of agents as a whole to reach an agreement regarding a certain quantity of interest in a distributed fashion, the distributed biased min-consensus protocol (DBMC) has been proven to generate advanced complexity pertaining to solving the shortest path problem. As such a protocol is commonly incorporated as the first step of a hierarchical architecture in real applications, e.g., robots path planning, management of dispersed computing services, an impedance limiting the application potential of DBMC lies in, the lack of results regarding to its convergence within a user-assigned time. In this paper, we first propose two control strategies ensuring the state error of DBMC decrease exactly to zero or a desired level manipulated by the user, respectively. To compensate the high feedback gains incurred by these two control strategies, this paper further investigates the nominal DBMC itself. By leveraging small gain based stability tools, this paper also proves the global exponential input-to-state stability of DBMC, outperforming its current stability results. Simulations have been provided to validate the efficacy of our theoretical result. △ Less

Submitted 14 May, 2024; originally announced May 2024.

arXiv:2405.05887 [pdf, ps, other]

Convergence Rates of Online Critic Value Function Approximation in Native Spaces

Authors: Shengyuan Niu, Ali Bouland, Haoran Wang, Filippos Fotiadis, Andrew Kurdila, Andrea L'Afflitto, Sai Tej Paruchuri, Kyriakos G. Vamvoudakis

Abstract: In this paper, the evolution equation that defines the online critic for the approximation of the optimal value function is cast in a general class of reproducing kernel Hilbert spaces (RKHSs). Exploiting some core tools of RKHS theory, this formulation allows deriving explicit bounds on the performance of the critic in terms of the kernel and definition of the RKHS, the number of basis functions,… ▽ More In this paper, the evolution equation that defines the online critic for the approximation of the optimal value function is cast in a general class of reproducing kernel Hilbert spaces (RKHSs). Exploiting some core tools of RKHS theory, this formulation allows deriving explicit bounds on the performance of the critic in terms of the kernel and definition of the RKHS, the number of basis functions, and the location of centers used to define scattered bases. The performance of the critic is precisely measured in terms of the power function of the scattered basis used in approximations, and it can be used either in an a priori evaluation of potential bases or in an a posteriori assessments of value function error for basis enrichment or pruning. The most concise bounds in the paper describe explicitly how the critic performance depends on the placement of centers, as measured by their fill distance in a subset that contains the trajectory of the critic. △ Less

Submitted 28 May, 2024; v1 submitted 9 May, 2024; originally announced May 2024.

arXiv:2405.03837 [pdf, ps, other]

Higher Kazhdan projections and delocalised $\ell^ 2$-Betti numbers

Authors: Sanaz Pooya, Hang Wang

Abstract: We provide an explicit description of the K-classes of higher Kazhdan projections in degrees greater than 0 for specific free product groups and Cartesian product groups. Employing this description, we obtain new calculations of Lott's delocalised $\ell^2$-Betti numbers. Notably, we establish the first non-vanishing results for infinite groups. We provide an explicit description of the K-classes of higher Kazhdan projections in degrees greater than 0 for specific free product groups and Cartesian product groups. Employing this description, we obtain new calculations of Lott's delocalised $\ell^2$-Betti numbers. Notably, we establish the first non-vanishing results for infinite groups. △ Less

Submitted 6 May, 2024; originally announced May 2024.

Comments: 19 pages

MSC Class: 46L80; 19D55; 20F65

arXiv:2405.02100 [pdf, other]

Data-Driven Stable Neural Feedback Loop Design

Authors: Zuxun Xiong, Han Wang, Liqun Zhao, Antonis Papachristodoulou

Abstract: This paper proposes a data-driven approach to design a feedforward Neural Network (NN) controller with a stability guarantee for systems with unknown dynamics. We first introduce data-driven representations of stability conditions for Neural Feedback Loops (NFLs) with linear plants. These conditions are then formulated into a semidefinite program (SDP). Subsequently, this SDP constraint is integra… ▽ More This paper proposes a data-driven approach to design a feedforward Neural Network (NN) controller with a stability guarantee for systems with unknown dynamics. We first introduce data-driven representations of stability conditions for Neural Feedback Loops (NFLs) with linear plants. These conditions are then formulated into a semidefinite program (SDP). Subsequently, this SDP constraint is integrated into the NN training process resulting in a stable NN controller. We propose an iterative algorithm to solve this problem efficiently. Finally, we illustrate the effectiveness of the proposed method and its superiority compared to model-based methods via numerical examples. △ Less

Submitted 3 May, 2024; originally announced May 2024.

arXiv:2405.01194 [pdf, ps, other]

The Lp Polar bodies of shadow system and related inequalities

Authors: Lujun Guo, Hanxiao Wang

Abstract: The $L_p$ versions of the support function and polar body are introduced by Berndtsson, Mastrantonis and Rubinstein in \cite{Berndtsson-Mastrantonis-Rubinstein-2023} recently. In this paper, we prove that the $L_p$-support function of the shadow system $K_t$ introduced by Rogers and Shephard in \cite{rogers-1958-02,shephard-1964} is convex and the volume of the section of $L_p$ polar bodies of… ▽ More The $L_p$ versions of the support function and polar body are introduced by Berndtsson, Mastrantonis and Rubinstein in \cite{Berndtsson-Mastrantonis-Rubinstein-2023} recently. In this paper, we prove that the $L_p$-support function of the shadow system $K_t$ introduced by Rogers and Shephard in \cite{rogers-1958-02,shephard-1964} is convex and the volume of the section of $L_p$ polar bodies of $K_t$ is $\frac{1}{n}$-concave with respect to parameter $t$, and obtain some related inequalities. Finally, we present the reverse Rogers-Shephard type inequality for $L_p$-polar bodies. △ Less

Submitted 2 May, 2024; originally announced May 2024.

MSC Class: 52A40; 52A20; 46G12

arXiv:2404.18838 [pdf, other]

Accurate adaptive deep learning method for solving elliptic problems

Authors: Jingyong Ying, Yaqi Xie, Jiao Li, Hongqiao Wang

Abstract: Deep learning method is of great importance in solving partial differential equations. In this paper, inspired by the failure-informed idea proposed by Gao et.al. (SIAM Journal on Scientific Computing 45(4)(2023)) and as an improvement, a new accurate adaptive deep learning method is proposed for solving elliptic problems, including the interface problems and the convection-dominated problems. Bas… ▽ More Deep learning method is of great importance in solving partial differential equations. In this paper, inspired by the failure-informed idea proposed by Gao et.al. (SIAM Journal on Scientific Computing 45(4)(2023)) and as an improvement, a new accurate adaptive deep learning method is proposed for solving elliptic problems, including the interface problems and the convection-dominated problems. Based on the failure probability framework, the piece-wise uniform distribution is used to approximate the optimal proposal distribution and an kernel-based method is proposed for efficient sampling. Together with the improved Levenberg-Marquardt optimization method, the proposed adaptive deep learning method shows great potential in improving solution accuracy. Numerical tests on the elliptic problems without interface conditions, on the elliptic interface problem, and on the convection-dominated problems demonstrate the effectiveness of the proposed method, as it reduces the relative errors by a factor varying from $10^2$ to $10^4$ for different cases. △ Less

Submitted 29 April, 2024; originally announced April 2024.

arXiv:2404.14937 [pdf, ps, other]

The inversion number of dijoins and blow-up digraphs

Authors: Haozhe Wang, Yuxuan Yang, Mei Lu

Abstract: For an oriented graph $D$, the $inversion$ of $X \subseteq V(D)$ in $D$ is the digraph obtained from $D$ by reversing the direction of all arcs with both ends in $X$. The inversion number of $D$, denoted by $inv(D)$, is the minimum number of inversions needed to transform $D$ into an acyclic digraph. In this paper, we first show that $inv (\overrightarrow{C_3} \Rightarrow D)= inv(D) +1$ for any or… ▽ More For an oriented graph $D$, the $inversion$ of $X \subseteq V(D)$ in $D$ is the digraph obtained from $D$ by reversing the direction of all arcs with both ends in $X$. The inversion number of $D$, denoted by $inv(D)$, is the minimum number of inversions needed to transform $D$ into an acyclic digraph. In this paper, we first show that $inv (\overrightarrow{C_3} \Rightarrow D)= inv(D) +1$ for any oriented graph $\textit{D}$ with even inversion number $inv(D)$, where the dijoin $\overrightarrow{C_3} \Rightarrow D$ is the oriented graph obtained from the disjoint union of $\overrightarrow{C_3}$ and $D$ by adding all arcs from $\overrightarrow{C_3}$ to $D$. Thus we disprove the conjecture of Aubian el at. \cite{2212.09188} and the conjecture of Alon el at. \cite{2212.11969}. We also study the blow-up graph which is an oriented graph obtained from a tournament by replacing all vertices into oriented graphs. We construct a tournament $T$ with order $n$ and $inv(T)=\frac{n}{3}+1$ using blow-up graphs. △ Less

Submitted 24 April, 2024; v1 submitted 23 April, 2024; originally announced April 2024.

arXiv:2404.13622 [pdf, ps, other]

On the CR Nirenberg problem: density and multiplicity of solutions

Authors: Zhongwei Tang, Heming Wang, Bingwei Zhang

Abstract: We prove some results on the density and multiplicity of positive solutions to the prescribed Webster scalar curvature problem on the $(2n+1)$-dimensional standard unit CR sphere $(\mathbb{S} ^{2n+1},θ_0)$. Specifically, we construct arbitrarily many multi-bump solutions via the variational gluing method. In particular, we show the Webster scalar curvature functions of contact forms conformal to… ▽ More We prove some results on the density and multiplicity of positive solutions to the prescribed Webster scalar curvature problem on the $(2n+1)$-dimensional standard unit CR sphere $(\mathbb{S} ^{2n+1},θ_0)$. Specifically, we construct arbitrarily many multi-bump solutions via the variational gluing method. In particular, we show the Webster scalar curvature functions of contact forms conformal to $θ_0$ are $C^{0}$-dense among bounded functions which are positive somewhere. Existence results of infinitely many positive solutions to the related equation $-Δ_{\mathbb{H}} u=R(ξ) u^{(n+2) /n}$ on the Heisenberg group $\Hn $ with $R(ξ)$ being asymptotically periodic with respect to left translation are also obtained. Our proofs make use of a refined analysis of bubbling behavior, gradient flow, Pohozaev identity, as well as blow up arguments. △ Less

Submitted 21 April, 2024; originally announced April 2024.

arXiv:2404.12234 [pdf, other]

Quantitative homogenization and hydrodynamic limit of non-gradient exclusion process

Authors: Tadahisa Funaki, Chenlin Gu, Han Wang

Abstract: For the non-gradient exclusion process, we prove the quantitative homogenization in the approximation of the diffusion matrix and the conductivity by local functions. The proof relies on the renormalization approach developed by Armstrong, Kuusi, Mourrat, and Smart, while the new challenge here is the hard core constraint of particle number on every site. Therefore, a coarse-grained method is prop… ▽ More For the non-gradient exclusion process, we prove the quantitative homogenization in the approximation of the diffusion matrix and the conductivity by local functions. The proof relies on the renormalization approach developed by Armstrong, Kuusi, Mourrat, and Smart, while the new challenge here is the hard core constraint of particle number on every site. Therefore, a coarse-grained method is proposed to lift the configuration to a larger space without exclusion, and a gradient coupling between two systems is applied to capture the spatial cancellation. We then strengthen the convergence rate to be uniform concerning the density and integrate it into the work by Funaki, Uchiyama, and Yau [IMA Vol. Math. Appl., 77 (1996), pp. 1-40.] to yield a quantitative hydrodynamic limit. Our new approach avoids showing the characterization of closed forms and provides stronger results. The extension is discussed for the model in the presence of disorder on the bonds. △ Less

Submitted 23 May, 2024; v1 submitted 18 April, 2024; originally announced April 2024.

Comments: 83 pages, 3 figures

MSC Class: 82C22; 35B27; 60K35

arXiv:2404.10951 [pdf, ps, other]

$L^p$ weighted Fourier restriction estimates

Authors: Xiumin Du, Jianhui Li, Hong Wang, Ruixiang Zhang

Abstract: We obtain some sharp $L^p$ weighted Fourier restriction estimates of the form $\|Ef\|_{L^p(B^{n+1}(0,R),Hdx)} \lessapprox R^β\|f\|_2$, where $E$ is the Fourier extension operator over the truncated paraboloid, and $H$ is a weight function on $\mathbb R^{n+1}$ which is $n$-dimensional up to scale $\sqrt R$. We obtain some sharp $L^p$ weighted Fourier restriction estimates of the form $\|Ef\|_{L^p(B^{n+1}(0,R),Hdx)} \lessapprox R^β\|f\|_2$, where $E$ is the Fourier extension operator over the truncated paraboloid, and $H$ is a weight function on $\mathbb R^{n+1}$ which is $n$-dimensional up to scale $\sqrt R$. △ Less

Submitted 16 April, 2024; originally announced April 2024.

Comments: 15 pages, 2 figures

arXiv:2404.09061 [pdf, other]

Asynchronous Heterogeneous Linear Quadratic Regulator Design

Authors: Leonardo F. Toso, Han Wang, James Anderson

Abstract: We address the problem of designing an LQR controller in a distributed setting, where M similar but not identical systems share their locally computed policy gradient (PG) estimates with a server that aggregates the estimates and computes a controller that, on average, performs well on all systems. Learning in a distributed setting has the potential to offer statistical benefits - multiple dataset… ▽ More We address the problem of designing an LQR controller in a distributed setting, where M similar but not identical systems share their locally computed policy gradient (PG) estimates with a server that aggregates the estimates and computes a controller that, on average, performs well on all systems. Learning in a distributed setting has the potential to offer statistical benefits - multiple datasets can be leveraged simultaneously to produce more accurate policy gradient estimates. However, the interplay of heterogeneous trajectory data and varying levels of local computational power introduce bias to the aggregated PG descent direction, and prevents us from fully exploiting the parallelism in the distributed computation. The latter stems from synchronous aggregation, where straggler systems negatively impact the runtime. To address this, we propose an asynchronous policy gradient algorithm for LQR control design. By carefully controlling the "staleness" in the asynchronous aggregation, we show that the designed controller converges to each system's $ε$-near optimal controller up to a heterogeneity bias. Furthermore, we prove that our asynchronous approach obtains exact local convergence at a sub-linear rate. △ Less

Submitted 13 April, 2024; originally announced April 2024.

Comments: Leonardo F. Toso and Han Wang contributed equally to this work

arXiv:2404.08893 [pdf, other]

Early detection of disease outbreaks and non-outbreaks using incidence data

Authors: Shan Gao, Amit K. Chakraborty, Russell Greiner, Mark A. Lewis, Hao Wang

Abstract: Forecasting the occurrence and absence of novel disease outbreaks is essential for disease management. Here, we develop a general model, with no real-world training data, that accurately forecasts outbreaks and non-outbreaks. We propose a novel framework, using a feature-based time series classification method to forecast outbreaks and non-outbreaks. We tested our methods on synthetic data from a… ▽ More Forecasting the occurrence and absence of novel disease outbreaks is essential for disease management. Here, we develop a general model, with no real-world training data, that accurately forecasts outbreaks and non-outbreaks. We propose a novel framework, using a feature-based time series classification method to forecast outbreaks and non-outbreaks. We tested our methods on synthetic data from a Susceptible-Infected-Recovered model for slowly changing, noisy disease dynamics. Outbreak sequences give a transcritical bifurcation within a specified future time window, whereas non-outbreak (null bifurcation) sequences do not. We identified incipient differences in time series of infectives leading to future outbreaks and non-outbreaks. These differences are reflected in 22 statistical features and 5 early warning signal indicators. Classifier performance, given by the area under the receiver-operating curve, ranged from 0.99 for large expanding windows of training data to 0.7 for small rolling windows. Real-world performances of classifiers were tested on two empirical datasets, COVID-19 data from Singapore and SARS data from Hong Kong, with two classifiers exhibiting high accuracy. In summary, we showed that there are statistical features that distinguish outbreak and non-outbreak sequences long before outbreaks occur. We could detect these differences in synthetic and real-world data sets, well before potential outbreaks occur. △ Less

Submitted 12 April, 2024; originally announced April 2024.

arXiv:2404.03487 [pdf, other]

Explicit Witt basis over the tensor product of Clifford algebras and octonions

Authors: Yong Li, Guangbin Ren, Haiyan Wang

Abstract: In this article, we investigate how the Witt basis serves as a link between real and complex variables in higher-dimensional spaces. Our focus is on the detailed construction of the Witt basis within the tensor product space combining Clifford algebra and multiple octonionic spaces. This construction effectively introduces complex coordinates. The technique is based on a specific subgroup of octon… ▽ More In this article, we investigate how the Witt basis serves as a link between real and complex variables in higher-dimensional spaces. Our focus is on the detailed construction of the Witt basis within the tensor product space combining Clifford algebra and multiple octonionic spaces. This construction effectively introduces complex coordinates. The technique is based on a specific subgroup of octonionic automorphisms, distinguished by binary codes. This method allows us to perform a Hermitian analysis of the complex structures within the tensor product space. △ Less

Submitted 4 April, 2024; originally announced April 2024.

Comments: 13pages

MSC Class: Primary 30G35; Secondary 17A35

arXiv:2404.01245 [pdf, other]

A Statistical Framework of Watermarks for Large Language Models: Pivot, Detection Efficiency and Optimal Rules

Authors: Xiang Li, Feng Ruan, Huiyuan Wang, Qi Long, Weijie J. Su

Abstract: Since ChatGPT was introduced in November 2022, embedding (nearly) unnoticeable statistical signals into text generated by large language models (LLMs), also known as watermarking, has been used as a principled approach to provable detection of LLM-generated text from its human-written counterpart. In this paper, we introduce a general and flexible framework for reasoning about the statistical effi… ▽ More Since ChatGPT was introduced in November 2022, embedding (nearly) unnoticeable statistical signals into text generated by large language models (LLMs), also known as watermarking, has been used as a principled approach to provable detection of LLM-generated text from its human-written counterpart. In this paper, we introduce a general and flexible framework for reasoning about the statistical efficiency of watermarks and designing powerful detection rules. Inspired by the hypothesis testing formulation of watermark detection, our framework starts by selecting a pivotal statistic of the text and a secret key -- provided by the LLM to the verifier -- to enable controlling the false positive rate (the error of mistakenly detecting human-written text as LLM-generated). Next, this framework allows one to evaluate the power of watermark detection rules by obtaining a closed-form expression of the asymptotic false negative rate (the error of incorrectly classifying LLM-generated text as human-written). Our framework further reduces the problem of determining the optimal detection rule to solving a minimax optimization program. We apply this framework to two representative watermarks -- one of which has been internally implemented at OpenAI -- and obtain several findings that can be instrumental in guiding the practice of implementing watermarks. In particular, we derive optimal detection rules for these watermarks under our framework. These theoretically derived detection rules are demonstrated to be competitive and sometimes enjoy a higher power than existing detection approaches through numerical experiments. △ Less

Submitted 1 April, 2024; originally announced April 2024.

arXiv:2403.19328 [pdf, ps, other]

Complex generalized Gauss-Radau quadrature rules for Hankel transforms of integer order

Authors: Haiyong Wang, Menghan Wu

Abstract: Complex Gaussian quadrature rules for oscillatory integral transforms have the advantage that they can achieve optimal asymptotic order. However, their existence for Hankel transform can only be guaranteed when the order of the transform belongs to $[0,1/2]$. In this paper we consider the construction of generalized Gauss-Radau quadrature rules for Hankel transform. We show that, if adding certain… ▽ More Complex Gaussian quadrature rules for oscillatory integral transforms have the advantage that they can achieve optimal asymptotic order. However, their existence for Hankel transform can only be guaranteed when the order of the transform belongs to $[0,1/2]$. In this paper we consider the construction of generalized Gauss-Radau quadrature rules for Hankel transform. We show that, if adding certain value and derivative information at the left endpoint, then complex generalized Gauss-Radau quadrature rules for Hankel transform of integer order can be constructed with theoretical guarantees. Orthogonal polynomials that are closely related to such quadrature rules are investigated and their existence for even degrees is proved. Numerical experiments are presented to confirm our findings. △ Less

Submitted 28 March, 2024; originally announced March 2024.

Comments: 24 pages

MSC Class: 65R10; 65D32

arXiv:2403.18320 [pdf, other]

Online Prediction for Streaming Tensor Time Series

Authors: Zhenting Luan, Haoning Wang, Liping Zhang, Shansuo Liang, Wei Han

Abstract: Real-time prediction plays a vital role in various control systems, such as traffic congestion control and wireless channel resource allocation. In these scenarios, the predictor usually needs to track the evolution of the latent statistical patterns in the modern high-dimensional streaming time series continuously and quickly, which presents new challenges for traditional prediction methods. This… ▽ More Real-time prediction plays a vital role in various control systems, such as traffic congestion control and wireless channel resource allocation. In these scenarios, the predictor usually needs to track the evolution of the latent statistical patterns in the modern high-dimensional streaming time series continuously and quickly, which presents new challenges for traditional prediction methods. This paper proposes a novel algorithm based on tensor factorization to predict streaming tensor time series online. The proposed algorithm updates the predictor in a low-complexity online manner to adapt to the time-evolving data. Additionally, an automatically adaptive version of the algorithm is presented to mitigate the negative impact of stale data. Simulation results demonstrate that our proposed methods achieve prediction accuracy similar to that of conventional offline tensor prediction methods, while being much faster than them during long-term online prediction. Therefore, our proposed algorithm provides an effective and efficient solution for the online prediction of streaming tensor time series. △ Less

Submitted 27 March, 2024; originally announced March 2024.

arXiv:2403.14924 [pdf, ps, other]

Anderson acceleration of derivative-free projection methods for constrained monotone nonlinear equations

Authors: Jiachen Jin, Hongxia Wang, Kangkang Deng

Abstract: The derivative-free projection method (DFPM) is an efficient algorithm for solving monotone nonlinear equations. As problems grow larger, there is a strong demand for speeding up the convergence of DFPM. This paper considers the application of Anderson acceleration (AA) to DFPM for constrained monotone nonlinear equations. By employing a nonstationary relaxation parameter and interleaving with sli… ▽ More The derivative-free projection method (DFPM) is an efficient algorithm for solving monotone nonlinear equations. As problems grow larger, there is a strong demand for speeding up the convergence of DFPM. This paper considers the application of Anderson acceleration (AA) to DFPM for constrained monotone nonlinear equations. By employing a nonstationary relaxation parameter and interleaving with slight modifications in each iteration, a globally convergent variant of AA for DFPM named as AA-DFPM is proposed. Further, the linear convergence rate is proved under some mild assumptions. Experiments on both mathematical examples and a real-world application show encouraging results of AA-DFPM and confirm the suitability of AA for accelerating DFPM in solving optimization problems. △ Less

Submitted 21 March, 2024; originally announced March 2024.

arXiv:2403.14311 [pdf, ps, other]

On Intermediate Exceptional Series

Authors: Kimyeong Lee, Kaiwen Sun, Haowu Wang

Abstract: The Freudenthal--Tits magic square $\mathfrak{m}(\mathbb{A}_1,\mathbb{A}_2)$ for $\mathbb{A}=\mathbb{R},\mathbb{C},\mathbb{H},\mathbb{O}$ of semi-simple Lie algebras can be extended by including the sextonions $\mathbb{S}$. A series of non-reductive Lie algebras naturally appear in the new row associated with the sextonions, which we will call the \textit{intermediate exceptional series}, with the… ▽ More The Freudenthal--Tits magic square $\mathfrak{m}(\mathbb{A}_1,\mathbb{A}_2)$ for $\mathbb{A}=\mathbb{R},\mathbb{C},\mathbb{H},\mathbb{O}$ of semi-simple Lie algebras can be extended by including the sextonions $\mathbb{S}$. A series of non-reductive Lie algebras naturally appear in the new row associated with the sextonions, which we will call the \textit{intermediate exceptional series}, with the largest one as the intermediate Lie algebra $E_{7+1/2}$ constructed by Landsberg--Manivel. We study various aspects of the intermediate vertex operator (super)algebras associated with the intermediate exceptional series, including rationality, coset constructions, irreducible modules, (super)characters and modular linear differential equations. For all $\mathfrak{g}_I$ belonging to the intermediate exceptional series, the intermediate VOA $L_1(\mathfrak{g}_I)$ has characters of irreducible modules coinciding with those of the simple rational $C_2$-cofinite $W$-algebra $W_{-h^\vee/6}(\mathfrak{g},f_θ)$ studied by Kawasetsu, with $\mathfrak{g} $ belonging to the Cvitanović--Deligne exceptional series. We propose some new intermediate VOA $L_k(\mathfrak{g}_I)$ with integer level $k$ and investigate their properties. For example, for the intermediate Lie algebra $D_{6+1/2}$ between $D_6$ and $E_7$ in the subexceptional series and also in Vogel's projective plane, we find that the intermediate VOA $L_2(D_{6+1/2})$ has a simple current extension to a SVOA with four irreducible Neveu--Schwarz modules. We also provide some (super) coset constructions such as $L_2(E_7)/L_2(D_{6+1/2})$ and $L_1(D_{6+1/2})^{\otimes2}\!/L_2(D_{6+1/2})$. In the end, we find that the theta blocks associated with the intermediate exceptional series produce some new holomorphic Jacobi forms of critical weight and lattice index. △ Less

Submitted 21 March, 2024; originally announced March 2024.

Comments: 46 pages

Report number: KIAS-P24006, UUITP-06/24

arXiv:2403.14079 [pdf, ps, other]

Addressing complex boundary conditions of miscible flow and transport in two and three dimensions with application to optimal control

Authors: Yiqun Li, Hong Wang, Xiangcheng Zheng

Abstract: We investigate complex boundary conditions of the miscible displacement system in two and three space dimensions with the commonly-used Bear-Scheidegger diffusion-dispersion tensor, which describes, e.g., the porous medium flow processes in petroleum reservoir simulation or groundwater contaminant transport. Specifically, we incorporate the no-flux boundary condition for the Darcy velocity to prov… ▽ More We investigate complex boundary conditions of the miscible displacement system in two and three space dimensions with the commonly-used Bear-Scheidegger diffusion-dispersion tensor, which describes, e.g., the porous medium flow processes in petroleum reservoir simulation or groundwater contaminant transport. Specifically, we incorporate the no-flux boundary condition for the Darcy velocity to prove that the general no-flux boundary condition for the transport equation is equivalent to the normal derivative boundary condition of the concentration, based on which we further prove several complex boundary conditions by the Bear-Scheidegger tensor and its derivative. The derived boundary conditions not only provide new insights and distinct properties of the Bear-Scheidegger diffusion-dispersion tensor, but accommodate the coupling and the nonlinearity of the miscible displacement system and the Bear-Scheidegger tensor in deriving the first-order optimality condition of the corresponding optimal control problem for practical application. △ Less

Submitted 20 March, 2024; originally announced March 2024.

MSC Class: 35K20; 49J20; 49K20; 76S05

arXiv:2403.13255 [pdf, other]

Network-Aware Value Stacking of Community Battery via Asynchronous Distributed Optimization

Authors: Canchen Jiang, Hao Wang

Abstract: Community battery systems have been widely deployed to provide services to the grid. Unlike a single battery storage system in the community, coordinating multiple community batteries can further unlock their value, enhancing the viability of community battery solutions. However, the centralized control of community batteries relies on the full information of the system, which is less practical an… ▽ More Community battery systems have been widely deployed to provide services to the grid. Unlike a single battery storage system in the community, coordinating multiple community batteries can further unlock their value, enhancing the viability of community battery solutions. However, the centralized control of community batteries relies on the full information of the system, which is less practical and may even lead to privacy leakage. In this paper, we formulate a value-stacking optimization problem for community batteries to interact with local solar, buildings, and the grid, within distribution network constraints. We then propose a distributed algorithm using asynchronous distributed alternate direction method of multipliers (ADMM) to solve the problem. Our algorithm is robust to communication latency between community batteries and the grid while preserving the operational privacy. The simulation results demonstrate the convergence of our proposed asynchronous distributed ADMM algorithm. We also evaluate the electricity cost and the contribution of each value stream in the value-stacking problem for community batteries using real-world data. △ Less

Submitted 19 March, 2024; originally announced March 2024.

Comments: 2024 IEEE Power & Energy Society General Meeting (PESGM)

arXiv:2403.13236 [pdf, other]

Safety-Aware Reinforcement Learning for Electric Vehicle Charging Station Management in Distribution Network

Authors: Jiarong Fan, Ariel Liebman, Hao Wang

Abstract: The increasing integration of electric vehicles (EVs) into the grid can pose a significant risk to the distribution system operation in the absence of coordination. In response to the need for effective coordination of EVs within the distribution network, this paper presents a safety-aware reinforcement learning (RL) algorithm designed to manage EV charging stations while ensuring the satisfaction… ▽ More The increasing integration of electric vehicles (EVs) into the grid can pose a significant risk to the distribution system operation in the absence of coordination. In response to the need for effective coordination of EVs within the distribution network, this paper presents a safety-aware reinforcement learning (RL) algorithm designed to manage EV charging stations while ensuring the satisfaction of system constraints. Unlike existing methods, our proposed algorithm does not rely on explicit penalties for constraint violations, eliminating the need for penalty coefficient tuning. Furthermore, managing EV charging stations is further complicated by multiple uncertainties, notably the variability in solar energy generation and energy prices. To address this challenge, we develop an off-policy RL algorithm to efficiently utilize data to learn patterns in such uncertain environments. Our algorithm also incorporates a maximum entropy framework to enhance the RL algorithm's exploratory process, preventing convergence to local optimal solutions. Simulation results demonstrate that our algorithm outperforms traditional RL algorithms in managing EV charging in the distribution network. △ Less

Submitted 19 March, 2024; originally announced March 2024.

Comments: 2024 IEEE Power & Energy Society General Meeting (PESGM)

arXiv:2403.12946 [pdf, ps, other]

Sample Complexity of Offline Distributionally Robust Linear Markov Decision Processes

Authors: He Wang, Laixi Shi, Yuejie Chi

Abstract: In offline reinforcement learning (RL), the absence of active exploration calls for attention on the model robustness to tackle the sim-to-real gap, where the discrepancy between the simulated and deployed environments can significantly undermine the performance of the learned policy. To endow the learned policy with robustness in a sample-efficient manner in the presence of high-dimensional state… ▽ More In offline reinforcement learning (RL), the absence of active exploration calls for attention on the model robustness to tackle the sim-to-real gap, where the discrepancy between the simulated and deployed environments can significantly undermine the performance of the learned policy. To endow the learned policy with robustness in a sample-efficient manner in the presence of high-dimensional state-action space, this paper considers the sample complexity of distributionally robust linear Markov decision processes (MDPs) with an uncertainty set characterized by the total variation distance using offline data. We develop a pessimistic model-based algorithm and establish its sample complexity bound under minimal data coverage assumptions, which outperforms prior art by at least $\widetilde{O}(d)$, where $d$ is the feature dimension. We further improve the performance guarantee of the proposed algorithm by incorporating a carefully-designed variance estimator. △ Less

Submitted 26 June, 2024; v1 submitted 19 March, 2024; originally announced March 2024.

Comments: accepted by Reinforcement Learning Conference (RLC)

arXiv:2403.11763 [pdf, other]

Convex Co-Design of Control Barrier Function and Safe Feedback Controller Under Input Constraints

Authors: Han Wang, Kostas Margellos, Antonis Papachristodoulou, Claudio De Persis

Abstract: We study the problem of co-designing control barrier functions (CBF) and linear state feedback controllers for continuous-time linear systems. We achieve this by means of a single semi-definite optimization program. Our formulation can handle mixed-relative degree problems without requiring an explicit safe controller. Different L-norm based input limitations can be introduced as convex constraint… ▽ More We study the problem of co-designing control barrier functions (CBF) and linear state feedback controllers for continuous-time linear systems. We achieve this by means of a single semi-definite optimization program. Our formulation can handle mixed-relative degree problems without requiring an explicit safe controller. Different L-norm based input limitations can be introduced as convex constraints in the proposed program. We demonstrate our results on an omni-directional car numerical example. △ Less

Submitted 18 March, 2024; originally announced March 2024.

Comments: manuscript submitted to TAC

Showing 1–50 of 965 results for author: Wang, H