Clustering to Minimize Cluster-Aware Norm Objectives
MG Herold, E Kipouridis, J Spoerhase - Proceedings of the 2025 Annual ACM …, 2025 - SIAM
MG Herold, E Kipouridis, J Spoerhase
Proceedings of the 2025 Annual ACM-SIAM Symposium on Discrete Algorithms (SODA), 2025•SIAMWe initiate the study of the following general clustering problem. We seek to partition a given
set P of data points into k clusters by finding a set X of k centers and assigning each data
point to one of the centers. The cost of a cluster, represented by a center x∊ X, is a
monotone, symmetric norm f (called inner norm) of the vector of distances of points assigned
to x. The goal is to minimize a norm g (called outer norm) of the vector of cluster costs. This
problem, which we call (f, g)-Clustering, generalizes many fundamental clustering problems …
set P of data points into k clusters by finding a set X of k centers and assigning each data
point to one of the centers. The cost of a cluster, represented by a center x∊ X, is a
monotone, symmetric norm f (called inner norm) of the vector of distances of points assigned
to x. The goal is to minimize a norm g (called outer norm) of the vector of cluster costs. This
problem, which we call (f, g)-Clustering, generalizes many fundamental clustering problems …
Abstract
We initiate the study of the following general clustering problem. We seek to partition a given set P of data points into k clusters by finding a set X of k centers and assigning each data point to one of the centers. The cost of a cluster, represented by a center x ∊ X, is a monotone, symmetric norm f (called inner norm) of the vector of distances of points assigned to x. The goal is to minimize a norm g (called outer norm) of the vector of cluster costs. This problem, which we call (f, g )-Clustering, generalizes many fundamental clustering problems such as k-Center (i.e., (𝓛∞, 𝓛∞)-Clustering), k-Median (i.e., (𝓛1, 𝓛1)-Clustering), Min-Sum of Radii (i.e., (𝓛∞, 𝓛1)-Clustering), and Min-Load k-Clustering (i.e., (𝓛1, L∞)-Clustering). A recent line of research (Byrka et al. [STOC’18], Chakrabarty, Swamy [ICALP’18, STOC’19], and Abbasi et al. [FOCS’23]) studies norm objectives that are oblivious to the cluster structure such as k-Median and k-Center. In contrast, our problem models cluster-aware objectives including Min-Sum of Radii and Min-Load k-Clustering.
Our main results are as follows. First, we design a constant-factor approximation algorithm for (topℓ, 𝓛1)- Clustering where the inner norm (topℓ) sums over the ℓ largest distances. This unifies (up to constant factors) the best known results for k-Median and Min-Sum of Radii. Second, we design a constant-factor approximation for (𝓛∞, Ord)-Clustering where the outer norm is a convex combination of topℓ norms (ordered weighted norm). This generalizes known results for k-Center and Min-Sum of Radii. Obtaining a constant-factor approximation for more general settings that include (𝓛1, 𝓛∞)-Clustering (Min-Load k-Clustering) seems challenging because even an o (k )-approximation is unknown for this problem. We can still use our two main results to obtain first (although non-constant) approximations for these problems including general monotone, symmetric norms.
Our algorithm for (topℓ, 𝓛1)-Clustering relies on a reduction to a novel generalization of k-Median, which we call Ball k-Median. In this problem, we aim at selecting k balls (rather than k centers) and pay for connecting the points to these balls as well as for the (scaled) radii of the balls. To obtain a constant-factor approximation for this problem we unify various algorithmic techniques originally designed for the cluster-oblivious k-Median objective (Jain and Vazirani [JACM 2001], Li and Svensson [STOC’13]) and for the cluster-aware Min-Sum of Radii Objective (Charikar and Panigrahi [STOC’01] and Ahmadian and Swamy [ICALP’16]).
* Martin Herold is funded by the Deutsche Forschungsgemeinschaft (DFG, German Research Foundation) — Project number 399223600. This work is part of the project TIPEA that has received funding from the European Research Council (ERC) under the European Union's Horizon 2020 research and innovation programme (grant agreement No. 850979). We are grateful to an anonymous reviewer for making concrete suggestions how to substantially simplify our algorithm for (Top, 𝓛1 (-Clustering by applying the techniques of Ahmadian, Swamy [4].
Society for Industrial and Applied Mathematics
Showing the best result for this search. See all results