*Result*: A Global Optimization Algorithm for <italic>K</italic>-Center Clustering of One Billion Samples.

Title:
A Global Optimization Algorithm for K-Center Clustering of One Billion Samples.
Authors:
Ren, Jiayang1 (AUTHOR) rjy12307@mail.ubc.ca, You, Ningning2 (AUTHOR) ningyou@sjtu.edu.cn, Hua, Kaixun1 (AUTHOR) kaixun.hua@ubc.ca, Ji, Chaojie3 (AUTHOR) chaojiej@math.ubc.ca, Cao, Yankai1 (AUTHOR) yankai.cao@ubc.ca
Source:
Management Science (INFORMS). Sep2025, p1. 25p.
Database:
Business Source Premier

*Further Information*

*This paper presents a practical global optimization algorithm for the <italic>K</italic>-center clustering problem, which aims to select <italic>K</italic> samples as the cluster centers to minimize the maximum within-cluster distance. Specifically, we propose a reduced-space branch and bound scheme that guarantees convergence to the global optimum in a finite number of steps by only branching on the regions of centers. To improve efficiency, we design a two-stage decomposable lower bound, the solution of which can be derived in a closed form. In addition, we also propose several structure-exploiting acceleration techniques to narrow down the region of centers, including bounds tightening, sample reduction, and parallelization. Extensive studies on synthetic and real-world data sets have demonstrated that our algorithm can solve the <italic>K</italic>-center problems to global optimal within four hours for 10 million samples in the serial mode and 1 billion samples in the parallel mode, whereas existing studies can only address small-scale problems (e.g., thousands of samples). Moreover, compared with the state-of-the-art heuristic methods, the global optimum obtained by our algorithm reduces the objective function by an average of 25.8% on these synthetic and real-world data sets.<italic>This paper was accepted by Chung Piaw Teo, optimization.</italic><bold>Funding:</bold> This work was supported by the Natural Sciences and Engineering Research Council of Canada [Grant RGPIN-2019-05499].<bold>Supplemental Material:</bold> The data files are available at https://doi.org/10.1287/mnsc.2023.00218. [ABSTRACT FROM AUTHOR]

Copyright of Management Science (INFORMS) is the property of INFORMS: Institute for Operations Research & the Management Sciences and its content may not be copied or emailed to multiple sites without the copyright holder's express written permission. Additionally, content may not be used with any artificial intelligence tools or machine learning technologies. However, users may print, download, or email articles for individual use. This abstract may be abridged. No warranty is given about the accuracy of the copy. Users should refer to the original published version of the material for the full abstract. (Copyright applies to all Abstracts.)*