L. Amorosi, J. Puerto Albandoz, C. Valverde Martín

Hierarchical clustering is a statistical technique to study groups (clusters) within a dataset by creating a hierarchy of clusters, represented by a rooted tree (dendrogram). The leaves correspond to data points, and each internal node represents a cluster containing its descendant leaves. Among hierarchical clustering methods, agglomerative ones rely on greedy procedures that return nested partitions, where each level joins two clusters from the lower partition using a local criterion. In this talk, we focus on a mathematical programming formalization that incorporates single and complete linkage procedures. Through experiments, we compare the dendrograms obtained from the exact resolution of the formulations with those from the greedy approach. Additionally, we present a scalable matheuristic algorithm that generates better quality dendrograms than the greedy approach, even for large datasets.

Keywords: Data science; Hierarchical clustering; Mathematical programming;

Scheduled

Location
June 11, 2025  10:30 AM
Mr 2


Other papers in the same session

The Multi-Commodity Flow Problem with Outsourcing Decisions

E. Fernández, I. Ljubic, N. Zerega Oyarzún

The p-median location problem with lack of capacity

I. Martin Melero, M. Landete Ruiz, J. C. Gonçalves Dosantos, J. Sanchez Soriano


Cookie policy

We use cookies in order to be able to identify and authenticate you on the website. They are necessary for the correct functioning of it, and therefore they can not be disabled. If you continue browsing the website, you are agreeing with their acceptance, as well as our Privacy Policy.

Additionally, we use Google Analytics in order to analyze the website traffic. They also use cookies and you can accept or refuse them with the buttons below.

You can read more details about our Cookie Policy and our Privacy Policy.