9–11 Dec 2015
Dipartimento di Fisica, Univ. Bari - INFN Sezione di Bari
Europe/Rome timezone
SM&FT 2015 Computational approaches in Quantum Field Theory, Statistical Mechanics and Complex Systems

Fully automated clustering by accurate non-parametric density estimation

10 Dec 2015, 15:40
20m
Aula B

Aula B

Speaker

Dr Maria d'Errico (SISSA)

Description

The use of the right density estimates in the framework of density-based cluster analysis is one of the keys to reveal the properties of a given dataset. We developed a new unsupervised and adaptive density estimator [1], that is able to reconstruct the point local density in an accurate way, even in highly inhomogeneous datasets. By combining the new density estimator with a generalized version of a recently developed clustering approach [2], we can automatically recognize sets of data points organized in clusters, regardless of the dataset characteristics (i.e. space dimensionality, shape of the clusters, distance metrics). We demonstrate the power of the algorithm performing cluster analyses on biological systems to disentangle complexity patterns. In particular, interesting results have been obtained by analysing rRNA sequences from human GUT microbiota. [1] M. d'Errico, A. Laio, E. Facco and A. Rodriguez, "An accurate and unsupervised density estimator for highly inhomogeneous datasets" (In preparation). [2] A. Rodriguez, A. Laio, "Clustering by fast search and find density peaks", Science, 2014, 344, 1492-1496

Primary author

Dr Maria d'Errico (SISSA)

Co-authors

Prof. Alessandro Laio (SISSA) Dr Alex Rodriguez (SISSA) Ms Elena Facco (SISSA)

Presentation materials