Data clustering and noise undressing of correlation matrices

Lorenzo Giada and Matteo Marsili
Phys. Rev. E 63, 061101 – Published 15 May 2001
PDFExport Citation

Abstract

We discuss an approach to data clustering. We find that maximum likelihood leads naturally to an Hamiltonian of Potts variables that depends on the correlation matrix and whose low temperature behavior describes the correlation structure of the data. For random, uncorrelated data sets no correlation structure emerges. On the other hand, for data sets with a built-in cluster structure, the method is able to detect and recover efficiently that structure. Finally we apply the method to financial time series, where the low-temperature behavior reveals a nontrivial clustering.

  • Received 17 January 2001

DOI:https://doi.org/10.1103/PhysRevE.63.061101

©2001 American Physical Society

Authors & Affiliations

Lorenzo Giada and Matteo Marsili

  • Istituto Nazionale per la Fisica della Materia (INFM), Trieste Unit, Trieste I-34014, Italy
  • International School for Advanced Studies (SISSA), V. Beirut 2-4, Trieste I-34014, Italy

References (Subscription Required)

Click to Expand
Issue

Vol. 63, Iss. 6 — June 2001

Reuse & Permissions
Access Options
Author publication services for translation and copyediting assistance advertisement

Authorization Required


×
×

Images

×

Sign up to receive regular email alerts from Physical Review E

Log In

Cancel
×

Search


Article Lookup

Paste a citation or DOI

Enter a citation
×