Kardi Teknomo

Cophenetic Correlation Coefficient

By Kardi Teknomo, PhD.


How good is the clustering that we just performed? An index called the Cophenetic Correlation Coefficient (CP), also referred to here as the Cross Correlation Coefficient, measures the goodness of fit of a clustering, much as the correlation coefficient does for a regression.

To compute the Cophenetic Correlation Coefficient of a hierarchical clustering, we need two pieces of information:

  1. Distance matrix
  2. Cophenetic Matrix

The distance matrix is the input to the hierarchical clustering computation. Because the distance matrix is symmetric, we need only its lower triangular values for our purpose.

[Figure: distance matrix]
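As a minimal sketch of what "only the lower triangular values" means, the snippet below stores one distance per unordered pair of objects. The coordinates are an assumption made for illustration (they are not given in this section), and the names `points` and `lower` are hypothetical; Euclidean distance is also assumed.

```python
from itertools import combinations
from math import dist

# Assumed coordinates, for illustration only (not stated in this section).
points = {
    "A": (1.0, 1.0),
    "B": (1.5, 1.5),
    "C": (5.0, 5.0),
    "D": (3.0, 4.0),
    "E": (4.0, 4.0),
    "F": (3.0, 3.5),
}

labels = sorted(points)

# A symmetric matrix is fully described by one value per unordered pair,
# so we keep n * (n - 1) / 2 entries instead of n * n.
lower = {(a, b): dist(points[a], points[b]) for a, b in combinations(labels, 2)}

# Print the entries in lower triangular layout (row label, then earlier columns).
for i, row in enumerate(labels):
    cells = " ".join(f"{lower[(col, row)]:5.2f}" for col in labels[:i])
    print(row, cells)
```

With six objects this keeps 15 values rather than 36.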

To obtain the cophenetic matrix, we fill the lower triangular matrix with the distance at which each pair of objects was first merged into the same cluster, as obtained in the previous section. Recall the summary from the last section:

  1. We merge cluster D and F into cluster (D, F) at distance 0.50
  2. We merge cluster A and cluster B into (A, B) at distance 0.71
  3. We merge cluster E and (D, F) into ((D, F), E) at distance 1.00
  4. We merge cluster ((D, F), E) and C into (((D, F), E), C) at distance 1.41
  5. We merge cluster (((D, F), E), C) and (A, B) into ((((D, F), E), C), (A, B)) at distance 2.50

Using this information, we can fill in the cophenetic matrix:

[Figure: cophenetic matrix]
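The filling procedure can be sketched in a few lines of Python. The merge history is taken from the five steps listed above; the encoding of each cluster as a string of member labels and the function name `cophenetic_distances` are assumptions made for this sketch.

```python
# Merge history from the summary above: (members of one cluster,
# members of the other cluster, merging distance).
merges = [
    ("D", "F", 0.50),
    ("A", "B", 0.71),
    ("E", "DF", 1.00),
    ("DEF", "C", 1.41),
    ("CDEF", "AB", 2.50),
]


def cophenetic_distances(merges):
    """Map each unordered pair of objects to the distance at which
    the two objects first end up in the same cluster."""
    coph = {}
    for left, right, height in merges:
        # Every pair split across the two merged clusters meets at this step.
        for a in left:
            for b in right:
                coph[tuple(sorted((a, b)))] = height
    return coph


coph = cophenetic_distances(merges)

# Print the result in lower triangular layout.
labels = "ABCDEF"
for i, row in enumerate(labels):
    cells = " ".join(f"{coph[(col, row)]:5.2f}" for col in labels[:i])
    print(row, cells)
```

Each pair receives exactly one value, because two objects are joined by exactly one merge step.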

Now we have both required pieces of information, and we can put them together into a single matrix.

[Figure: correlation]

The Cophenetic Correlation Coefficient is simply the correlation coefficient between the distance matrix and the cophenetic matrix: Correl(Dist, CP) = 86.399%. Because this value is quite close to 100%, we can say that the clustering fits the data quite well.
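The computation can be sketched end to end in plain Python. The merge distances come from the summary above. The point coordinates, the use of Euclidean distance, and the helper name `pearson` are assumptions made for illustration, since the distance matrix itself is not reproduced in this text.

```python
from itertools import combinations
from math import dist, sqrt

# Assumed coordinates, for illustration only (not stated in this section).
points = {
    "A": (1.0, 1.0), "B": (1.5, 1.5), "C": (5.0, 5.0),
    "D": (3.0, 4.0), "E": (4.0, 4.0), "F": (3.0, 3.5),
}

# Merge history from the summary above.
merges = [
    ("D", "F", 0.50),
    ("A", "B", 0.71),
    ("E", "DF", 1.00),
    ("DEF", "C", 1.41),
    ("CDEF", "AB", 2.50),
]

# Cophenetic distance of a pair = distance of the merge that first joins it.
coph = {}
for left, right, height in merges:
    for a in left:
        for b in right:
            coph[frozenset((a, b))] = height

# Flatten both lower triangles into two lists with the same pair order.
pairs = list(combinations(sorted(points), 2))
d = [dist(points[a], points[b]) for a, b in pairs]
c = [coph[frozenset(pair)] for pair in pairs]


def pearson(x, y):
    """Pearson correlation coefficient of two equal-length sequences."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    cov = sum((xi - mean_x) * (yi - mean_y) for xi, yi in zip(x, y))
    var_x = sum((xi - mean_x) ** 2 for xi in x)
    var_y = sum((yi - mean_y) ** 2 for yi in y)
    return cov / sqrt(var_x * var_y)


r = pearson(d, c)
print(f"Cophenetic correlation coefficient: {r:.1%}")
```

Under these assumed coordinates the result comes out close to the 86.399% quoted above. Small differences are expected, because the merge distances are rounded to two decimals.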

See Also: Correlation Coefficient


This tutorial is copyrighted.

The preferred reference for this tutorial is

Teknomo, Kardi. (2009) Hierarchical Clustering Tutorial.
http://people.revoledu.com/kardi/tutorial/clustering/

 

 
© 2007 Kardi Teknomo. All Rights Reserved.