| |||||||||||||||||
![]() |
![]() |
![]() |
|||||||||||||||
|
Jaccard’s Coefficient Jaccard's coefficient (measure similarity) and Jaccard's distance (measure dissimilarity) are measurement of asymmetric information on binary (and non-binary) variables. Compare Jaccard's coefficient with Simple matching coefficient. For some applications, the existence of Formula Where
Jaccard's distance can be obtained from
Thus, Example 1:
The coordinate of Apple is (1,1,1,1) and coordinate of Banana is (0,1,0,0). Because each object is represented by 4 variables, we say that these objects has 4 dimensions.
Jaccard's coefficient between Apple and Banana is 1/4 . Jaccard's distance between Apple and Banana is 3/4.
For non binary data, Jaccard's coefficient can also be computed using set relations
Example 2 Suppose we have two sets Then the union is
Of course, the set formula is also work for binary data, but we need to compute each digit using Boolean algebra. (A and B is True if both true, A or B is false if both False). Intersection set is equivalent to AND, while Union operation is equivalent to OR. Example 3 Let us use the example above
Sum of all digits can be used to compute Jaccard's coefficient
This tutorial is copyrighted. Preferable reference for this tutorial is Teknomo, Kardi. Similarity Measurement. http:\\people.revoledu.com\kardi\ tutorial\Similarity\
|
||||||||||||||||||||||||||||||||||||||||||||||||||
|
|||||||||||||||||||||||||||||||||||||||||||||||||||
© 2006 Kardi Teknomo. All Rights Reserved. Designed by CNV Media |
|||||||||||||||||||||||||||||||||||||||||||||||||||