Kardi Teknomo

Similarity Measurement

By Kardi Teknomo, PhD.

In this simple tutorial, you will learn the basics needed to extend your analysis to multivariate data (data with different types of measurement scale, such as nominal, ordinal, and quantitative) and to go beyond two-dimensional data up to N dimensions. A comprehensive example is given in the last part of this tutorial. You may also download the MS Excel companion file of this tutorial here.

This knowledge of similarity and dissimilarity is necessary in data mining, pattern recognition, machine intelligence, artificial intelligence, and multi-agent systems. However, its application is not limited to computer science. Other fields of natural and social science, as well as engineering and statistics, have applied this kind of simple knowledge. Tools such as k-means clustering, discriminant analysis, K-Nearest Neighbors, decision trees, and hierarchical clustering rely heavily on the distance matrix explained in this tutorial.
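To make the idea of a distance matrix concrete, here is a minimal Python sketch. It is an illustration only and is not taken from the tutorial's Excel companion file: the function names (`euclidean`, `city_block`, `chebyshev`, `distance_matrix`) and the three sample points are assumptions chosen for this example, and it covers quantitative variables only.

```python
import math


def euclidean(a, b):
    """Straight-line distance between two equal-length numeric vectors."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


def city_block(a, b):
    """City block (Manhattan) distance: sum of absolute differences."""
    return sum(abs(x - y) for x, y in zip(a, b))


def chebyshev(a, b):
    """Chebyshev distance: largest absolute difference over all coordinates."""
    return max(abs(x - y) for x, y in zip(a, b))


def distance_matrix(points, metric):
    """Square matrix whose entry [i][j] is metric(points[i], points[j])."""
    return [[metric(p, q) for q in points] for p in points]


# Hypothetical sample data: three objects described by two quantitative features.
points = [(0, 0), (3, 4), (6, 8)]

for name, metric in [("Euclidean", euclidean),
                     ("City block", city_block),
                     ("Chebyshev", chebyshev)]:
    print(name)
    for row in distance_matrix(points, metric):
        print("  ", row)
```

The resulting matrix is symmetric with zeros on the diagonal, because each object has zero distance to itself. Clustering and nearest-neighbor methods typically consume a matrix of this shape, whichever distance is chosen.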

What is similarity?
What is distance?
What is the relationship between similarity and dissimilarity?
Why do we need to measure similarity? (Applications)
How do we measure similarity or dissimilarity?
How do we compute dissimilarity or similarity for binary variables?
    Simple Matching Coefficient
    Jaccard's Coefficient
    Hamming Distance
How do we compute dissimilarity or similarity for nominal / categorical variables?
    Assign each value of category as a binary dummy variable
    Assign each value of category into several binary dummy variables
How do we compute dissimilarity or similarity for ordinal variables?
    Normalized Rank Transformation
    Spearman Distance
    Footrule Distance
    Kendall Distance
    Cayley Distance
    Hamming Distance for Ordinal Variable
    Ulam Distance
How do we compute dissimilarity or similarity for text and string variables?
How do we compute dissimilarity or similarity for quantitative variables?
    Euclidean Distance
    City block (Manhattan) distance
    Chebyshev Distance
    Minkowski Distance
    Canberra distance
    Bray Curtis (Sorensen) distance
    Angular separation
    Correlation coefficient
How do we compute dissimilarity between two groups (Mahalanobis distance)?
How do we normalize the similarity or dissimilarity?
How do we aggregate mixed type of variables?
Comprehensive example: Distance matrix of Multivariate data

This tutorial is copyrighted.

The preferred reference for this tutorial is:

Teknomo, Kardi. Similarity Measurement. http://people.revoledu.com/kardi/tutorial/Similarity/



© 2006 Kardi Teknomo. All Rights Reserved.