Multidimensional Scaling Essentials: Algorithms and R Code

kassambara | 16/10/2017 | 71132 | Comments (7) | Principal Component Methods in R: Practical Guide

Multidimensional scaling (MDS) is a multivariate data analysis approach that is used to visualize the similarity/dissimilarity between samples by plotting points in two dimensional plots.

MDS returns an optimal solution to represent the data in a lower-dimensional space, where the number of dimensions k is pre-specified by the analyst. For example, choosing k = 2 optimizes the object locations for a two-dimensional scatter plot.

An MDS algorithm takes as an input data the dissimilarity matrix, representing the distances between pairs of objects.

The input data for MDS is a dissimilarity matrix representing the distances between pairs of objects.

This article describes MDS algorithms and provides R code to compute MDS.

Contents:

Types of MDS algorithms
Compute MDS in R
Visualizing a correlation matrix using Multidimensional Scaling
Comparing MDS and PCA
See also

Types of MDS algorithms

There are different types of MDS algorithms, including

Classical multidimensional scaling

Preserves the original distance metric, between points, as well as possible. That is the fitted distances on the MDS map and the original distances are in the same metric. Classic MDS belongs to the so-called metric multidimensional scaling category.

It’s also known as principal coordinates analysis. It’s suitable for quantitative data.

Non-metric multidimensional scaling

It’s also known as ordinal MDS. Here, it’s not the metric of a distance value that is important or meaningful, but its value in relation to the distances between other pairs of objects.

Ordinal MDS constructs fitted distances that are in the same rank order as the original distance. For example, if the distance of apart objects 1 and 5 rank fifth in the original distance data, then they should also rank fifth in the MDS configuration.

It’s suitable for qualitative data.

Compute MDS in R

R functions

cmdscale() [stats package]: Compute classical (metric) multidimensional scaling.
isoMDS() [MASS package]: Compute Kruskal’s non-metric multidimensional scaling (one form of non-metric MDS).
sammon() [MASS package]: Compute sammon’s non-linear mapping (one form of non-metric MDS).

All these functions take a distance object as the main argument and k is the desired number of dimensions in the scaled output. By default, they return two dimension solutions, but we can change that through the parameter k which defaults to 2.

Demo data

swiss data that contains fertility and socio-economic data on 47 French speaking provinces in Switzerland.

data("swiss")
head(swiss)

##              Fertility Agriculture Examination Education Catholic
## Courtelary        80.2        17.0          15        12     9.96
## Delemont          83.1        45.1           6         9    84.84
## Franches-Mnt      92.5        39.7           5         5    93.40
## Moutier           85.8        36.5          12         7    33.77
## Neuveville        76.9        43.5          17        15     5.16
## Porrentruy        76.1        35.3           9         7    90.57
##              Infant.Mortality
## Courtelary               22.2
## Delemont                 22.2
## Franches-Mnt             20.2
## Moutier                  20.3
## Neuveville               20.6
## Porrentruy               26.6

Classical MDS

# Load required packages
library(magrittr)
library(dplyr)
library(ggpubr)
# Cmpute MDS
mds <- swiss %>%
  dist() %>%          
  cmdscale() %>%
  as_tibble()
colnames(mds) <- c("Dim.1", "Dim.2")
# Plot MDS
ggscatter(mds, x = "Dim.1", y = "Dim.2", 
          label = rownames(swiss),
          size = 1,
          repel = TRUE)

Create 3 groups using k-means clustering. Color points by groups

# K-means clustering
clust <- kmeans(mds, 3)$cluster %>%
  as.factor()
mds <- mds %>%
  mutate(groups = clust)
# Plot and color by groups
ggscatter(mds, x = "Dim.1", y = "Dim.2", 
          label = rownames(swiss),
          color = "groups",
          palette = "jco",
          size = 1, 
          ellipse = TRUE,
          ellipse.type = "convex",
          repel = TRUE)

Non-metric MDS

Load general packages:

library(magrittr)
library(dplyr)
library(ggpubr)

Kruskal’s non-metric multidimensional scaling

# Cmpute MDS
library(MASS)
mds <- swiss %>%
  dist() %>%          
  isoMDS() %>%
  .$points %>%
  as_tibble()
colnames(mds) <- c("Dim.1", "Dim.2")
# Plot MDS
ggscatter(mds, x = "Dim.1", y = "Dim.2", 
          label = rownames(swiss),
          size = 1,
          repel = TRUE)

Sammon’s non-linear mapping:

# Cmpute MDS
library(MASS)
mds <- swiss %>%
  dist() %>%          
  sammon() %>%
  .$points %>%
  as_tibble()
colnames(mds) <- c("Dim.1", "Dim.2")
# Plot MDS
ggscatter(mds, x = "Dim.1", y = "Dim.2", 
          label = rownames(swiss),
          size = 1,
          repel = TRUE)

Visualizing a correlation matrix using Multidimensional Scaling

MDS can be also used to reveal a hidden pattern in a correlation matrix.

Correlation actually measures similarity, but it is easy to transform it to a measure of dissimilarity. Distance between objects can be calculated as 1 - res.cor.

res.cor <- cor(mtcars, method = "spearman")
mds.cor <- (1 - res.cor) %>%
  cmdscale() %>%
  as_tibble()
colnames(mds.cor) <- c("Dim.1", "Dim.2")
ggscatter(mds.cor, x = "Dim.1", y = "Dim.2", 
          size = 1,
          label = colnames(res.cor),
          repel = TRUE)

Positive correlated objects are close together on the same side of the plot.

Comparing MDS and PCA

Mathematically and conceptually, there are close correspondences between MDS and other methods used to reduce the dimensionality of complex data, such as Principal components analysis (PCA) and factor analysis.

PCA is more focused on the dimensions themselves, and seek to maximize explained variance, whereas MDS is more focused on relations among the scaled objects.

MDS projects n-dimensional data points to a (commonly) 2-dimensional space such that similar objects in the n-dimensional space will be close together on the two dimensional plot, while PCA projects a multidimensional space to the directions of maximum variability using covariance/correlation matrix to analyze the correlation between data points and variables.

Recommended for You!

Machine Learning Essentials: Practical Guide in R

Practical Guide to Cluster Analysis in R

Practical Guide to Principal Component Methods in R

R Graphics Essentials for Great Data Visualization

Network Analysis and Visualization in R

More books on R and data science

Recommended for you

This section contains the best data science and self-development resources to help you on your path.

Books - Data Science

Our Books

Practical Guide to Cluster Analysis in R by A. Kassambara (Datanovia)
Practical Guide To Principal Component Methods in R by A. Kassambara (Datanovia)
Machine Learning Essentials: Practical Guide in R by A. Kassambara (Datanovia)
R Graphics Essentials for Great Data Visualization by A. Kassambara (Datanovia)
GGPlot2 Essentials for Great Data Visualization in R by A. Kassambara (Datanovia)
Network Analysis and Visualization in R by A. Kassambara (Datanovia)
Practical Statistics in R for Comparing Groups: Numerical Variables by A. Kassambara (Datanovia)
Inter-Rater Reliability Essentials: Practical Guide in R by A. Kassambara (Datanovia)

Others

R for Data Science: Import, Tidy, Transform, Visualize, and Model Data by Hadley Wickham & Garrett Grolemund
Hands-On Machine Learning with Scikit-Learn, Keras, and TensorFlow: Concepts, Tools, and Techniques to Build Intelligent Systems by Aurelien Géron
Practical Statistics for Data Scientists: 50 Essential Concepts by Peter Bruce & Andrew Bruce
Hands-On Programming with R: Write Your Own Functions And Simulations by Garrett Grolemund & Hadley Wickham
An Introduction to Statistical Learning: with Applications in R by Gareth James et al.
Deep Learning with R by François Chollet & J.J. Allaire
Deep Learning with Python by François Chollet

Comments

You are not authorized to post a comment

Comment

kassambara

Administrator

#279 10/28/2017 at 06h40

Thank you for your positive feedback! Highly appreciated

Comment

Visitor

#278 10/28/2017 at 02h21

Ok - finally understood!.

I read the post several times
and carefully entered the MDS examples
in my Rstudio, (Ubuntu Linux-32bits)

Thank you, Kassambara
for your patience and very clear explanations.

STHDA is a such a great R website!

One of my top favorites.

SFdude
San Francisco

Comment

kassambara

Administrator

#277 10/28/2017 at 00h29

Hi,

In the last plot, we used MDS to visualize a correlation matrix between variables in the mtcars data set.

We used `1 - correlation.coefficient` as distance measure.

- Points that are on the same side of the plot are positively correlated. Example: am & gear. This means that an increase in am values is associated with an increase in gear values and vice-versa
- Points that are on opposite side on opposite sides of the plot are negatively corrrelated. Example: mpg and wt. When mpg increases then wt decreases......

Comment

Visitor

#276 10/27/2017 at 02h58

Hi Kassambara -

Your explanation ref MDS plots
is very good!.

I now understand the technique
to generate the last MDS plot
in this article, (mtcars).

But I'm still having difficulty
expressing in plain English (to a business person),
what this last plot means
for mtcars,
if the closest distance is between
the am and the gear points?.

How to interpret
in practical,
simple English language,
what this last MDS plot means for the mtcars data ?...
(if these 2 points are closest to each other,
than the other points in this plot...)

help / au secours!
SFdude
San Francisco

Comment

Visitor

#271 10/22/2017 at 17h29

Thank you, Kassambara!.

Will read the suggested
STHDA article, next.

STHDA
is truly one of my favorite learning sites.
A+...

Comment

kassambara

Administrator

#266 10/19/2017 at 22h09

Dim.1 and Dim.2 are principal component analysis dimension 1 and 2.

Read this:

Principal component analysis essentials

Comment

SFdude

Visitor

#264 10/19/2017 at 17h57

This article is really good
and very clear!.

Q:
in the last plot image (for mtcars),
what do the Dim1 and Dim2 axis numbers/values
really mean?
(ie:
- Dim1: axis from -1.0 to 1.0
- Dim2: axis from -0.5 to 0.5)

(in plain English pls!...)

Thanks/Merci
SFdude
San Francisco

STAY UPDATED

Articles - Principal Component Methods in R: Practical Guide