Practical Guide To Principal Component Methods in R

Sale!

Practical Guide To Principal Component Methods in R

Rated 4.56 out of 5 based on 32 customer ratings

€27.95

This book provides a solid practical guidance to summarize, visualize and interpret the most important information in a large multivariate data sets, using principal component methods in R.

You will learn:

Principal Component Analysis (PCA) for summarizing a large dataset of continuous variables
Simple Correspondence Analysis (CA) for large contingency tables formed by two categorical variables
Multiple Correspondence Analysis (MCA) for a data set with more than 2 categorical variables
Methods for analyzing a data set containing a mix of variables (continuous and categorical) structured or not into groups: Factor Analysis of Mixed Data (FAMD) and Multiple Factor Analysis (MFA).
Hierarchical Clustering on Principal Components (HCPC), which is useful for performing clustering with a data set containing only categorical variables or with a mixed data of categorical and continuous variables

Order a Physical Copy on Amazon:

Or, Buy and Download Now a PDF Copy by clicking on the “ADD TO CART” button down below. You will receive a link to download a PDF copy (click to see the book preview)

Category: Book Tags: Multivariate Analysis, Unsupervised Learning

11 11 9 16 11 13 13 8 10

102

Description
Reviews (32)

Description

Although there are several good books on principal component methods (PCMs) and related topics, we felt that many of them are either too theoretical or too advanced.

This book provides a solid practical guidance to summarize, visualize and interpret the most important information in a large multivariate data sets, using principal component methods in R.

The following figure illustrates the type of analysis to be performed depending on the type of variables contained in the data set.

Principal component methods

There are a number of R packages implementing principal component methods. These packages include: FactoMineR, ade4, stats, ca, MASS and ExPosition.

However, the result is presented differently depending on the used package.

To help in the interpretation and in the visualization of multivariate analysis - such as cluster analysis and principal component methods - we developed an easy-to-use R package named factoextra.

No matter which package you decide to use for computing principal component methods, the factoextra R package can help to extract easily, in a human readable data format, the analysis results from the different packages mentioned above. factoextra provides also convenient solutions to create ggplot2-based beautiful graphs.

Methods, which outputs can be visualized using the factoextra package are shown in the figure below:

Principal component methods and clustering methods supported by the factoextra R package

In this book, we’ll use mainly:

the FactoMineR package to compute principal component methods;
and the factoextra package for extracting, visualizing and interpreting the results.

The other packages - ade4, ExPosition, etc - will be also presented briefly.

How this book is organized

This book contains 4 parts.

Principal Component Methods book structure

Part I provides a quick introduction to R and presents the key features of FactoMineR and factoextra.

Key features of FactoMineR and factoextra for multivariate analysis

Part II describes classical principal component methods to analyze data sets containing, predominantly, either continuous or categorical variables. These methods include:

Principal Component Analysis (PCA, for continuous variables),
Simple correspondence analysis (CA, for large contingency tables formed by two categorical variables)
Multiple correspondence analysis (MCA, for a data set with more than 2 categorical variables).

In Part III, you’ll learn advanced methods for analyzing a data set containing a mix of variables (continuous and categorical) structured or not into groups:

Factor Analysis of Mixed Data (FAMD) and,
Multiple Factor Analysis (MFA).

Part IV covers hierarchical clustering on principal components (HCPC), which is useful for performing clustering with a data set containing only categorical variables or with a mixed data of categorical and continuous variables

Key features of this book

This book presents the basic principles of the different methods and provide many examples in R. This book offers solid guidance in data mining for students and researchers.

Key features:

Covers principal component methods and implementation in R
Highlights the most important information in your data set using ggplot2-based elegant visualization
Short, self-contained chapters with tested examples that allow for flexibility in designing a course and for easy reference

At the end of each chapter, we present R lab sections in which we systematically work through applications of the various methods discussed in that chapter. Additionally, we provide links to other resources and to our hand-curated list of videos on principal component methods for further learning.

Examples of plots

Some examples of plots generated in this book are shown hereafter. You’ll learn how to create, customize and interpret these plots.

Eigenvalues/variances of principal components. Proportion of information retained by each principal component.

PCA - Graph of variables:

Control variable colors using their contributions to the principal components.

Highlight the most contributing variables to each principal dimension:

PCA - Graph of individuals:

Control automatically the color of individuals using the cos2 (the quality of the individuals on the factor map)

Change the point size according to the cos2 of the corresponding individuals:

PCA - Biplot of individuals and variables

Correspondence analysis. Association between categorical variables.

FAMD/MFA - Analyzing mixed and structured data

Clustering on principal components

Recommended for you

This section contains best data science and self-development resources to help you on your path.

Coursera - Online Courses and Specialization

Amazon FBA

Amazing Selling Machine

Free Training - How to Build a 7-Figure Amazon FBA Business You Can Run 100% From Home and Build Your Dream Life! by ASM

Books - Data Science

Our Books

Practical Guide to Cluster Analysis in R by A. Kassambara (Datanovia)
Practical Guide To Principal Component Methods in R by A. Kassambara (Datanovia)
Machine Learning Essentials: Practical Guide in R by A. Kassambara (Datanovia)
R Graphics Essentials for Great Data Visualization by A. Kassambara (Datanovia)
GGPlot2 Essentials for Great Data Visualization in R by A. Kassambara (Datanovia)
Network Analysis and Visualization in R by A. Kassambara (Datanovia)
Practical Statistics in R for Comparing Groups: Numerical Variables by A. Kassambara (Datanovia)
Inter-Rater Reliability Essentials: Practical Guide in R by A. Kassambara (Datanovia)

Others

R for Data Science: Import, Tidy, Transform, Visualize, and Model Data by Hadley Wickham & Garrett Grolemund
Hands-On Machine Learning with Scikit-Learn, Keras, and TensorFlow: Concepts, Tools, and Techniques to Build Intelligent Systems by Aurelien Géron
Practical Statistics for Data Scientists: 50 Essential Concepts by Peter Bruce & Andrew Bruce
Hands-On Programming with R: Write Your Own Functions And Simulations by Garrett Grolemund & Hadley Wickham
An Introduction to Statistical Learning: with Applications in R by Gareth James et al.
Deep Learning with R by François Chollet & J.J. Allaire
Deep Learning with Python by François Chollet

Version: Français

32 reviews for Practical Guide To Principal Component Methods in R

Rated 5 out of 5

Anonymous (verified owner) – November 15, 2018
Rated 5 out of 5

Eko Subagyo (verified owner) – January 19, 2019
Rated 5 out of 5

Christian Larsen (verified owner) – April 15, 2019
Rated 4 out of 5

Hamidou Sy (verified owner) – April 24, 2019
Rated 5 out of 5

Payot Didier (verified owner) – May 13, 2019
Rated 5 out of 5

Manuel Pellicer (verified owner) – May 19, 2019
Rated 5 out of 5

Johann (verified owner) – May 27, 2019

PCA in bivariate space can appear quite intimidating to those learning the concepts. This book is well-written and explains the key concepts in easy-to-understand language. The author has done very well in conveying complicated concepts on a level which most people can understand and this book has become my standard reference for PCA in R. This book is aimed at the beginner and average user of PCA. Key concepts are well-explained, but if you are looking for detailed mathematical proofs, then this is not the book for you.
Rated 1 out of 5

David Fiscus (verified owner) – October 12, 2019

What happened to the book? Its not here, but this email is. Hmmmm?
Rated 5 out of 5

José de França Bueno (verified owner) – January 6, 2020
Rated 5 out of 5

Rita L. (verified owner) – February 26, 2020

Very good books
Rated 5 out of 5

Pavel (verified owner) – March 1, 2020

Very professional level, very helpfull book
Rated 5 out of 5

juan manuel (verified owner) – June 7, 2020

very practical and usefull
Rated 4 out of 5

Thorsten Raff (verified owner) – July 2, 2020

Very good book with good examples.
Rated 5 out of 5

Andre S. (verified owner) – July 29, 2020
Rated 5 out of 5

Minh Huynh (verified owner) – August 19, 2020

Good resource material with easy to follow instructions and and working code
Rated 4 out of 5

Anonymous (verified owner) – September 23, 2020
Rated 5 out of 5

Vincent V. (verified owner) – September 26, 2020

Excellent intro, wonderful learning tool, numerous clear examples
Rated 5 out of 5

Karol L. (verified owner) – October 1, 2020
Rated 4 out of 5

Jean-Carlos Montero-Serrano (verified owner) – January 9, 2021

Excellent book ! but help is missing to make triangle diagrams/boxplot with PCAs and cluster groups in the biplot.
Rated 5 out of 5

Anonymous (verified owner) – January 13, 2021
Rated 5 out of 5

MUSADJI Neil Yohan (verified owner) – April 6, 2021

the book is very interesting. it provides and clearly explains the steps to carry out the factor analyzes.

However, could you clearly mention the script for getting variable weights?
Rated 4 out of 5

Diego Andres Chavarro Bohorquez (verified owner) – May 20, 2021

It is an very good book. Clearly explained and with very relevant examples.
Rated 5 out of 5

Etienne Ntumba (verified owner) – July 2, 2021
Rated 5 out of 5

Anonymous (verified owner) – August 18, 2021

Highly recommended.
Rated 4 out of 5

Erry Ika RHOFITA (verified owner) – October 10, 2021
Rated 5 out of 5

Sebastian Riquelme (verified owner) – December 3, 2021

The book gives an easy way to learn about statistical methods very needed for my master thesis. Besides, it makes a good balance between theory and practice. 100% recommended.
Rated 4 out of 5

Chandan Kumar (verified owner) – January 18, 2022

The book is good but it is feely available
Rated 4 out of 5

Gerardo Mariscal (verified owner) – March 14, 2022

The book is written in a clear way and the results obtained performing the commands suggested are amazing
Rated 4 out of 5

Anonymous (verified owner) – June 8, 2022
Rated 5 out of 5

Jose Rafael Herrera Herrera (verified owner) – July 7, 2022

The book is very explicit and complete in all explanantions of the principal component methods. The purchase process on the Datanovia web page was secure and easy. Thank you!
Rated 5 out of 5

Anonymous (verified owner) – September 29, 2022
Rated 4 out of 5

Sílvia Panzo (verified owner) – October 23, 2022