Affordable Access

Redundancy analysis for characterizing the correlation between groups of variables - Applied to molecular descriptors

Authors
Journal
Chemometrics and Intelligent Laboratory Systems
0169-7439
Publisher
Elsevier
Publication Date
Volume
117
Identifiers
DOI: 10.1016/j.chemolab.2011.05.013
Keywords
  • Variable Blocks
  • Correlation
  • Ra
  • Cca
  • Pls
Disciplines
  • Chemistry

Abstract

Abstract Redundancy analysis (RA) estimates the extent of linear relationships between blocks of variables that are given for a set of objects (samples). RA has only rarely been used in chemometrics. Basic principles and limits of RA are discussed, and RA is briefly compared with canonical correlation analysis (CCA) and partial least-squares (PLS2) regression. The significance of a redundancy index is estimated by permutation tests. For PLS2, an index determining the similarity of variable blocks can be derived that is equivalent to the canonical measure of correlation, CMC. RA is applied to a set of 3708 molecular descriptors (created by software Dragon) for 6458 chemical structures (AMES database). The 27 descriptor groups are characterized by their redundancy indices, which allow a comparison of their multivariate information content. The results guide the selection of the most different descriptor groups, which perform better in a discrimination task (classification of mutagenicity) than the entire groups.

There are no comments yet on this publication. Be the first to share your thoughts.

Statistics

Seen <100 times
0 Comments

More articles like this

Fractal descriptors in the Fourier domain applied...

on Chaos An Interdisciplinary Jou... December 2011

Correlations between the benzene character of acen...

on Bioorganic & Medicinal Chemist... Jan 01, 2004

Correlations between the benzene character of acen...

on Bioorganic & Medicinal Chemist... Mar 08, 2004

Quantitative correlation between theoretical molec...

on Chemical Biology & Drug Design October 2008
More articles like this..