A computational approach for the identification and investigation of correlations between a chemical structure and a selected biological property is described. It is based on a set of 132 compounds of known chemical structure, which were tested for their binding affinities to the estrogen receptor. Different multivariate modeling methods, i.e., partial least-squares regression, counterpropagation neural network, and error-back-propagation neural network, were applied, and the predictive ability of each model was tested in order to compare the resulting models. To reduce the extensive set of calculated structural descriptors, two types of variable selection methods were applied, depending on the modeling approach used. In particular, the final partial least-squares regression model was built using the "variable importance in projection" variable selection method, while genetic algorithms were applied in neural network modeling to select the optimal set of descriptors. A thorough statistical study of the variables selected by genetic algorithms is shown. The results were assessed with the aim of gaining insight into the mechanisms involved in the binding of estrogenic compounds to the receptor. The variable selection on the basis of genetic algorithms was controlled with a test set of compounds extracted from the available data set. To compare the predictive ability of all the optimized models, a leave-one-out cross-validation procedure was applied, the best model being the nonlinear neural network model based on the error-back-propagation algorithm, which resulted in R² = 92.2% and Q² = 70.8%.
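
The difference between the fitted R² and the leave-one-out cross-validated Q² quoted above can be illustrated with a minimal sketch. It is not the authors' procedure: it assumes a one-descriptor ordinary least-squares line in place of the PLS and neural network models, uses the common convention Q² = 1 - PRESS/SS_tot, and runs on made-up toy values. The names `fit_line` and `r2_and_q2` are hypothetical helpers introduced only for this illustration.

```python
def fit_line(xs, ys):
    """Ordinary least-squares fit of y = slope * x + intercept."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    slope = sxy / sxx
    return slope, mean_y - slope * mean_x


def r2_and_q2(xs, ys):
    """Return (R^2, Q^2): fit quality on all data vs. leave-one-out prediction."""
    n = len(ys)
    mean_y = sum(ys) / n
    ss_tot = sum((y - mean_y) ** 2 for y in ys)

    # R^2: residuals of the model fitted on every compound.
    slope, intercept = fit_line(xs, ys)
    ss_res = sum((y - (slope * x + intercept)) ** 2 for x, y in zip(xs, ys))

    # Q^2: each compound is predicted by a model that never saw it.
    press = 0.0
    for i in range(n):
        train_x = xs[:i] + xs[i + 1:]
        train_y = ys[:i] + ys[i + 1:]
        s, b = fit_line(train_x, train_y)
        press += (ys[i] - (s * xs[i] + b)) ** 2

    return 1.0 - ss_res / ss_tot, 1.0 - press / ss_tot


# Toy values for illustration only; they are not data from the study.
descriptor = [1.0, 2.0, 3.0, 4.0, 5.0, 6.0]
affinity = [1.1, 1.9, 3.2, 3.8, 5.3, 5.9]
r2, q2 = r2_and_q2(descriptor, affinity)
print(f"R2 = {r2:.3f}, Q2 = {q2:.3f}")
```

For a least-squares model, Q² computed this way cannot exceed R², because each held-out prediction error is at least as large as the corresponding fitted residual. A large gap between the two values is usually read as a sign of overfitting.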

Variable Selection and Interpretation in Structure-Affinity Correlation Modeling of Estrogen Receptor Binders / Marini, Federico; Roncaglioni, A; Novic, M.. - In: JOURNAL OF CHEMICAL INFORMATION AND MODELING. - ISSN 1549-9596. - STAMPA. - 45:6(2005), pp. 1507-1519. [10.1021/ci0501645]

Variable Selection and Interpretation in Structure-Affinity Correlation Modeling of Estrogen Receptor Binders

MARINI, Federico;
2005

QSAR; chemometrics; artificial neural networks; variable selection; molecular descriptors
01 Journal publication::01a Journal article

Documents in IRIS are protected by copyright and all rights are reserved, unless otherwise indicated.

Use this identifier to cite or link to this document: https://hdl.handle.net/11573/145976