In an approach to the protein folding problem by a Genetic Algorithm, the fitness function plays a critical role. Empirical potentials are generally used to build the fitness function, and they must be weighted to obtain a valuable one. The weights are generally found by the comparison with a set of misfolded structures (decoys), but a dependence of the obtained fitness generally arises on the used decoys. Here we describe a general procedure to find out, from a given set of potentials, their better linear combination that could either identify the wild structure or prove their powerlessness. We use topological considerations over the hyperspace of the potentials, and a multiple linear inequalities solver. The iterated method flows through the following steps: it determines a direction in the hyperspace of the potentials, which identifies the native structure as a vertex among a set of misfolded decoys. A multiple linear inequalities solver obtains the direction. A Genetic Algorithm, tailored to the specific problem, uses the fitness function defined by this direction and generally reaches a new structure better than the experimental one, which is added to the ensemble. The decoys so generated are not dependent on a deterministic criterion. This iterative procedure can be stopped either by identifying an effective fitness function or by proving the impossibility of its achievement. In order to test the method under the hardest conditions, we choose numerous and heterogeneous quantities as components of the fitness function. This method could be a useful tool for the scientific community in order to test any fitness proposed and to recognize the most important components on which it is built.
A simple procedure to weight empirical potentials in a fitness function so as to optimize its performance in ab initio protein-folding problem / Luigi, Agostini; Morosetti, Stefano. - In: BIOPHYSICAL CHEMISTRY. - ISSN 0301-4622. - STAMPA. - 105:1(2003), pp. 105-118. [10.1016/s0301-4622(03)00130-3]
A simple procedure to weight empirical potentials in a fitness function so as to optimize its performance in ab initio protein-folding problem
MOROSETTI, Stefano
2003
Abstract
In an approach to the protein folding problem by a Genetic Algorithm, the fitness function plays a critical role. Empirical potentials are generally used to build the fitness function, and they must be weighted to obtain a valuable one. The weights are generally found by the comparison with a set of misfolded structures (decoys), but a dependence of the obtained fitness generally arises on the used decoys. Here we describe a general procedure to find out, from a given set of potentials, their better linear combination that could either identify the wild structure or prove their powerlessness. We use topological considerations over the hyperspace of the potentials, and a multiple linear inequalities solver. The iterated method flows through the following steps: it determines a direction in the hyperspace of the potentials, which identifies the native structure as a vertex among a set of misfolded decoys. A multiple linear inequalities solver obtains the direction. A Genetic Algorithm, tailored to the specific problem, uses the fitness function defined by this direction and generally reaches a new structure better than the experimental one, which is added to the ensemble. The decoys so generated are not dependent on a deterministic criterion. This iterative procedure can be stopped either by identifying an effective fitness function or by proving the impossibility of its achievement. In order to test the method under the hardest conditions, we choose numerous and heterogeneous quantities as components of the fitness function. This method could be a useful tool for the scientific community in order to test any fitness proposed and to recognize the most important components on which it is built.I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.