LARGE SCALE MOLECULAR STRUCTURE INVESTIGATION OF FLEXIBLE MOLECULES AND WEB-PORTAL CREATION FOR CONFORMATION RETRIEVAL

Abstract

Triglycerides are involved in important biological functions. Their numerous stable conformations radically influence their properties. We intend to exhaustively investigate the stable conformations of certain short chain triglycerides using well established Molecular Mechanics force fields and sophisticated global optimization techniques. The large amount of results will be handled by a database publicly accessible through a Web-portal. Subsequent refinement with Quantum Mechanical ab-initio calculations will also be offered via the portal.

 

Detailed description

The importance of triglycerides (fats) in the biological cycle can hardly be overestimated [1]. They serve as structural components of cell membranes and as a source of carbon atoms for biosynthetic reactions. Body fats are the most important energy reserve in animals and in addition are excellent thermal and electrical insulators. Among other things they serve as mechanical padding in the subcutaneous tissue and around organs. Triglycerides are also important raw materials in the food industry.

Triglycerides are highly flexible molecules thus adopting a large number of structural arrangements (conformations). In a recent study of triacetin [2], 109 different conformers were discovered, within an energy range of 3 Kcal/mol from the global minimum. The number of conformers grows rapidly with the carbon chain length.

We intend to exhaustively investigate the following triglycerides:

An estimate of the anticipated number of conformers for the above triglycerides along with the required single-cpu time on a 400MHz Sun450 is presented in the following table. Clearly more processing power enables the solution of larger sized problems.

Triglyceride

Number of conformers

Estimated CPU time

Triacetin

1000

15 min

Tripropionin

25000

20 hours

Tributyrin

700000

40 days

Trivalerin

18900000

60 months

We employ the Molecular Mechanics approach [3] using the well established MM2 [4], MM3 [5] (and perhaps CHARMM [6], AMBER [7]) all-atom force fields with appropriate parameters, as implemented by the Tinker [8] molecular modeling package. For a given molecule, the objective is to find the conformation corresponding to the global minimum of the steric energy functional, along with all the conformations corresponding to local minima within a prescribed energy range above the global value.

For the local minimization the Merlin Optimization Environment [9] will be used. This software package has been developed by our research team and the most recent version (3.0) appeared in the literature in 1998. Merlin offers a command driven, friendly user interface and robust implementations of state-of-the-art optimization algorithms. In addition the Merlin Control Language (MCL) [10], has been developed thus enabling the creation of optimization strategies. The global optimization algorithms will be written in MCL. This approach has been successfully applied with earlier versions of the above software packages and our results can be found in [2].

We propose to implement Stochastic [11] (Multistart, Simulated Annealing, Clustering) and Genetic algorithms [12] for global optimization with local tuning. Neural Network classification techniques will be used to accelerate the process by timely abandoning uninteresting points. These stochastic Global Optimization techniques lend themselves to parallel and/or distributed processing. Hence the load of the local optimization tasks will be distributed across a number of processors.

The resulting conformations will be stored in a database (MySQL [13]) in order to ease their handling, retrieval and further employment (visualization, ab-initio calculations).

In order to make our results available to the public we intend to create a web-portal. Through the portal one will be able to retrieve and visualize any of the stored conformations. In addition the portal will accept orders for further refining ab-initio calculations on any of the stored conformers using the GAMESS [14] package. A separate database will keep track of these quantum mechanical results.

 

References

1. Color Atlas of Biochemistry, Koolman J., Rohm K.-H., Thieme,1996.

2. How many Conformers of the 1,2,3-Propanetriol Triacetate are present in gas phase and in aqueous solution ?, D.G. Papageorgiou , I.N. Demetropoulos, I. E. Lagaris and P.T. Papadimitrioõ, Tetrahedron 52 (1996) 677.

3. Molecular Mechanics, Burkert U., Allinger N.L., ACS Monograph No 177, American Chemical Society, Washington DC, 1982.

4. Allinger N.L., J. Am. Chem. Soc. 99 (1977) 8127.

5. Allinger N.L. Yuh Y.H., Lii J.-H., J. Am. Chem. Soc. 111 (1989) 8551.

Lii J.-H., Allinger N.L., J. Am. Chem. Soc. 111 (1989) 8566.

Allinger N.L. Lii J.-H., J. Am. Chem. Soc. 111 (1989) 8576.

6. Brooks B., Bruccoleri R., Olafson B., States D., Swaminathan S. and Karplus M., J. Comput. Chem. 8 (1983) 132.

7. Weiner S., Kollman P., Nguyen D., Case D., J. Comput. Chem. 7 (1986) 230.

8. Ponder J. W., Richards F. M., J. Comput. Chem. 8 (1987) 1016.

Kundrot C. E., Ponder J. W., Richards F. M., J. Comput. Chem. 12 (1991) 402.

Dudek M. J., Ponder J. W., J. Comput. Chem. 16 (1995) 791.

Kong Y., Ponder J. W., J. Chem. Phys. 107 (1997) 481.

Dudek M., Ramnarayan K., Ponder J. W., J. Comput. Chem. 19 (1998) 548.

Pappu R. V., Hart R.K., Ponder J. W., J. Phys. Chem. B 102 (1998) 9725.

TINKER 3.7 of June 1999, Availability: http://dasher.wustl.edu/tinker/

9. MERLIN-3.0, A multidimensional optimization environment. D. G. Papageorgiou, I. N. Demetropoulos and I. E. Lagaris, Comp. Phys. Comm. 109 (1998) 227-249.

MERLIN 2.0, Enhanced and programmable version, D.G. Papageorgiou, C.S. Chassapis & I.E. Lagaris, Comp. Phys. Comm. 52 (1989) 241-247.

MERLIN, A portable system for multidimensional minimization, G.A. Evangelakis, J.P. Rizos, I.E. Lagaris & I.N. Demetropoulos, Comp. Phys. Comm. 46 (1987) 401-415.

10. The Merlin Control Language for strategic optimization, D. G. Papageorgiou, I. N. Demetropoulos and I. E. Lagaris Comp. Phys. Comm. 109 (1998) 250-275.

MCL - Optimization oriented programming language, C.S. Chassapis, D.G. Papageorgiou and I.E. Lagaris, Comp. Phys. Comm. 52 (1989) 223-239.

11. Boender C.G.E, Rinnooy A.H.G., Timmer G.T.,Math. Prog. 22 (1982) 125-140.

12. Genetic Algorithms + Data Structures = Evolution Programs, Michalewicz Z., Springer-Verlag, New York, 1994. 13. Availability: http://www.mysql.net 14. Schmidt M.W., Baldridge K.K., Boatz J.A., Elbert S.T., Gordon M.S., Jensen J.J., Koseki S., Matsunaga N., Nguyen K.A., Su S., Windus T.L., Dupuis M., Montgomery J.A., J.Comput.Chem. 14 (1993) 1347.