Font Size: a A A

Knowledge-based modeling of RNA three-dimensional structure

Posted on:2010-04-26Degree:Ph.DType:Dissertation
University:Stanford UniversityCandidate:Jonikas, Magdalena AnnaFull Text:PDF
GTID:1448390002987649Subject:Engineering
Abstract/Summary:
RNA's unique ability to act as both a messenger of genetic information (mRNA) and carry out complex chemical reactions in the cell distinguishes it from other biological polymers. By adopting complex three-dimensional (3D) structures, RNA molecules are able to perform functions including catalysis and regulation of transcription and translation. Understanding the functions of these molecules depends critically on knowing their structure. However, creating 3D structural models of RNA remains a significant challenge. In this work, I present two methods that apply an informatics approach to this problem, along with applications of these methods to RNA molecules.;The second method, Coarse to Atomic (C2A), uses geometric information in known RNA 3D structures to add full atomic resolution to coarse grain structural models, such as the ones generated by NAST. We use the existing physic-based molecular dynamics engine GROMACS to minimize the full atomic structures and ensure their chemical reasonability. The resulting full atomic structures remain within 1A of the coarse grain template used as input and can be used as starting structures for physics-based modeling and dynamics methods.;In this dissertation, we demonstrate several applications of these methods. First, we modeled RNA folding intermediates and pathways. RNA molecules form their 3D structures through complex folding pathways that include intermediate conformations, some of which last long enough to be considered trapped states. We used NAST to model RNA folding pathways and intermediates, including such "trapped" states.;By pipelining NAST and C2A, we have created an automated system for knowledge-based prediction of full atomic 3D structures. This method requires no modeling experience from the user and only limited information about the molecule, making possible the large-scale prediction of all RNA molecules with predicted secondary structure. We applied this pipeline first to 95 RNA molecules with known structures by constraining only the secondary structure. Finally, we applied the pipeline to size predicted secondary structure for three experimentally observed functional states of a single aptamer primary sequence and observed the effects of the various secondary structures on the possible 3D structures.;The first method, the Nucleic Acid Simulation Tool (NAST), builds low-resolution 3D structure predictions from limited information about the molecule. NAST uses a one-point-per-residue simplification of RNA structure and statistics of geometries observed in known RNA 3D structures to reduce the computational complexity of the problem and generate a large ensemble of solutions in a reasonable amount of time. Starting with the primary sequence and secondary structure prediction, we use this automated tool to build coarse grain models of several RNA molecules and compare them to their known crystal structures. We also use NAST to combine information from different sources, such as partial crystal structures and hand-made models built by modeling experts.
Keywords/Search Tags:RNA, Structure, NAST, Modeling, Information, Full atomic, Models
Related items