Font Size: a A A

Research Of Maize IncRNA Identification Algorithm Based On RNA Secondary Structure

Posted on:2018-01-04Degree:MasterType:Thesis
Country:ChinaCandidate:X D LiFull Text:PDF
GTID:2323330515478438Subject:Computer application technology
Abstract/Summary:PDF Full Text Request
LncRNA is an intracellularfunctional RNA molecule.Its length is greater than 200 nucleotides.They are located in the cytoplasm or nucleus.They have no long open reading frame.They also don't have the ability to code proteins.But IncRNA has many biological functions,such as gene imprinting,gene expression,recruitment of chromatin decoration,regulation of chromosome inactivation and so on.In recent years,a large number of IncRNA have been found in humans and animals,but the progress in plants is relatively slow.LncRNA identification is the basis of its classification,function and mechanism.Therefore,it is very important to study the effective lncRNA identification algorithm.At present,there is a lack of effective IncRNA identification algorithm,which affect the research progress of IncRNA in plants.At present,there are some false positive results.This paper introduce the secondary structure to reduce the false positive results.For the introduction of secondary structurecharacteristics,first need to predict IncRNA secondary structure,the current structurepredictionis limited to the overall structure of the sequence,ignoring the functional components of local structure conservative,which is not suitable for long sequences.In order to meet the demand of IncRNA secondary structure prediction,this paper proposes a new method of IncRNA subsection secondary structure prediction,and then carries on the statistical analysis toextracts the characteristics ofsecondary structure.Maize is an important food crop and industrial raw materials,Therefore,this paper takes maize as an example,mining IncRNA related characteristics,using bioinformatics and statistical methods to build maize IncRNA identification model,effective identification of maize IncRNA.In this paper,the full length cDNA data of maize was collected.RNA sequences satisfy the length conditions according to the definition of lncRNA;then identify the open reading frame of sequence,through the analysis of the existing lncRNA sequence set open reading frame threshold,and then verify homology with known protein,exclude the homologous sequence of lncRNA;With the identification of CPC part constitute candidate lncRNA.All the characteristics of lncRNA in maize were identified by analyzing the known sequence characteristics and unknown structure characteristics,which was used as the screening condition to identify the effective IncRNA and filter the false positive results.In this paper,the identification model is established by IncRNA sequencecharacteristics and secondary structurecharacteristics,and the existing IncRNA sequence and the full length cDNA data of maize were used to identify lncRNA.Compared with the current IncRNA identification algorithm,the identification accuracy is improved by introducing structure characteristics.The research work in this paper can promote the research on the function and mechanism of IncRNA in maize.In the futureresearch,I will further refine the IncRNA sequence characteristics and structurecharacteristics and improve the structure prediction algorithm to get more accurate results.In the case of conditions,combined with biological experimental methods and computational methods to identify IncRNA,rich IncRNA related database.
Keywords/Search Tags:IncRNA, secondary structure, structurecharacteristics, sequence characteristics
PDF Full Text Request
Related items