Font Size: a A A

Version Upgrade And Data Improvement Of PiRNA Database PiRBase

Posted on:2020-08-02Degree:MasterType:Thesis
Country:ChinaCandidate:Y P LuFull Text:PDF
GTID:2370330575451665Subject:Biochemistry and Molecular Biology
Abstract/Summary:PDF Full Text Request
piRNA is a kind of non-coding RNA mainly expressed in germ cells,whose length is range 24-32 nt and can interact with Piwi family proteins.piRNA plays important roles in genomic stability and methylation of reproductive system.With the development of sequencing technology,piRNA has been identified in many species,resulting in a large number of data need to be analyzed and collected.However,the existing database about piRNA has fewer sequences and species,fewer relationships between piRNA and m RNA/lnc RNA,lacking the database about piRNA and disease,low version of genome data,and which is inconvenience in searching and downloading,it can not reflect the achievements of piRNA research in recent years,nor can it satisfy the diversity of species,sequences and annotations of researchers.To upgrade the piRBase database,we searched the Pu Med database with the keyword "piRNA",screens the relevant piRNA published since the first edition of the database was launched in 2014.The latest genome and annotation information were downloaded from UCSC,the sequence and annotation information were collected using Bowtie,Bedtools and other software.In addition to updating the piRNA sequence,the annotations related to piRNA were added,the genome versions of all species were also updated,tools related to piRNA research in recent years were also provided in the database,such as pir Scan,2L-piRNA,and Seq to Name,Name Convert,which are convenient for researchers.The results of updating the database are as follows: the upgraded version of the database is the second edition,named piRBase: a comprehensive database of piRNA sequences.The number of species has increased to 21,which included birds,mammals,insects,amphibians,fish,etc.The number of sequences has increased to150 million,the data set has increased to 264,and the genomes of different species has been updated to the latest version.The annotations included diseases,transposons,as well as Genes,RNA/lnc RNA,methylation.In terms of sequence data,species data and piRNA annotations,piRBase database is in the leading position in similar databases.The piRBase database provides a sufficient variety of species and a sufficient number of sequences for researches and further collation and mining of sequence information.The piRBase database can assist researchers in both piRNA research and communication research.
Keywords/Search Tags:piRNA, database, data
PDF Full Text Request
Related items