RNA(Ribonucleic Acid)is an indispensable molecule in life,which undertakes important functions such as transmitting genetic information,regulating the transcription process and catalyzing the reaction.The exercise of its functions is closely related to its location in cells.Today is an era of information explosion.Driven by the development of various scientific technology,more and more life science data are showing exponential growth,and the field of RNA subcellular localization is no exception.How to quickly find the most useful information for research in these complex data is an urgent problem to be solved in this era.In order to meet the researcher’s massive data retrieval and analysis needs,this study builds a resource platform,RNALocate v2.0,with rich functions such as search,browse,download and analysis based on Hyper Text Markup Language(HTML),Cascading Style Sheets(CSS),JavaScript(JS),Hypertext Preprocessor(PHP),MySQL database and Smarty template engine.Compared with the previous version,version 2.0 expands the comprehensive information related to RNA,realizes the effective integration of data from different sources,designs the most suitable way to display information,and embeds three sequence-based tools for predicting the subcellular localization of RNA.Relying on this platform,researchers can use the abundant comprehensive information on the subcellular localization of RNA to deeply explore its functional mechanism.The collection and integration of massive information is an inescapable part of updating and maintaining the resource platform.In the era of rapid data growth,it is impractical to complete this work manually.In order to meet this practical need,this research also developed corresponding text mining tools.This tool adopts the strategy of deep learning model Albert combined with the stacking ensemble learning method to realize the recognition of three types of named entities(RNA,subcellular location and disease)in the text,and complete the extraction of corresponding interaction information,which meets the requirements of rapid screening of documents to obtain key information.In summary,this study successfully constructed a comprehensive information resource platform for RNA subcellular localization,and developed text mining tools for this specific field.In the future,the tool will become a powerful assistant for platform updating,and the platform will also provide a data basis for the optimization of the tool.The two cooperate with each other to form a virtuous circle,helping to explore the mysteries of the subcellular localization and functional mechanism of RNA. |