| In recent years,the scope of use of the pharmacopoeia has become wider and wider,and deficiencies in the current use of the pharmacopoeia have gradually emerged.At present,the pharmacopoeia has only a paper version and PDF electronic version.The paper version is expensive and cumbersome.PDF version is relatively easy to read,but it is difficult to meet more humanized human needs.For example,copying and editing,attribute-based query and other more granular search,comparing the similarities and differences of the same drug in different pharmacopoeia,recommending similar drugs in the traits,curative effect and other characteristics.Currently,there are a few websites that provide pharmacopoeial inquiries,but the drug information is still in PDF format and it still cannot meet the above requirements.In order to allow pharmacopoeia to meet the above individual needs,better serve practitioners who use pharmacopoeia.This article aims to develop a pharmacopoeial data retrieval,comparison,and recommendation platform that is easy to use and can provide powerful capabilities for retrieving drug information,comparing similarities and differences among drugs in different pharmacopoeias,and recommending similar drugs from different attributes.The search function includes ordinary search and advanced search.Ordinary search allows the user to query based on the drug name,and advanced search provides the function of querying various attributes of the drug as key information.In order to achieve the goal of this article,the author needs to completely electronicize and structure the current pharmacopoeia information,then analyze it and develop a Web platform.First of all,this article introduces and analyzes the current situation of pharmacopoeia,elaborates the current problems of using pharmacopoeia.And sorts out the functions and performance requirements of the platform to be developed.In order to obtain the original data,this paper studies the PDF analysis,web crawler and other techniques,and uses mature and effective tools such as PDF Box,POI,and PySpider to digitize pharmacopoeia data.In order to structure the pharmacopoeial information,this paper introduces and sorts out the current data structuring technology and selects a structured method which uses TF-IDF to find public attributes and uses regular expressions and other text processing methods to standardize data,and save structured data to MySQL.On this basis,this paper uses Lucene,an open source retrieval framework,to implement the platform's retrieval function.Using open source natural language processing tools LingPipe and Lucene to calculate the similarity of the same drug in different Pharmacopoeia,the similarity of drugs in each attribute dimension.This platform uses the index to save the calculation results and provides comparison and recommendation functions directly.Finally,a web platform was built based on frameworks such as Spring,SpringMVC,and MyBatis.The realized platform basically meets the design goals and can provide more powerful and personalized functions than the existing platform.It provides richer information for relevant practitioners who use the pharmacopoeia and greatly improves the query efficiency.And the drug data analysis function greatly facilitates decision support. |