Research On Online Drug Information Extraction Algorithm

Posted on: 2011-06-17
Degree: Master
Type: Thesis
Country: China
Candidate: Y Y Shen
Full Text: PDF
GTID: 2178360305997854
Subject: Computer software and theory
Abstract/Summary:
The Internet is currently flooded with false drug information, so advanced web information extraction technology is needed to strengthen the supervision that the relevant government departments exercise over the online pharmaceutical market. To this end, the research group that the author joined has investigated technology for supervising online drug information. The author worked on one of its key components, the algorithm for extracting online drug information, and achieved some notable results.
Traditional web information extraction techniques cannot meet the need for comprehensive, accurate, real-time, and automatic extraction of online drug information, because of defects such as low accuracy, low coverage, and the need for manual intervention.
Building on related studies in China and abroad, this thesis proposes a novel online drug information extraction algorithm. The algorithm introduces semantic technology to build a three-dimensional semantic dictionary, which masks the heterogeneity of page content and page structure across different drug trading websites. It also exploits the observation that the attributes of a target drug tend to cluster together on a drug information page, and uses the basic theory of information entropy to locate and extract the target information intelligently.
The thesis also presents the concrete design and implementation of the algorithm, and experiments show that it greatly reduces the manual intervention required for information extraction while achieving high accuracy and recall.
Applied in practice, the algorithm can obtain, supervise, and manage online drug information automatically, comprehensively, accurately, and in real time. It gives government drug regulators a rich basis for supervision and a technical means of intelligent, full-coverage online monitoring, and is therefore of practical significance for regulating the online pharmaceutical market and ensuring medication safety.
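The abstract does not give the algorithm's details, so the following is only a minimal sketch of how a semantic dictionary and an entropy measure might be combined to locate an attribute-dense region of a page. Everything in it is an assumption made for illustration: the dictionary entries, the function names, the block representation (a page pre-split into lists of text lines), and the scoring rule are hypothetical and are not taken from the thesis. The sample page content is made up as well.

```python
import math
from collections import Counter

# Hypothetical semantic dictionary: site-specific labels -> canonical attribute.
# The thesis describes a three-dimensional dictionary; this flat mapping is a
# simplification for illustration only.
SEMANTIC_DICT = {
    "drug name": "name",
    "product name": "name",
    "manufacturer": "manufacturer",
    "produced by": "manufacturer",
    "approval number": "approval_no",
    "license no": "approval_no",
    "price": "price",
}

def shannon_entropy(counts):
    """Shannon entropy (in bits) of a list of non-negative counts."""
    total = sum(counts)
    if total == 0:
        return 0.0
    probs = [c / total for c in counts if c > 0]
    return -sum(p * math.log2(p) for p in probs)

def attribute_hits(block, dictionary=SEMANTIC_DICT):
    """Count canonical attributes whose labels occur in a block of text lines."""
    hits = Counter()
    for line in block:
        text = line.lower()
        for label, attr in dictionary.items():
            if label in text:
                hits[attr] += 1
    return hits

def locate_target_block(blocks, dictionary=SEMANTIC_DICT):
    """Return (index of the most attribute-dense block, entropy of hits across blocks).

    A low entropy means the dictionary hits are concentrated in few blocks,
    which is the aggregation property the abstract refers to.
    """
    per_block = [attribute_hits(b, dictionary) for b in blocks]
    totals = [sum(h.values()) for h in per_block]
    spread = shannon_entropy(totals)
    # Prefer the block covering the most distinct attributes, then the most hits.
    best = max(range(len(blocks)), key=lambda i: (len(per_block[i]), totals[i]))
    return best, spread

# Made-up page, already split into blocks (navigation, product details, footer).
blocks = [
    ["Home", "About us", "Price list"],
    ["Product name: Aspirin", "Produced by: Example Pharma",
     "License No: X123", "Price: 9.90"],
    ["Contact", "Copyright"],
]
index, spread = locate_target_block(blocks)
print(index, round(spread, 3))
```

In this sketch the dictionary hides the fact that different sites label the same attribute differently, and the entropy of hits across blocks indicates how strongly the target attributes are concentrated. A real system would need a page segmentation step and a far richer dictionary.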
Keywords/Search Tags:Information Extraction, Semantic Dictionary, Information Entropy, Medical E-Business