Research On Management And Analysis Of Pedigree Big Data

Posted on: 2018-11-11
Degree: Master
Type: Thesis
Country: China
Candidate: M M Xu
Full Text: PDF
GTID: 2428330512483578
Subject: Computer software and theory
Abstract/Summary:
A pedigree, also known as a genealogy, is a book-form document that records a family's lineage and development history, often in tabular form. Pedigree culture has developed alongside the long history of Chinese culture. With the recent rise of interest in ancestral culture, pedigrees have received more and more attention, and research on pedigrees has developed rapidly. Pedigree literature helps people understand human history. Pedigree digitization uses computer technology to organize genealogical data and text-image information in a reasonable way; it makes pedigree research far more convenient and supports personalized pedigree-related services. A pedigree system needs to store a large amount of pedigree data, so how to organize and manage these data is a prominent research question. The stored pedigree data also contain a wealth of valuable information that is of great significance to research in history, culture and the humanities. Which topics and analytical methods are effective for analyzing this large volume of pedigree data is a direction we are exploring. Finally, how to use visualization technology to display the analysis results effectively is also worth in-depth study. To address these problems in the digital development of pedigrees, this thesis improves and enriches pedigree digitization from two sides: optimization of the pedigree system, and analysis and application of pedigree data. The main achievements of this thesis are as follows:

(1) Optimizing the pedigree system with table partitioning and the in-memory database SQLite. Based on an analysis of the pedigree system, this thesis examines the characteristics and shortcomings of pedigree data management in that system, and introduces table partitioning and the in-memory database SQLite to optimize the management of pedigree data.

(2) Data analysis and visualization of pedigree data. Statistical analysis is used to summarize the value fields in the genealogy data and so deepen the understanding of the data, and correlation analysis is used to study the relationship between the education level of parents and that of their children. The results of the analysis are then displayed graphically using data visualization technology.

(3) Family roots. "Family roots" is, in essence, the study of blood relationships between people and of the relationships between family branches and between families. The thesis approaches it from two angles: query matching on character names and other information, and similarity matching on characters' migration histories. Query matching is divided into three kinds, including fuzzy query and extended fuzzy query; effective solutions are proposed based on the similarity of Chinese characters, using database query techniques, indexing techniques and the SSC coding method. For matching migration history information, solutions are proposed based on similarity calculation and clustering of migration trajectories.
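
As a rough illustration of achievement (1), the following Python sketch combines an in-memory SQLite database with application-level table partitioning. It is not the thesis's implementation: the abstract does not give the schema or the partitioning key, so the table names (`person_p0` ... `person_p3`), the columns, the partition count and the choice of surname as the partition key are all assumptions made for this example. SQLite has no built-in table partitioning, so the sketch routes each record to one of several tables by hashing the surname.

```python
import sqlite3
import zlib

NUM_PARTITIONS = 4  # assumed partition count, chosen only for illustration


def partition_table(surname: str) -> str:
    """Map a surname to a partition table name using a stable hash (CRC32)."""
    bucket = zlib.crc32(surname.encode("utf-8")) % NUM_PARTITIONS
    return f"person_p{bucket}"


def create_schema(conn: sqlite3.Connection) -> None:
    """Create one table and one surname index per partition (hypothetical schema)."""
    for i in range(NUM_PARTITIONS):
        conn.execute(
            f"CREATE TABLE IF NOT EXISTS person_p{i} ("
            "id INTEGER PRIMARY KEY, "
            "surname TEXT NOT NULL, "
            "given_name TEXT NOT NULL, "
            "generation INTEGER)"
        )
        conn.execute(
            f"CREATE INDEX IF NOT EXISTS idx_p{i}_surname ON person_p{i}(surname)"
        )


def insert_person(conn, surname, given_name, generation=None):
    """Insert a record into the partition selected by its surname."""
    table = partition_table(surname)
    cur = conn.execute(
        f"INSERT INTO {table} (surname, given_name, generation) VALUES (?, ?, ?)",
        (surname, given_name, generation),
    )
    return cur.lastrowid


def find_by_surname(conn, surname):
    """Query only the single partition that can contain this surname."""
    table = partition_table(surname)
    rows = conn.execute(
        f"SELECT given_name FROM {table} WHERE surname = ? ORDER BY id",
        (surname,),
    ).fetchall()
    return [row[0] for row in rows]


if __name__ == "__main__":
    # ":memory:" asks SQLite to keep the whole database in RAM.
    conn = sqlite3.connect(":memory:")
    create_schema(conn)
    insert_person(conn, "Xu", "Ming", 1)
    insert_person(conn, "Xu", "Hua", 2)
    print(find_by_surname(conn, "Xu"))
```

Because a lookup by surname touches only one partition, each query scans a smaller table and index than it would in a single large table; whether the thesis partitions by surname, by family or by some other key is not stated in the abstract.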
Keywords/Search Tags:pedigree, pedigree system, data management, data analysis, data visualization, family roots