Font Size: a A A

Research On The Resource Integration Of Chinese Name Authority Files And Wikipedia

Posted on:2018-06-22Degree:MasterType:Thesis
Country:ChinaCandidate:Q H XueFull Text:PDF
GTID:2348330521450224Subject:Information Science
Abstract/Summary:PDF Full Text Request
CCCNA(Cooperative Committee for Chinese Name Authority)was co-sponsored by NLC,CCS,CALIS,JULAC-HKCAN in 2003.Under the unified coordination of CCCNA,four local libraries store their name authority files in one place and establish a one-stop search platform.However,the level of resource integration of the platform is only the integration of the data level,although which solve the problem of decentralized preservation of resources,to a certain extent,which did not solve the technical standards of heterogeneous and the problem of the resource duplication.The heterogeneous of the technical standards form led to the open degree of the Chinese name authority files is still confined to the library,cannot be effective interaction with the network resources;The duplication of the resource content led the Chinese name authority files began to appear some problems,that is the same individual has different name identification,different individuals has the same identification,the change of the name identification and the name's format is not standardized when communicating during the process of cross-system communication,which led to the platform cannot meet the needs of users in the information retrieval.The world's largest network encyclopedia —Wikipedia was established in 2001,is now ranked fifth in the global website,and become the most influential free online encyclopedia.At present,the Chinese Wikipedia has 811305 items,the characters(the number of fields in the info-box is greater than 3)has more than 8,000 items,Every items described in biographies,detailing information such as profile,life,work,academic research,contribution,evaluation,friend,student and influence.Wikipedia can be an effective supplement to the Chinese name authority files to help solve the current existing problems in the Chinese name authority files.Based on literature of development of the name authority files at home and abroad,the resource integration of the name authority files,and the integration between the name authority data and the network resources,this thesis analyzes the resource usage of the existing CCCNA database retrieval system,discusses the advantages of the network resources—Wikipedia as a goal to achieve integration,as well as the necessity of integrating name authority files with Wikipedia resources.After the realization of data integration,Chinese name authority files can realize further information integration with Wikipedia,and even semantic integration.In the information integration with Wikipedia,this paper focuses on Wikipedia interface MediaWiki API,discusses the link of personal name authority file and Wikipedia from two aspects.The first method is to achieve links from name authority file to Wikipedia,and the second is to generate dynamic profile to provide the necessary source of the information for name authority files.By effectively creating links Chinese name authority file with the Wikipedia to help identify and retrieve personal names.In the aspect of semantic integration,this thesis focuses on Wikidata,with the research methods of induction and comparison,expounds the development history of Wikidata,analyzes its data characteristic and model,and then summarizes fives methods to access data.Wikidata has the characteristics of open collaboration,multiple languages and well-structured.The data model of Wikidata defines the entities and attributes based on its items.It also provides diversified data access methods.Then this paper designs the semantic integration model of the Chinese name authority file and Wikidata,based on the status of the data organization of Chinese name authority file.In the realization of semantic integration,This thesis Uses the method of associating data technology,firstly,semantics the data of the Chinese name authority file and downloads the human character data in Wikidata.On the basis of this jobs,using the PARIS algorithm to realize the entity alignment between the semantics of the Chinese name authority file and Wikidata and finally the use of data visualization of the way to show the result.
Keywords/Search Tags:Chinese name authority file, Resource Integration, Wikipedia, Wikidata, Information organization
PDF Full Text Request
Related items