Font Size: a A A

Research On Data Acquisition Technology Of Social Network Under Cloud Computing Platform

Posted on:2014-02-06Degree:MasterType:Thesis
Country:ChinaCandidate:N N LiFull Text:PDF
GTID:2248330398972283Subject:Computer technology
Abstract/Summary:PDF Full Text Request
With the development of the Internet and the extension of human interaction and communication needs, social network has begun to influence people’s life profoundly. Social network makes the amount, speed and range of information transmission increase significantly. People use these social networking sites or platforms to write, share, comment, discuss, interact and communicate. The social network has become an important part of people’s life. Now, several PB data of social network can be produced every day, including log, micro-blog, photos, video, etc., which update in real-time and constantly. So new techniques and methods are necessary when we obtain and process the massive social network date in real-time.This paper mainly studies the data acquisition technologies and methods of social network under the real-time cloud computing platform. The main works of this paper are as follows:(1) Using the real-time cloud computing platform for acquisition of data in massive social network. Design the task scheduling strategy of data acquisition and the protocol parse method of social networking under the real-time cloud computing platform. Then we get the original data of the social network on the base of parsing the social network protocol.(2) Analysis the original file we have obtained after parsing social network protocol under the real-time cloud computing platform. In order to get the target data, this paper design the regular expression for information extracting.(3) After researching on approximate pattern matching for implicit deformation text processing method, a new approximate string matching algorithm is proposed and implemented to achieve the purpose of cleaning the text we have obtained after information extracting, which has been accelerate through using GPU.
Keywords/Search Tags:Cloud Computing, Protocol Parsing, Social Network, Information Extracting, Approximate match
PDF Full Text Request
Related items