
Design And Implementation Of Data Content Resource Management System For Publishing Industry

Posted on: 2017-01-12 | Degree: Master | Type: Thesis
Country: China | Candidate: Y T Wang | Full Text: PDF
GTID: 2348330512952069 | Subject: Software engineering
Abstract/Summary:
In the Internet age, traditional publishing faces many challenges. The network has become the main channel for disseminating information, while sales of paper books are growing slowly. To achieve leapfrog development, traditional publishers must invest in internal information technology and upgrade their existing internal information systems. These systems have accumulated a large amount of business data, which has become a valuable asset for publishers. How to tap the data "hidden" in the back end, eliminate information silos, and present business conditions in support of business decision-making is one of the key points of this research. In addition, the publishing house has accumulated a wealth of manuscript content. As readers' habits change, terminal devices diversify, and layout format requirements multiply, content has to be reused and recombined, with printing decided according to demand, so manuscript files must be reprocessed and converted into content "fragments." This research therefore focuses on the development of a manuscript fragmentation system. The main work is as follows:

First, the data analysis part. It establishes a subject-oriented data warehouse; uses Microsoft SQL Server Integration Services to extract and transform the key business indicators from the business data and load them into the data warehouse; uses SQL Server Analysis Services to create multidimensional data cubes; and uses On-Line Analytical Processing (OLAP) for accurate data analysis, enabling multidimensional browsing of the multidimensional data sets. Finally, the data analysis system is deployed and presented so that business decision-makers can access business data easily and in a timely manner.

Second, the content management part, which is the manuscript fragmentation part. An input-processing component for designated Word documents (Doc/Docx) follows the "scientific and technical content structure indexing and processing specification" formulated by the publishing house. Using .NET, XML, and other technologies, it operates on the Word object library to recognize and extract information and obtain the relevant information blocks; splits the document's paragraphs, pictures, formulas, tables, etc.; generates XML files; and stores the resulting fragments in structured form in accordance with the aforementioned specification, to meet the needs of business and digital publishing development now and for some time to come.

Third, while the Word manuscript is being fragmented, the fragment information is extracted into the newly created data warehouse. This requires no major changes to the old information system, yet it gives publishing house management access to data on digital content processing and completes the integration of the two modules of the newly developed system.

This thesis developed a data analysis and content management system that supplements and upgrades the traditional information systems and business data, and that also reserves good content resources for future dynamic publishing. After the system goes online, it not only strengthens publishing house managers' insight into business data but also lets them analyze the progress of manuscript fragmentation and other content information at any time.
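
The fragmentation step and the extraction of fragment information for the data warehouse might be sketched as follows. This is a minimal illustration and not the thesis implementation, which uses .NET and the Word object library. It is written in Python with only the standard library, and it assumes the manuscript blocks have already been recognized. The names `BLOCKS`, `fragment_manuscript`, the `manuscript` and `fragment` element names, and the row layout are all hypothetical; the publishing house's actual specification is not given in the abstract.

```python
import xml.etree.ElementTree as ET
from collections import Counter

# Hypothetical input: blocks already recognized in one manuscript, as
# (kind, content) pairs. Recognition via the Word object library is not
# reproduced here.
BLOCKS = [
    ("paragraph", "Chapter 1 introduces the data warehouse."),
    ("table", "Table 1: sales by region"),
    ("formula", "a + b = c"),
    ("paragraph", "Chapter 2 describes the cube design."),
    ("picture", "figure1.png"),
]


def fragment_manuscript(manuscript_id, blocks):
    """Split recognized blocks into XML fragments and tally them by kind."""
    root = ET.Element("manuscript", id=manuscript_id)
    counts = Counter()
    for seq, (kind, content) in enumerate(blocks, start=1):
        # Each block becomes one fragment element, numbered in document order.
        frag = ET.SubElement(root, "fragment", seq=str(seq), kind=kind)
        frag.text = content
        counts[kind] += 1
    # One row per fragment kind: an assumed shape for rows that could be
    # loaded into a warehouse table while the manuscript is fragmented.
    rows = [
        {"manuscript_id": manuscript_id, "kind": kind, "fragments": n}
        for kind, n in sorted(counts.items())
    ]
    return ET.tostring(root, encoding="unicode"), rows


xml_text, warehouse_rows = fragment_manuscript("ms-001", BLOCKS)
print(xml_text)
for row in warehouse_rows:
    print(row)
```

Producing the per-kind rows in the same pass as the XML mirrors the third part of the work, where fragment information is collected during fragmentation instead of by a separate scan of the old information system.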
Keywords/Search Tags: Business Intelligence, Data Warehouse, OLAP, Dynamic Publishing, XML