Font Size: a A A

Design And Implementation Of High-performance Data Warehouse DW2.0Architecture In Enterprise Environment

Posted on:2013-05-22Degree:MasterType:Thesis
Country:ChinaCandidate:Y YuFull Text:PDF
GTID:2248330392960515Subject:Software engineering
Abstract/Summary:PDF Full Text Request
By years working with informatization, enterprises have remained a lot of historical data.After the first-generation data warehouse s establishment, they can already take advantage ofexisting historical data to make transaction data analysis. But with data cost s continuousdecreasing, the whole world is in big bang of digital era, the quantity of data human beingcreated has increased rapidly that make enterprise survival environment change a lot. To them,how to adapt to this kind of change has become one of the decisive factors, which maintaintheir position at the forefront of industry. The first-generation data warehouse has showedproblems in unstructured data processing, capacity and economy, which could not satisfy thedemand of enterprises for the storage and analysis of data anymore.The main object of this thesis is to achieve the application of the second-generation datawarehouse (DW2.0) on the basis of first generation through introducing the framework ofDW2.0in combination with current popular parallel processing computing technology.The thesis introduced the technology background of achieving DW2.0, devised theoverall architecture of data warehouse platform, described elaborately and designedcomprehensively each component of the platform, and achieved three essential technologiesin DW2.0. The first is achieving parallel processing computing technology includinghardware composition and the application of database software, which is capable of fulfillingthe demand of a large number and high-performance storage and analysis on the premise ofenterprises limited increased cost. The second is achieving the storage and analysis ofunstructured data to assist enterprises to analyze unstructured or semi-structured data thattakes60%in enterprise, which would help an enterprise take advantage of all data in theenterprise to acquire profit. The last is dividing the data in data warehouse according to theusing probability and access pattern into four sectors which are interactive sector, integratedsector, near line sector and archival sector to manage data life cycle for enhancement of data warehouse performance.The thesis will quote examples to explain how DW2.0will support effectively theoperation of strategic decision-making tools BSC (Balanced Score Card) system in anenterprise. BSC system is a management system which helps an enterprise turn strategies intoaction. The prominent function of BSC project is to divide strategic targets of an enterpriseinto four basic aspects which are financing, customer, internal processes, learning andgrowing, to classify the above four goals with BSC strategy map into specific targets withreciprocal causation among them, and to improve the management capacity of an enterprisethrough high-quality implement and evaluation of the targets. The implementation of DW2.0will provide comprehensive data support for BSC system; these data includes not onlystructured data from ERP (Enterprise Resource Planning) system of an enterprise, but alsounstructured data produced during the operation of the enterprise. The overall analysis ofstructured and unstructured data is able to show important information of an enterpriseneglected during its operation and management, help the enterprises gain more accurate andeffective BSC target grade, facilitate it to improve constantly its business process andmanagement methods so as to increase their core competence.
Keywords/Search Tags:DW2.0, unstructured data, parallel data processing, data life cycle, BSC
PDF Full Text Request
Related items