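SPDB Project Training
Data Warehouse Technical Architecture and Solution
Huang Yuhui (黄予辉)
December 13, 2008

Agenda
- Introduction to Teradata
- Architecture design principles
- Overall architecture
- ETL architecture

Teradata Company Overview
- Teradata Corporation listed on the New York Stock Exchange on October 1, 2007
  - Global leader in enterprise data warehousing
  - EDW/ADW, database technology, analytic solutions, consulting services
  - Ranked first in data warehousing by Gartner for 9 consecutive years, starting in 1999
- One of the top 10 publicly listed software companies in the US
  - Member of the S&P 500
  - NYSE ticker symbol: "TDC"
  - NYSE Arca Tech 100
- World-class customers around the globe
  - More than 850 world-class customers
  - More than 2,000 installed systems
- More than 5,500 employees worldwide

Teradata Market Share
Teradata Top 10 (FORTUNE Global Rankings, July 2006)
- 50% of Top 10 Global Retailers
- 40% of Top 10 Global Commercial & Savings Banks
- 90% of Top 10 Global Telco Firms
- 60% of the Top 10 Transportation Logistics Firms
- 70% of Top 10 Global Airlines

Leading industries
- Banking/Financial Services
- Government
- Insurance & Healthcare
- Manufacturing
- Retail
- Telecommunications
- Transportation Logistics
- Travel

World class customer list
- More than 850 customers
- More than 2,000 installations

Global presence
- Over 100 countries

Teradata: Driving the Sustainable Growth of World-Class Enterprises
[Figure: customer logos grouped by industry: Retail, Financial, Travel, Communications, Insurance, Manufacturing, Post. Legible logo text: AoyamaShoji.]

Teradata: The Leader in Data Warehouse Technology
The Magic Quadrant is copyrighted 9/12/06 by Gartner, Inc. and is reused with permission. The Magic Quadrant is a graphical representation of a marketplace at and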
for a specific time period. It depicts Gartner's analysis of how certain vendors measure against criteria for that marketplace, as defined by Gartner. Gartner does not endorse any vendor, product or service depicted in the Magic Quadrant, and does not advise technology users to select only those vendors placed in the "Leaders" quadrant. The Magic Quadrant is intended solely as a research tool, and is not meant to be a specific guide to action. Gartner disclaims all warranties, express or implied, with respect to this research, including any warranties of merchantability or fitness for a particular purpose.

Gartner Magic Quadrant for Data Warehouse DBMS Servers, 2006. Feinberg, Hardcastle, Butler, Dawson (8/25/2006)
Gartner Magic Quadrant for Data Warehouse DBMS, 2006. Feinberg & Beyer (9/2006)
[Figure: two Magic Quadrant charts, labeled Software and Hardware.]

Teradata System Scalability
[Figure: chart of scalability dimensions. Legible labels, grouped by the axis they most plausibly belong to:]
- Data Storage (raw, user data): 5 TB, 10 TB, 15 TB, 20 TB
- Workload Mix: Batch Reporting, Repetitive Queries; "Iterative", Ad Hoc Queries; Data Analysis/Mining; Near Real Time Data Feeds
- Query Complexity: 3-5 Way Joins; 5-10 Way Joins; 15+ Way Joins + OLAP operations + Aggregation + Complex "Where" constraints + Views
- Query Data Volumes: MBs, GBs, TBs
- Data Model Sophistication: Simple Star; Multiple, Integrated Stars; Normalized; Multiple, Integrated Stars and Normalized
- # of Concurrent Queries: 1,000
- Other labels: Active Data Warehousing, Parallelism

Architecture Cube
[Figure: architecture cube. Views: Business, Information, Application, Technology. States: Current, Transition, Target. Levels: Logical layer, Solution, Project. Faces: Logical architecture layer, Physical. Axis labels: order of operations, level of definition.]
Success starts with a good architecture design method.

Business: What is the business model, where is it going, how does it plan to get there? The requirements. The business process.

Information: What data do we have and need to support the Business View? Information is also calculations and rules. Typically we see Logical & Physical data models here, all subject areas of the business. The data is worked on by the applications,
used by the business.

Application: What functions and interrelations of functions do the applications have and need? Sales, Marketing, Pricing, Manufacturing, Customer Management. Works against information to support the Business View. The applications work within the confines of the Information architecture,
creating and consuming the data elements, rules and definitions of that architecture view.

Technology: The bit IT cares about most. The easiest to get WRONG because we don't concentrate on the other aspects of architecture FIRST! What do we have and need to support the other 3 Views without limitation? Choosing an ETL tool before you have defined an architecture is "SUB OPTIMAL".

A separation of Business, Data, Process and Technology as appropriate
[Figure: architecture cube. Views: Business, Information, Application, Technology. States: Current, Transition, Target. Levels: Logical layer, Solution, Project.]

EDW Application Logical Architecture
- Tier 3, Semantic Layer: views, logical data marts, dependent data marts, analytic knowledge base
- Tier 2, Single Version Of Corporate Memory: multi-purpose model, historical data, transformed
  Example entities shown in the figure:
  - CUSTOMER: CUSTOMER NUMBER, CUSTOMER NAME, CUSTOMER CITY, CUSTOMER POST, CUSTOMER ST, CUSTOMER ADDR, CUSTOMER PHONE, CUSTOMER FAX
  - ORDER: ORDER NUMBER, ORDER DATE, STATUS
  - ORDER ITEM BACKORDERED: QUANTITY
  - ITEM: ITEM NUMBER, QUANTITY, DESCRIPTION
  - ORDER ITEM SHIPPED: QUANTITY, SHIP DATE
- Tier 1, Operational Image: image of the operational source data
- META-DATA is drawn alongside every tier; the figure also carries the labels PHY and Log.

Reference Architecture - Source View
[Figure: reference architecture diagram. Legible labels:]
- Enterprise Service Bus
- Applications and their middleware: TX1 APPL (NW, DA-MW); TX2 APPL, TX3 APPL, TX4 APPL, BI APPL, Tactical APPL, Strategic APPL (each with MSG-MW, DA-MW)
- Business Process Automation
- Analytic & Decision Making Repositories; Analytic & Decision Making Services
- Transactional Repositories; Transactional Services
- Data Acquisition & Integration: Batch, Streaming
- Users: Enterprise Users (Browsers and/or Portal), Back-office users, Frontline Users
- Service Brokers; Business Rules; MSG-MW; Event
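
The relationship between Tier 2 (Single Version Of Corporate Memory) and Tier 3 (Semantic Layer) in the EDW logical architecture above can be sketched in a few lines. This is only an illustration, not the deck's implementation: it uses Python's built-in sqlite3 module rather than Teradata, the key columns linking the tables are assumptions (the slide lists attributes but no keys), the table is named `orders` because ORDER is a reserved SQL word, and the view name `v_shipped_by_city` and all sample rows are hypothetical.

```python
import sqlite3


def build_demo_warehouse() -> sqlite3.Connection:
    """Create an in-memory database with Tier 2 tables and one Tier 3 view."""
    conn = sqlite3.connect(":memory:")
    conn.executescript(
        """
        -- Tier 2: normalized tables named after entities on the slide.
        -- The linking key columns are assumptions for this sketch.
        CREATE TABLE customer (
            customer_number INTEGER PRIMARY KEY,
            customer_name   TEXT NOT NULL,
            customer_city   TEXT
        );
        CREATE TABLE item (
            item_number INTEGER PRIMARY KEY,
            description TEXT
        );
        CREATE TABLE orders (
            order_number    INTEGER PRIMARY KEY,
            customer_number INTEGER NOT NULL REFERENCES customer(customer_number),
            order_date      TEXT,
            status          TEXT
        );
        CREATE TABLE order_item_shipped (
            order_number INTEGER NOT NULL REFERENCES orders(order_number),
            item_number  INTEGER NOT NULL REFERENCES item(item_number),
            quantity     INTEGER NOT NULL,
            ship_date    TEXT,
            PRIMARY KEY (order_number, item_number)
        );

        -- Tier 3: a view acting as a logical data mart. It stores no data
        -- of its own; it is derived from the Tier 2 tables at query time.
        CREATE VIEW v_shipped_by_city AS
        SELECT c.customer_city AS city,
               SUM(s.quantity) AS shipped_quantity
        FROM customer c
        JOIN orders o ON o.customer_number = c.customer_number
        JOIN order_item_shipped s ON s.order_number = o.order_number
        GROUP BY c.customer_city;
        """
    )
    # Hypothetical sample rows, for demonstration only.
    conn.executemany(
        "INSERT INTO customer VALUES (?, ?, ?)",
        [(1, "Customer A", "Shanghai"), (2, "Customer B", "Beijing")],
    )
    conn.executemany(
        "INSERT INTO item VALUES (?, ?)",
        [(10, "Widget"), (11, "Gadget")],
    )
    conn.executemany(
        "INSERT INTO orders VALUES (?, ?, ?, ?)",
        [
            (100, 1, "2008-12-01", "SHIPPED"),
            (101, 1, "2008-12-02", "SHIPPED"),
            (102, 2, "2008-12-03", "SHIPPED"),
        ],
    )
    conn.executemany(
        "INSERT INTO order_item_shipped VALUES (?, ?, ?, ?)",
        [
            (100, 10, 5, "2008-12-05"),
            (101, 10, 2, "2008-12-06"),
            (102, 11, 4, "2008-12-07"),
        ],
    )
    conn.commit()
    return conn


if __name__ == "__main__":
    connection = build_demo_warehouse()
    query = "SELECT city, shipped_quantity FROM v_shipped_by_city ORDER BY city"
    for city, shipped in connection.execute(query):
        print(city, shipped)
```

Because the view is defined over the Tier 2 tables instead of copying them, reports read from the semantic layer while the integrated data is kept once, which is the point of the "single version" tier.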