Ch02 Conceptual Data warehouse design
Yang Haoran 11/2/2021 DataData warehouse
# Conceptual Data warehouse design
# Data warehouse design process

- Requirements analysis
- Conceptual design
- Logical design
- Physical design
# Multidimensional model:
从三个维度看销量,每个维度又可以细分

DataCube

- 一个n维的base cube叫做 base cuboid
- 最上面的0-D cuboid,叫做 apex cuboid,有最高的总结度,如总销量
- 图中的线称为lattice,构成了这个datacube
- 图中越往下越多维,越详细
- 在一个n维的base cube中一共有2的n次方个单元
# Basic Elements of a Conceptual Model:
- Fact data
- Attributes
- Qualities
- Dimensions
# Conformed Dimensions

# Dimensional fact model(DFM)
- basic elements of a fact schema: f=(M,A,N,R,O,S)
- quasi-tree
- Facts and measures
- Attributes and dimensions
- attributes 分为dimensional和non-dimensional
- 图中左边方框是一个Fact,例如sale
- Hierarchies

- Aggregation: 图中mj是method,di是dimension
- 图中如果sum操作不在可聚合的范围之内,那么就要连线(虚线),并且写上可聚合的操作
- 如果连线上什么也没写,那么就表示不能sum
- 如果只有sum操作,那么不需要连线
- 如果有包括sum的各种操作,那么在连线(虚线)上写上‘+’跟着其他聚合操作
- 图中如果sum操作不在可聚合的范围之内,那么就要连线(虚线),并且写上可聚合的操作
# StarER

# UML Profile for Multidimensional Modelling
- Three levels of detail
- Model definition

- Star schema definition


- Dimension definition fact definition
- Dimension definition
- Types of Classification Hierarchies
- Strict Hierarchy
- Non-strict Hierarchy
- Completeness for drill down
- Completeness for roll up

- Types of Classification Hierarchies
- Fact definition
- Degenerate fact-->辅助表示m:n关系
- Degenerate dimension:没有维度表的维度

- Dimension definition
- Model definition