Ch02 Conceptual Data warehouse design

11/2/2021 Data, Data warehouse

# Conceptual Data warehouse design

# Data warehouse design process

  • Requirements analysis
  • Conceptual design
  • Logical design
  • Physical design

# Multidimensional model:

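  • Sales are viewed along three dimensions, and each dimension can be broken down further.

  • Data cube

    • The n-dimensional base cube is called the base cuboid.
    • The 0-D cuboid at the top is called the apex cuboid; it has the highest level of summarization, e.g. total sales.
    • The cuboids and the lines connecting them in the figure form a lattice, which makes up the data cube.
    • The further down the figure, the more dimensions a cuboid has and the more detailed it is.
    • A cube with n dimensions has 2^n cuboids in total (ignoring hierarchy levels within each dimension).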
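
The lattice described above can be enumerated directly: every subset of the dimensions is one cuboid. This is a minimal sketch; the dimension names `time`, `item` and `location` are assumptions chosen for illustration and are not taken from the figure.

```python
from itertools import combinations

# Hypothetical dimensions of a sales cube (assumed names, not from the notes).
dimensions = ["time", "item", "location"]

def cuboids(dims):
    """Return every cuboid as the tuple of dimensions it groups by,
    ordered from the apex (0-D) down to the base cuboid (n-D)."""
    result = []
    for k in range(len(dims) + 1):
        result.extend(combinations(dims, k))
    return result

lattice = cuboids(dimensions)
for group_by in lattice:
    print(f"{len(group_by)}-D cuboid: {group_by}")

apex, base = lattice[0], lattice[-1]
```

With three dimensions the list holds 2^3 = 8 cuboids: the empty tuple is the apex cuboid (total sales) and the full tuple is the base cuboid.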

# Basic Elements of a Conceptual Model

  • Fact data
  • Attributes
  • Qualities
  • Dimensions

# Conformed Dimensions

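
A conformed dimension is usually described as a dimension that several facts share with the same keys and the same attribute values, so results from different facts can be lined up side by side. The sketch below illustrates that idea only; the `product_dim`, `sales_fact` and `shipment_fact` data are hypothetical and do not come from the original figure.

```python
# Hypothetical product dimension shared by both facts (the conformed dimension).
product_dim = {
    1: {"name": "pen", "category": "stationery"},
    2: {"name": "notebook", "category": "stationery"},
    3: {"name": "mug", "category": "kitchen"},
}

# Two facts from different business processes; both use the same product keys.
sales_fact = [
    {"product_id": 1, "quantity": 10},
    {"product_id": 2, "quantity": 4},
    {"product_id": 3, "quantity": 7},
]
shipment_fact = [
    {"product_id": 1, "shipped": 8},
    {"product_id": 3, "shipped": 7},
]

def total_by_category(fact_rows, measure):
    """Roll a measure up to the category level of the shared dimension."""
    totals = {}
    for row in fact_rows:
        category = product_dim[row["product_id"]]["category"]
        totals[category] = totals.get(category, 0) + row[measure]
    return totals

sold = total_by_category(sales_fact, "quantity")
shipped = total_by_category(shipment_fact, "shipped")

# Drill-across: align both results on the shared category values.
report = {c: (sold.get(c, 0), shipped.get(c, 0))
          for c in sorted(set(sold) | set(shipped))}
print(report)
```

Because both facts roll up through the same dimension, the category labels in `sold` and `shipped` are directly comparable.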

# Dimensional fact model (DFM)

  • basic elements of a fact schema: f=(M,A,N,R,O,S)
  • quasi-tree
  • Facts and measures
  • Attributes and dimensions
  • Attributes are either dimensional or non-dimensional
  • The box on the left of the figure is a fact, e.g. sale
  • Hierarchies
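  • Aggregation: in the figure, m_j is a measure and d_i is a dimension
    • If SUM is not among the operators that can aggregate m_j along d_i, connect the two with a dashed line and label it with the operators that are allowed
      • If nothing is written on the line, the measure cannot be summed along that dimension
    • If SUM is the only operator, no line is needed
    • If several operators including SUM apply, label the dashed line with '+' followed by the other aggregation operators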
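
The fact schema tuple and the aggregation rules above can be sketched as a small data structure. This assumes the common reading of f=(M,A,N,R,O,S): M measures, A dimensional attributes, N non-dimensional attributes, R the attribute pairs that form the quasi-tree, O the optional pairs within R, and S the aggregation statements. The encoding of S as a dictionary, and every name in the example schema (`level`, `incoming_qty`, `date` and so on), are hypothetical choices made for illustration.

```python
from dataclasses import dataclass, field

@dataclass
class FactSchema:
    measures: set              # M
    dim_attributes: set        # A
    non_dim_attributes: set    # N
    edges: set                 # R: (attribute, attribute) pairs of the quasi-tree
    optional_edges: set        # O: assumed to be a subset of R
    # S, encoded after the drawing rules: (measure, dimension) -> labelled operators.
    statements: dict = field(default_factory=dict)

    def allowed_operators(self, measure, dimension):
        # No entry = no dashed line drawn = the measure can be summed.
        return self.statements.get((measure, dimension), {"SUM"})

    def can_aggregate(self, measure, dimension, operator):
        return operator in self.allowed_operators(measure, dimension)

# Hypothetical inventory fact (assumed example, not from the notes).
inventory = FactSchema(
    measures={"level", "incoming_qty"},
    dim_attributes={"date", "month", "product", "warehouse"},
    non_dim_attributes={"warehouse_address"},
    edges={("date", "month"), ("warehouse", "warehouse_address")},
    optional_edges=set(),
    statements={
        ("level", "date"): {"AVG", "MIN"},  # dashed line labelled AVG, MIN
        ("level", "product"): set(),        # dashed line with no label
    },
)

print(inventory.can_aggregate("incoming_qty", "date", "SUM"))
print(inventory.can_aggregate("level", "date", "SUM"))
print(inventory.can_aggregate("level", "date", "AVG"))
```

A stock level is a typical case: adding up daily levels over time is meaningless, so SUM is excluded along `date` while AVG and MIN remain available.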

# StarER


# UML Profile for Multidimensional Modelling

  • Three levels of detail
    • Model definition
    • Star schema definition
    • Dimension definition and fact definition
      • Dimension definition
        • Types of Classification Hierarchies
          • Strict Hierarchy
          • Non-strict Hierarchy
          • Completeness for drill down
          • Completeness for roll up
      • Fact definition
        • Degenerate fact: an auxiliary construct that helps represent m:n relationships
        • Degenerate dimension: a dimension that has no dimension table
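
The difference between a strict and a non-strict classification hierarchy shows up when rolling up. A minimal sketch, assuming a strict hierarchy means every lower-level member has at most one parent; the product and category names are hypothetical.

```python
# Hypothetical product -> categories mappings (assumed data).
strict = {"pen": {"stationery"}, "mug": {"kitchen"}}
non_strict = {"pen": {"stationery"}, "mug": {"kitchen", "gifts"}}
sales = {"pen": 10, "mug": 7}

def is_strict(parents_of):
    """True if every lower-level member rolls up to at most one parent."""
    return all(len(parents) <= 1 for parents in parents_of.values())

def roll_up(amounts, parents_of):
    """Sum each member's amount into every parent it belongs to."""
    totals = {}
    for member, amount in amounts.items():
        for parent in parents_of[member]:
            totals[parent] = totals.get(parent, 0) + amount
    return totals

strict_totals = roll_up(sales, strict)
non_strict_totals = roll_up(sales, non_strict)

print(is_strict(strict), sum(strict_totals.values()))
print(is_strict(non_strict), sum(non_strict_totals.values()))
```

In the non-strict case the mug is counted under both of its categories, so the category totals add up to more than the actual sales total. This is why the hierarchy type needs to be stated in the model.
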
Last Updated: 11/19/2024, 1:54:38 PM