5/5/17

Dimensional data modelling approach

Data modeling is the process of building a database to meet the business requirement. The dimensional approach is one of the data modeling approaches to establishing a data mart or data warehouse database.
There are two types of schema building and it will be selected based on advantages and disadvantages between these two.

    1. Star Schema
    2. Snowflake schema
  1. Star Schema:
    This type schema contains the fact table in center position. As we know that fact table contains a reference to dimension tables.
    Then the fact table will be surrounded by dimension tables with foreign key reference. The dimension table will not have a reference with any other dimension.
    Because of this, whole structure looks like a star, this type of schema is called star schema.
    Star schema example
  2. Snowflake Schema
    This type also contains a fact table in center position. The fact table has a reference to dimension tables.
    The dimension table will have a reference to another dimension.
    The data will be stored in the more normalized form.
    Because of this, whole structure looks like a snow spread on the earth, it’s called Snowflake schema.
    In below example, the Project dimension having a reference with Role  dimension.
    Snowflake schema example
Difference Between Star Schema and Snowflake schema:
FeatureStar SchemaSnowflake schema
PerformanceAs there is no relationship between dimensions to other dimensions the performance will be high.Due to multiple links between dimensions the performance will be low.
Query complexityThe number of joins will be less which makes query complexity lowThe number of joins will be more which makes query complexity high
Database sizeConsider the Project dimension mentioned in above example it has Role column where the Role name value will be stored against for each project in case of start schema, the size of the table will be highThe role information is separately stored in a table and the reference will be linked in Project dimension, it reduces the table size
NormalizationData will be stored in de-normalized format in dimension tableData will be stored in more normalized format in dimension tables
How to Select Schema Type:
  1. These are the deciding factors to select the schema type if you worry about the size of the database, then go ahead with snowflake which gives normalized approach.
  2. Same time if you look for more performance then start with Star schema approach.

1 comment:

  1. Thanks for posting it.
    It is amazing sharing for us.

    ReplyDelete

Google Q&A Forum