logo
Data Warehousing Informatica - Interview Questions and Answers
What is a star schema and a snowflake schema?

When discussing data warehousing, the star schema and snowflake schema are two common ways to organize data for efficient analysis. Here's a breakdown of each:

Star Schema :

  • Structure:
    • The star schema is the simplest data warehouse schema.
    • It consists of a central fact table surrounded by dimension tables.
    • The fact table contains the quantitative data (measures), and the dimension tables contain the descriptive attributes.
    • The arrangement resembles a star, hence the name.
  • Characteristics:
    • Dimension tables are denormalized, meaning they may contain redundant data.
    • This denormalization simplifies queries and improves performance.
    • It is well-suited for simple and fast queries.
  • Advantages:
    • Simple to understand and implement.
    • Fast query performance.
    • Easy for users to navigate.
  • Disadvantages:
    • Potential for data redundancy.
    • May require more storage space.

Snowflake Schema :

  • Structure:
    • The snowflake schema is an extension of the star schema.
    • It normalizes the dimension tables, breaking them down into further sub-dimension tables.
    • This creates a more complex, hierarchical structure.
    • The resulting diagram resembles a snowflake.
  • Characteristics:
    • Dimension tables are normalized, reducing data redundancy.
    • This normalization can increase query complexity, as more joins may be required.
    • It is better suited for situations where data integrity and storage space are critical.
  • Advantages:
    • Reduced data redundancy.
    • Improved data integrity.
    • Efficient use of storage space.
  • Disadvantages:
    • Increased query complexity.
    • Potentially slower query performance due to more joins.

Key Differences Summarized :

  • Normalization:
    • Star schema: Denormalized dimension tables.
    • Snowflake schema: Normalized dimension tables.
  • Query Performance:
    • Star schema: Generally faster.
    • Snowflake schema: Potentially slower.
  • Complexity:
    • Star schema: Simpler.
    • Snowflake schema: More complex.
  • Storage Space:
    • Star schema: May require more space.
    • Snowflake schema: Uses less space.