Data Storage Systems
Data Lake
A place where you dump all forms of data of your business.
Structured / Un Structured / Semi-Structured.
Example
- Customer service chat logs, voice recordings, email, website comments, social media.
- Need a cheap way to store different types of data in large quantities.
- Data is not needed now, but planning to use it for later use.
- Larger organizations need all kinds of data to analyze and improve business.
Data Warehouse
- Data Warehouse - Stores already modeled/structured data and ready for use.
- Data from the Warehouse can be used for analyzing its operational data.
- There will be developers to support the data.
- It’s multi-purpose storage for different use cases.
Data Mart
A subset of Data Warehouse for a specific use case.
A specific group of users uses it, so it is more secure and performs better.
Example: Pandemic Analysis
Dependent Data Marts - constructed from an existing data warehouse.
Example: Grocery / School Supplies
Independent Data Marts - built from scratch and operated in silos.
Example: Mask / Glove Sales
Hybrid Data Marts - Mix and match both.