Data integration is often referred to as ETL (extract, transform, load), after its three core operations:
1. Extract:
Retrieving data from specified data sources.
2. Transform:
Adjusting the source data, as required, through rules, merges, lookup tables, or other conversion methods to align with the target.
3. Load:
Incorporating the resulting transformed data into a target database.
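The three operations above can be sketched as a small pipeline. This is a minimal illustration, not a production design: the inline CSV text, the `COUNTRY_NAMES` lookup table, the `customers` table, and the function names are all hypothetical, and an in-memory SQLite database stands in for the target.

```python
import csv
import io
import sqlite3

# Hypothetical source data; a real pipeline would read from a file, API, or database.
SOURCE_CSV = """id,name,country_code
1,Alice,us
2,Bob,de
3,Carol,fr
"""

# Hypothetical lookup table applied during the transform step.
COUNTRY_NAMES = {"US": "United States", "DE": "Germany", "FR": "France"}


def extract(text):
    """Extract: read rows from the source as dictionaries keyed by column name."""
    return list(csv.DictReader(io.StringIO(text)))


def transform(rows):
    """Transform: convert types and apply the lookup table to match the target schema."""
    return [
        (
            int(row["id"]),
            row["name"].strip(),
            COUNTRY_NAMES.get(row["country_code"].upper(), "Unknown"),
        )
        for row in rows
    ]


def load(conn, records):
    """Load: insert the transformed records into the target table."""
    conn.execute(
        "CREATE TABLE IF NOT EXISTS customers "
        "(id INTEGER PRIMARY KEY, name TEXT, country TEXT)"
    )
    conn.executemany("INSERT INTO customers VALUES (?, ?, ?)", records)
    conn.commit()


def run_etl():
    """Run extract, transform, and load in order against an in-memory target."""
    conn = sqlite3.connect(":memory:")
    load(conn, transform(extract(SOURCE_CSV)))
    return conn
```

Keeping each operation in its own function means the transform rules can be changed or tested without touching the source or target code.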
More recently, the term ELT (extract, load, transform) has gained prominence, reflecting that transformation does not have to occur before loading. This is particularly relevant for systems such as Snowflake, which support transformation during or after loading.
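As a rough sketch of the ELT ordering, the example below loads raw records into a staging table unchanged and then transforms them with SQL inside the target. It uses an in-memory SQLite database purely as a stand-in; it does not use Snowflake or any Snowflake API, and the table names, sample rows, and `run_elt` function are assumptions made for illustration.

```python
import sqlite3

# Hypothetical raw records, loaded as-is (all text) with no prior cleanup.
RAW_ROWS = [("1", " Alice ", "us"), ("2", "Bob", "de"), ("3", "Carol", "xx")]

# Hypothetical lookup data that lives in the target alongside the raw data.
COUNTRIES = [("US", "United States"), ("DE", "Germany")]


def run_elt():
    conn = sqlite3.connect(":memory:")

    # Load: copy the raw data into a staging table without transforming it.
    conn.execute("CREATE TABLE raw_customers (id TEXT, name TEXT, country_code TEXT)")
    conn.executemany("INSERT INTO raw_customers VALUES (?, ?, ?)", RAW_ROWS)

    conn.execute("CREATE TABLE countries (code TEXT PRIMARY KEY, name TEXT)")
    conn.executemany("INSERT INTO countries VALUES (?, ?)", COUNTRIES)

    # Transform: runs after loading, as SQL executed by the target itself.
    conn.execute(
        """
        CREATE TABLE customers AS
        SELECT CAST(r.id AS INTEGER)       AS id,
               TRIM(r.name)                AS name,
               COALESCE(c.name, 'Unknown') AS country
        FROM raw_customers AS r
        LEFT JOIN countries AS c ON c.code = UPPER(r.country_code)
        """
    )
    conn.commit()
    return conn
```

Because the raw staging table is kept, the transformation can be revised and rerun later without extracting the source data again.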
The scope of data integration has also broadened to cover related operations, including:
- Data preparation.
- Data migration, movement, and management.
- Data warehouse automation.