Semantic Layer vs. Data Catalog: Complementary, Not Competing
Explains the distinct roles of data catalogs and semantic layers in data architecture, arguing they are complementary tools.
Explains the distinct roles of data catalogs and semantic layers in data architecture, arguing they are complementary tools.
A comprehensive guide to data modeling, explaining its meaning, three abstraction levels, techniques, and importance for modern data systems.
A step-by-step guide to building a robust semantic layer for consistent data metrics, covering architecture, stakeholder alignment, and implementation.
Explains the difference between a metrics layer and a semantic layer in data architecture, clarifying their distinct roles and relationship.
Explores the shift from traditional pull queries to using materialized views and data duplication for better performance, format, and location in data systems.
Explores implementing a data mesh architecture using dbt, outlining how dbt Mesh projects can align with data mesh principles for large-scale organizations.
A monthly roundup of 78 curated links on data engineering, architecture, AI, and tech trends, with top picks highlighted.
A comprehensive guide to the data lakehouse architecture, its core components (Iceberg, Delta, Hudi, Paimon), and the surrounding ecosystem for modern data platforms.
Explores core principles of scalable data engineering, including parallelism, minimizing data movement, and designing adaptable pipelines for growing data volumes.
An introduction to data warehousing concepts, covering architecture, components, and performance optimization for analytical workloads.
Explains data lakes, their key characteristics, and how they differ from data warehouses in modern data architecture.
Explores the modern data stack, cloud platforms, and principles for building flexible, cloud-native data engineering architectures.
Explains the data lakehouse architecture, a unified approach combining data lake scalability with warehouse management features like ACID transactions.
Explores reimagining Apache Kafka for the cloud, proposing a diskless, partition-free design with key-centric streams and topic hierarchies.
Explores reimagining Apache Kafka as a cloud-native event log, proposing features like partitionless design, key-centric access, and topic hierarchies.
A monthly roundup of interesting links and articles about data engineering, databases, streaming tech, and data infrastructure.
A technical guide on designing and implementing a modern data lakehouse architecture using the Apache Iceberg table format in 2025.
Explores how combining data lakehouse, virtualization, and mesh architectures with Dremio solves modern data scaling and silo challenges.
Explains why data professionals should adopt Dremio and Apache Iceberg for flexible, high-performance data lakehouse architecture.
An introduction to data lakehouses, explaining what they are, why they're used, and how to migrate to this modern data architecture.