Blog

Insights from the Squish team

Technical articles, product updates, and insights about data engineering, database optimization, and building scalable data platforms.

Latest Articles

Learn about database relationship discovery, data cataloging, and modern data engineering practices.

Technical

You Do Not Need to Connect Every Database to Get Value from Squish

Discover relationships in your data warehouse without touching production databases. Start with Snowflake or BigQuery, expand to operational sources when ready.

February 1, 2026

7 min read

Technical

dbt Best Practices That Actually Matter in Production

Opinionated guidance on dbt project structure, staging conventions, naming, testing, and incremental models based on patterns that survive real production workloads.

February 1, 2026

8 min read

Technical

Your Database Has More Foreign Keys Than You Think

Production databases typically have 3-5x more implicit relationships than documented foreign keys. Here is where they hide and why they matter for your data team.

January 31, 2026

6 min read

Technical

How dbt Macros Actually Work Under the Hood

A practical look at Jinja compilation in dbt. What macros are, how ref() works internally, when to write your own, and when macros make things worse.

January 30, 2026

7 min read

Industry Insights

The Semantic Layer, Explained for People Who Actually Build Data Pipelines

A practical guide to semantic layers for data engineers. What they actually do, when they help, and when they are unnecessary overhead.

January 29, 2026

7 min read

Technical

How Semantic Layers Work on the Backend

A technical look at what semantic layers actually do under the hood. How MetricFlow and Cortex translate business questions into SQL, and what they need from your schema.

January 29, 2026

8 min read

Technical

Why Squish Never Touches Your Data

How metadata-only access works: information_schema queries, read-only users, and AES-256 encryption. No row data, no PII, no production risk.

January 28, 2026

6 min read

Technical

The dbt Relationships Test Is Not Enough

The built-in dbt relationships test catches broken foreign keys but misses implicit ones entirely. Here is how to close the gap.

January 26, 2026

5 min read

Industry Insights

Buying a Data Catalog in 2026: What Actually Matters

What actually matters when evaluating data catalogs: time to value, maintenance burden, and integration. Based on patterns we have seen across dozens of teams.

January 24, 2026

8 min read

Industry Insights

Why Your AI Agent Cannot Query Your Database (And How to Fix It)

AI agents struggle with databases because they lack schema context. Semantic layers and automated discovery solve this. Here is the practical path forward.

January 22, 2026

6 min read

Technical

How to Discover Hidden Relationships in Your Database

Systematic approaches to uncover implicit foreign keys and undocumented relationships. Automated discovery vs manual analysis for modern data stacks.

January 20, 2026

8 min read

Technical

Cross-Database Relationship Discovery: A Complete Guide

How to discover and document relationships across multiple databases, warehouses, and sources. Practical strategies for modern multi-database architectures.

January 20, 2026

10 min read

Industry Insights

Why Manual Data Cataloging Is Holding Your Team Back

Automated vs manual data cataloging: why spreadsheets cannot keep up with modern data volumes. How to transition to automated discovery and maintenance.

January 20, 2026

7 min read

Industry Insights

Data Contracts Need Relationship Context

Data contracts define what individual datasets promise. They rarely cover the relationships between datasets, and that gap matters.

January 6, 2026

7 min read

Technical

Why Your ERD Is Lying to You

Entity-relationship diagrams show what was documented, not what exists. The gap between the two grows over time, and most teams do not realize how wide it has become.

December 10, 2025

6 min read

Technical

Confidence Scoring for Database Relationships

Not all discovered relationships are equal. Here is how multiple signals combine into a confidence score that separates real relationships from false positives.

November 22, 2025

8 min read

Industry Insights

Why Data Lineage Tools Miss Relationships

Data lineage tracks where data flows. Relationship discovery tracks how data connects. They solve different problems, and you probably need both.

November 3, 2025

7 min read

Technical

ORMs and the Hidden Schema Problem

ORMs define relationships in application code that may never reach the database. Here is how Rails, Django, and SQLAlchemy each handle this differently.

October 8, 2025

8 min read

Industry Insights