Anthropic’s Automated Analytics 🔍, PostgreSQL 19 Beta 🐘, McKinney on Agentic Engineering 🛠️

Ground truth is a process, not a dataset (4 minute read)

Ground truth is a process, not a static dataset. For complex AI report fact-checking, Amazon's audit-then-score protocol lets AI challenge benchmark labels with evidence. A human auditor reviews disputes and updates the ground truth when warranted, lifting expert accuracy to 90.9%.

TLDR Data 2026-06-08

Anthropic’s Automated Analytics 🔍, PostgreSQL 19 Beta 🐘, McKinney on Agentic Engineering 🛠️

Deep Dives

How Anthropic enables self-service data analytics with Claude (5 minute read)

Dynamic Repartitioning for Time Series Workloads (11 minute read)

The Join-Aware Materialized View Query Rewrite Gap (4 minute read)

Opinions & Advice

Ground truth is a process, not a dataset (4 minute read)

Vibe Coding Is Dangerous, Agentic Engineering Isn't (15 minute read)

Structure vs. Concept (9 minute read)

Launches & Tools

Mozilla Data Collective - Your Models Are Only as Good as the Datasets You Train On (Sponsor)

What is Apache Arrow Flight? (8 minute read)

PostgreSQL 19 Beta 1 Released! (5 minute read)

Miscellaneous

A Deep Dive into Calibration of Language Models: Platt Scaling, Isotonic Regression, Temperature Scaling (7 minute read)

Your Obsidian Vault Can Now Run SQL (and Your Agent Can Read It) (5 minute read)

The Tableau Exodus Has Begun (4 minute read)

Quick Links

Broker-Visible vs Client-Local Parallelism (4 minute read)

The Basic Spark Concept Beginners Don't Know (3 minute read)

5 dbt mistakes I see in every startup (7 minute read)

Curated deep dives, tools and trends in big data, data science and data engineering 📊