Databricks’ Agent Orchestrator 🕹️, Ecosystems Beat Models 🔁, LinkedIn’s Search Brain 🔍

Linux Foundation Announces OpenSharing Project to Standardize AI Asset and Data Exchange (4 minute read)

Databricks has handed the Delta Sharing protocol over to the Linux Foundation. OpenSharing extends Delta Sharing to AI models, agent skills, and unstructured data across clouds and platforms. It adds standard APIs for discovery, authorization, and access, with support for existing Delta Sharing recipients plus Apache Iceberg/REST Catalog clients. The project aims to replace proprietary marketplaces with a single standard for enterprise AI asset distribution.

TLDR Data 2026-06-15

Databricks’ Agent Orchestrator 🕹️, Ecosystems Beat Models 🔁, LinkedIn’s Search Brain 🔍

Deep Dives

Encoding Your Domain Expert: The Context Layer Behind Spotify's Data Assistant (6 minute read)

How Feldera Works: A True Incremental View Maintenance Engine (3 minute read)

Semantic Search for AI Agents at Scale: Retrieval and Ranking for LinkedIn's Hiring Assistant (15 minute read)

Opinions & Advice

The Mythical Agent-Month (10 minute read)

The Bill Arrives: How to Manage Agentic AI Costs at Scale (17 minute read)

A frontier without an ecosystem is not stable (4 minute read)

Launches & Tools

Join renowned data strategist Doug Laney and Matia CEO Benjamin Segal for a discussion on the future of the data stack. (Sponsor)

Introducing Flights: Agent-Native Ingest in MotherDuck (4 minute read)

Introducing Omnigent: A Meta-Harness to Combine, Control, and Share Your Agents (7 minute read)

Apache DataFusion 54.0.0 Released (7 minute read)

Miscellaneous

Linux Foundation Announces OpenSharing Project to Standardize AI Asset and Data Exchange (4 minute read)

The Hidden Cost of ai_parse_document in Production (10 minute read)

Quick Links

New framework for auditing machine unlearning (6 minute read)

SQL to ER Diagram (Tool)

Feature Stores from Scratch: A Minimal Working Implementation (5 minute read)

Curated deep dives, tools and trends in big data, data science and data engineering 📊