3 posts tagged with "llm"

Stop Chatting With a Stranger — Make Your AI an Assistant That Knows Your Business

April 22, 2026 · 11 min read

Metadata Morph

AI & Data Engineering Team

Your AI assistant is only as useful as the context it can reach. Ask Claude to summarize your latest GitHub PR and it draws a blank — unless you've given it a way in. Ask ChatGPT to pull last week's Slack thread and it can't — unless it's connected.

Model Context Protocol (MCP) is the open standard that changes this. Originally developed by Anthropic and now adopted by OpenAI, Google DeepMind, and the broader AI ecosystem, it lets any compatible model connect to any compatible tool through a single, consistent interface. One protocol. Any model. Any tool.
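To make the "single, consistent interface" concrete: MCP messages are JSON-RPC 2.0, and a client invokes a tool with a `tools/call` request. Here is a minimal sketch of what such a request could look like on the wire. The tool name `get_pull_request` and its arguments are hypothetical, made up for illustration; a real server advertises its own tools and argument schemas.

```python
import json


def make_tool_call(request_id, tool_name, arguments):
    """Build a JSON-RPC 2.0 request that asks an MCP server to run a tool."""
    return {
        "jsonrpc": "2.0",
        "id": request_id,
        "method": "tools/call",
        "params": {"name": tool_name, "arguments": arguments},
    }


# Hypothetical tool and arguments, not taken from any real server.
request = make_tool_call(1, "get_pull_request", {"repo": "example/repo", "number": 1})
wire = json.dumps(request)
print(wire)
```

The request has the same shape whichever model sits on the client side and whichever tool sits on the server side, which is the point of the protocol.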

dbt Testing Strategies Before Feeding Data to LLMs: Preventing Garbage-In, Garbage-Out

November 27, 2025 · 5 min read

Metadata Morph

AI & Data Engineering Team

An AI agent is only as reliable as the data it reasons from. Feed it nulls, duplicates, or stale data and it will produce confident, coherent, and wrong answers — often without any obvious signal that something is off. The LLM doesn't know what it doesn't know.

dbt's testing framework is the right place to enforce data quality before data reaches your agents. This post covers a layered testing strategy that catches the most common failure modes before they become AI failures.
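As a rough illustration of the failure modes named above (nulls, duplicates, stale data), here is a plain-Python sketch of the kind of checks that dbt's built-in `not_null` and `unique` tests and its source freshness checks express declaratively. This is not dbt code. The function `check_rows`, the column names, and the sample rows are all assumptions made up for the example.

```python
from datetime import datetime, timedelta


def check_rows(rows, key, required, ts_field, now, max_age):
    """Return a list of human-readable data quality failures."""
    failures = []
    seen = set()
    for i, row in enumerate(rows):
        # Null check: every required column must have a value.
        for col in required:
            if row.get(col) is None:
                failures.append(f"row {i}: {col} is null")
        # Uniqueness check: the key must not repeat.
        k = row.get(key)
        if k is not None:
            if k in seen:
                failures.append(f"row {i}: duplicate {key}={k}")
            seen.add(k)
    # Freshness check: the newest timestamp must be recent enough.
    timestamps = [r[ts_field] for r in rows if r.get(ts_field) is not None]
    if not timestamps or now - max(timestamps) > max_age:
        failures.append(f"stale: newest {ts_field} is older than {max_age}")
    return failures


# Hypothetical rows: one duplicate key, one null amount, and old load times.
rows = [
    {"order_id": 1, "amount": 10.0, "loaded_at": datetime(2025, 11, 1)},
    {"order_id": 1, "amount": None, "loaded_at": datetime(2025, 11, 2)},
]
failures = check_rows(
    rows,
    key="order_id",
    required=["order_id", "amount"],
    ts_field="loaded_at",
    now=datetime(2025, 11, 27),
    max_age=timedelta(days=1),
)
for failure in failures:
    print(failure)
```

In a real project these rules would live in the dbt project itself, so that a failing check stops the data before an agent ever reads it.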

Building a RAG Pipeline on Your Existing Data Warehouse

November 6, 2025 · 6 min read

Metadata Morph

AI & Data Engineering Team

The most common failure mode in enterprise AI projects is asking an LLM questions about your business data and getting confidently wrong answers. The model doesn't know your revenue figures, your customer data, or your internal processes — it only knows what it was trained on.

Retrieval-Augmented Generation (RAG) fixes this by giving the model the relevant context at query time, retrieved from your actual data. The surprising part: you probably don't need new data infrastructure to do it. Your existing warehouse already has the data; you just need a retrieval layer on top.
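A minimal sketch of what a retrieval layer does, under heavy simplifying assumptions: the "warehouse" is an in-memory list of made-up rows, and the embedding is a toy bag-of-words counter standing in for a real embedding model. The names `embed`, `retrieve`, and `build_prompt` are hypothetical helpers for illustration only.

```python
import math
from collections import Counter


def embed(text):
    """Toy bag-of-words vector; a stand-in for a real embedding model."""
    return Counter(text.lower().split())


def cosine(a, b):
    """Cosine similarity between two sparse term-count vectors."""
    dot = sum(a[t] * b[t] for t in a if t in b)
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def retrieve(question, rows, k=2):
    """Return the k rows whose text is most similar to the question."""
    q = embed(question)
    ranked = sorted(rows, key=lambda r: cosine(q, embed(r["text"])), reverse=True)
    return ranked[:k]


def build_prompt(question, rows, k=2):
    """Put the retrieved rows in front of the question for the model."""
    context = "\n".join(f"- {r['text']}" for r in retrieve(question, rows, k))
    return f"Answer using only this context:\n{context}\n\nQuestion: {question}"


# Made-up rows standing in for text pulled from warehouse tables.
rows = [
    {"id": 1, "text": "q3 revenue was 4.2 million dollars"},
    {"id": 2, "text": "the support team handles refund requests"},
    {"id": 3, "text": "q2 revenue was 3.9 million dollars"},
]
top = retrieve("what was q3 revenue", rows, k=1)
print(build_prompt("what was q3 revenue", rows, k=1))
```

The model never sees the whole table, only the few rows that score highest against the question. Swapping the toy `embed` for a real embedding model and the list for a warehouse query changes the plumbing, not the shape of the idea.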