#observability

5 posts

ai deep-dive Jun 4, 2026

Agent Observability: From OTel Traces to Catching Hallucinations, Tool Misuse, and Infinite Loops

The industry has converged on using OpenTelemetry GenAI semantic conventions to turn every LLM call and tool call into a span. Detecting the three major failure modes then splits into three tracks: faithfulness + semantic entropy for hallucinations, framework-level symbolic guardrails for tool misuse, and max steps + action hash deduplication for infinite loops — all wired into a Final / Trajectory / Single-step three-layer evaluation framework.

#observability #ai-agent #tool-use #llm #opentelemetry

ai guide Apr 17, 2026

Lessons from the Trenches: What AI Native Teams Must Get Right

Not everyone should use a coding agent to modify code directly. AI Native teams need interface specs, test-first development, monorepo, security guardrails, human-in-the-loop, and token budget controls. Building an agent platform layer on top of coding agents and clearly redefining developer roles is the right path forward.

#ai-native #coding-agent #spec-driven-development #monorepo #ci-cd #code-review #agent-platform #security #observability #git-worktree #adr #human-in-the-loop #cost-management #model-selection #developer-role #failure-handling

ai guide Mar 26, 2026

Langfuse Complete Guide: LLM Application Observability from Scratch

Langfuse is currently the most mature open-source LLM Observability platform. This post covers four core capabilities — Tracing, Prompt Management, Evaluation, and Datasets — showing you how to use them in real projects.

#langfuse #observability #tracing #llm #prompt-management #evaluation #monitoring

ai guide Mar 12, 2026

RAG Observability Tool Landscape: Choices in 2026

Rolling your own traces is good enough, but open-source tools save you a lot of work. Langfuse, Phoenix, and LangSmith each have their niche — the right choice depends on your trade-offs around self-hosting, open source, and integration complexity.

#rag #observability #langfuse #phoenix #langsmith #tracing #monitoring

ai guide Mar 12, 2026

RAG Observability: 17-Step Tracing to Turn the Black Box Transparent

The hardest part of a RAG system isn't building it — it's figuring out why a particular answer went wrong. Pipeline Tracing records every step's decisions and data so debugging has a clear trail to follow.

#rag #observability #tracing #debugging #pipeline #monitoring