#semantic-cache — quidproquo

ai guide Apr 3, 2026

AI Agent Caching Goes Beyond One Layer: From Claude Code's 18 Cache Types to Multi-Layer ReAct Agent Design

After dissecting Claude Code's 18+ caching mechanisms, I found that you can't touch provider-level prompt cache, but embedding cache, tool result cache, and entity cache are not only within your reach — they deliver even better results. Includes a complete AgentCache interface design and per-tool TTL strategy.

#react-agent #cache #prompt-cache #semantic-cache #claude-code #cloudflare-kv #llm-cost-optimization

ai guide Mar 12, 2026

Semantic Caching: Run the RAG Pipeline Only Once for Semantically Similar Queries

Caching doesn't have to match exact query strings -- semantically similar questions can hit the cache too, skipping the entire RAG pipeline execution.

#rag #semantic-cache #caching #vector-search #performance