#reasoning

6 篇文章

ai deep-dive 2026年6月4日

Agent 的資源理性：在 token、工具呼叫、延遲之間做最優決策

資源受限下的 agent 決策是 bounded rationality 的復活：Rational Metareasoning 用 VOC 獎勵省 20–37% token、BATS 證明沒有 budget awareness 加預算也沒用、FrugalGPT cascade 最高省 98% 成本、Speculative Actions 降 20% 延遲。三約束最後收斂成一條 Pareto 曲線，主線是「從人手調旋鈕走向模型自己做資源理性決策」。

#ai-agent #reasoning #test-time-compute #llm #cost-optimization

ai deep-dive 2026年6月4日

Machine Theory of Mind：Agent 如何推斷其他 agent 的意圖、知識與目標

從觀察行為反推他者的信念/目標/意圖，學界叫 Machine Theory of Mind。三條血脈：符號 BDI、貝氏逆向規劃、深度學習 ToMnet。LLM 時代最大爭議是 ToMBench 上 GPT-4 仍落後人類 >10 分——高分到底是真推理還是統計捷徑。

#theory-of-mind #multi-agent #ai-agent #llm #reasoning

ai guide 2026年3月28日

OpenClaw 工具篇（三）：Exec 工具、Thinking 層級與 Slash Commands

Exec 支援前景/背景/PTY 執行 + 三種安全等級（deny/allowlist/full），Thinking 有 7 個層級（off 到 adaptive），Slash Commands 分指令和 directive 兩類。

#openclaw #exec #thinking #slash-commands #fast-mode #verbose #reasoning

ai project 2026年3月26日

Kimi：月之暗面的長文本 AI 模型，憑什麼挑戰 GPT 和 Claude？

Kimi 是中國 AI 新創月之暗面（Moonshot AI）推出的大型語言模型，以超長 context window、開源策略和極具競爭力的定價聞名。從 2023 年的 200K context 到 2026 年的 K2.5 Agent Swarm，Kimi 已成為全球 AI 市場不可忽視的力量。

#kimi #moonshot-ai #llm #long-context #reasoning #月之暗面 #ai-model #moe #open-source

ai guide 2026年3月17日

AI Agent 的三個核心支柱：Context、Cognition、Action

AI Agent 不是黑盒子——它由三層構成：知道什麼（Context）、怎麼想（Cognition）、能做什麼（Action）。搞清楚這三層，才能理解 agent 為什麼有時聰明、有時失控，以及怎麼設計一個真正好用的 agent 系統。

#ai-agent #context-engineering #llm #reasoning #ReAct #agentic-ai #memory #mcp

ai guide 2026年3月12日

Plan-and-Execute：先規劃再執行的 RAG 模式

對複雜問題，先讓 LLM 規劃出需要哪些資訊、分幾步取得，再按計畫執行，比邊搜邊想更系統化。

#rag #plan-execute #agentic #multi-step #reasoning